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Welcome  Remarks 


Since  its  inception  in  1913,  when  radio  techniques  were  primitive  and  largely  dependent  on  the 
ingenuity  of  its  pioneers,  URSI,  in  accordance  with  the  initial  stimulus  has  been  promoting 
international  cooperation  while  expanding  and  deepening  its  field  of  activities.  Some  of  the 
research  it  promoted  opened  up  new  fields  of  knowledge  in  other  disciplines,  such  as  metrology, 
solar  physics,  meteorology,  astronomy  and  space  science.  With  the  increasing  inter-disciplinary 
nature  of  science,  it  is  indeed  almost  inevitable  that  the  free  pursuit  of  basic  radio  research  should 
lead  the  Union  into  new  fields  of  endeavor.  At  the  present,  the  ten  scientific  Commissions  of 
URSI  cover  the  following  broad  areas:  electromagnetic  metrology;  fields  and  waves;  signals  and 
systems;  electronics  and  photonics;  electromagnetic  noise  and  interference;  wave  propagation 
and  teledection;  ionospheric  radio  and  propagation;  waves  in  plasmas;  radioastronomy  and 
electromagnetics  in  biology  and  medicine.  The  primary  objective  of  the  Union  remains:  to 
stimulate  and  coordinate,  on  an  international  basis,  studies  in  the  field  of  radio 
telecommunications  and  electronic  sciences.  New  scientific  knowledge  arising  from  these  studies 
often  lead  to  engineering  efforts,  the  consequences  of  which  are  far-reaching  for  our  society. 
Today,  on  the  other  hand,  the  time  between  discovery  and  its  practical  application  is  getting 
shorter.  With  this  in  mind,  a  special  1987  conference  of  URSI  officers  and  outside  experts, 
expressed  the  opinion  that  it  would  be  desirable  to  put  greater  emphasis  on  telecommunications, 
and  hence.  Commission  C  on  Signal  and  Systems  and  Commission  D  on  Electronics  and 
Photonics  would  have  a  major  roles  to  play.  Each  Commission,  though  in  its  own  field  of 
activity,  has  in  common  full  participation  in  the  world-wide  explosion  we  see  today  in 
telecommunications  technology.  And  each  have  utilized  new  knowledge,  inventions  and 
discoveries  in  the  radio  and  electronic  technologies  often  which  crosses  the  perceived  boundaries 
of  the  Commissions.  In  accordance  with  the  recommendations  made  to  the  URSI  General 
Assembly  in  Tel  Aviv  later  in  1987,  it  was  decided  to  initiate  a  series  of  triennial  international 
symposia  with  the  aim  of  covering  all  the  fields  of  telecommunications,  in  particular  the 
activities  of  Commissions  C  and  D  and  of  promoting  the  exchange  of  experience  and  results 
between  scientists  and  engineers  working  in  these  multi-disciplinary  areas.  The  first  URSI 
International  Symposium  on  Signals  Systems  and  Electronics  (ISSSE)  in  the  series  was  held  in 
Erlangen,  Germany  in  1989.  The  second  Symposium,  ISSSE'92,  was  held  in  Paris  at  the 
Conservatoire  National  des  Arts  et  M'etiers  (CNAM)  in  1992. 

The  third  URSI  International  Symposium  on  Signals,  Systems  and  Electronics  (ISSSE'95) 
will  be  held  in  San  Francisco,  California,  USA  at  the  Parc  Fifty  Five  Hotel.  For  this  Symposium 
we  have  the  pleasure  of  co-sponsorship  with  the  Institute  of  Electrical  and  Electronic  Engineers 
(IEEE)  Microwave  Theory  and  Techniques  Society,  Army  Research  Office  and  Office  of  Naval 
Research  as  well  as  cooperative  sponsorship  of  the  IEEE  EDS,  LEOS,  COM,  and  C&S  Societies. 
In  response  to  the  call  for  papers,  many  high  quality  papers  were  submitted.  Through  the  diligent 
work  of  the  Technical  Program  Committee  a  well  balanced  technical  program  of  reasonable  size, 
covering  all  the  research  areas  of  major  interest  has  been  developed.  In  addition  to  the  regular 
sessions  including  both  invited  and  contributed  papers,  a  poster  session  is  organized.  The 
symposium  invited  three  world  renowned  experts  for  keynote  presentations. 
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As  chairman  of  the  Symposium,  I  would  like  to  take  this  opportunity  to  express  our  thanks 
to  the  all  authors  who  have  responded  to  the  Call  for  Papers  and  to  the  Technical  Program 
Committee  who  selected  the  papers  and  established  the  program.  In  addition  I  would  like  to 
thank  all  members  of  the  Steering  Committee  for  their  untiring  efforts  to  make  this  Symposium  a 
success.  The  technical  contents  of  the  Symposium  cover  wide  ranges  of  interest  on:  Signal  and 
Information  Theory;  Signal  Processing  and  System  Theory;  Communications  Systems; 
Electronics;  and  Photonics.  Several  sessions  cover  topics  of  interest  spanning  both  Commissions 
C  and  D.  We  do  hope  that  the  3rd  International  Symposium  on  Signals,  Systems  and  Electronics 
will  attract  a  great  number  of  participants  and  we  are  looking  forward  to  welcoming  them  in  San 
Francisco  in  October. 


James  W.  Mink 

Chairman  of  the  Steering  Committee 
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W.  C.  B.  Peatman,  M.  Hurt,  R.  Tsai,  T.  Ytterdal,  H.  Park,  J.  Gonzales,  M.  Shur,  Univ.of  Virginia,  USA . 163 

WB3-2:  Analysis  of  The  Hot  Electron  Regime  of  Operation  in  Heterostructure  Field  Effect  Transistor 

E.  Martinez,  M.  Shur+,  F.  Schuermeyer,  Wright  Lab.,  USA  (+  Virginia  Univ.,  USA) . . 167 

WB3-3:  60  GHz  MMIC  Mixer  Using  A  Dual-Gate  PM  HEMT 

R.  Allam,  C.  Kolanowski,  D.  Langrez,  P.  Bourne,  J.  C.  De  Jaeger,  Y.  Crosnier,  G.  Salmer,  Institut 
d’Electronique  et  de  Microelectronique  du  Nord,  France . 171 

WB3-4:  Low-Frequency  Noise  Performances  of  Si02/InP  Metal-Insulator  -Semiconductor  Field-Effect 
Transistor 

Y.  K.  Su,  S.  C.  Shei,  K.  C.  Gan,  M.  Yokoyama,  National  Cheng  Rung  University,  Taiwan,  R.  O.C . 175 

SESSION  WC3:  Detection  and  Estimation  II 

Convener  :  E.  Biglieri,  Inst,  of  Elect,  and  Telecom.  Poly.,  Italy 

WC3-1:  A  New  Form  of  The  Fourier  Transform  for  Time-Varying  Frequency  Estimation 

V.  Katkovnik,  Univ.  of  South  Africa,  South  Africa . 179 

WC3-2:  Estimation  of  The  Surface  Reflectivity  of  Sar  Images  Based  on  a  Marked  Poisson  Point  Process 
Model 

J.  S.  Daba,  M.  R.  Bell,  Univ.  of  Tech.  Sydney,  Australia . 183 

WC3-3:  An  Improved  Direct  Inverse  Problem  Solver  for  Fractal  Interpolation  Functions  with  Application 
to  Signal  Compression 

J.  Wen,X.  Zhu,  Tsinghua  Univ.,  China . 187 

WC3-4:  Optimum  CFAR  Detection  Against  Correlated  K-Distributed  Clutter 

T.  Bucciarelli,  P.  Lombardo,  S.  Tamburrini ,  Univ.  of  Rome,  Italy . 191 

SESSION  WD3:  Applications  of  DSP  to  Telecommunications 

Convener:  P.  Pirsch,  Univ.  of  Hannover,  Germany 

WD3  Tutorial:  A  DSP  Perspective  of  Sigma-Delta  Techniques  with  Applications  in  Telecommunications 

M.  Bellanger,  CNAM,  France . . 195 

WD3-1:  Adaptive  Change  of  Code  Rate  in  DS-SSMA  Systems 

Y.  Ryu,  S.  C.  Kim,  1.  Song,  S.  Park,  H.  M.  Kim,  KAIST,  Korea . 199 

WD3-2:  Modified  Delay-Locked  Loop  for  Fading  Channels 

A.  Wilde,  DLR  Oberpfaffenhofen,  Germany . 203 

WD3-3:  Railway  Cab  Signaling  by  Track  Circuits  and  Spread  Spectrum  Modulations 

E.  Costamagna,  P.F.  Pellegrini,  Univ.  of  Pavia,  Italy . 207 

*  Papers  not  available  at  the  time  of  printing. 
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SESSION  ThAl:  Mixed  Mode  Circuit  Technology 

Convener:  F.  Schuermeyer,  Wright  Lab.,  USA 


*ThAl-T  Tutorial:  Compound  Semiconductor  Technology  for  Mixed  Mode  Circuit  Applications 

P.  Greiling,  Hughes,  USA . 

ThAl-1:  Microwave  Modeling  of  A  Photodetector-Amplifier  Used  in  Optical  Communication  Systems 

V.  Anand,  University  College,  London,  UK. . 211 

ThAl-2  Invited:  On  The  Applicability  on  The  Transimpedance  Amplifier  Concept  for  40Gb/s 
Optoelecetronic  Receivers  Based  on  InAIAs/InGaAs  Heterostructure  Fieldeffect  Transistors 

R.  M.  Bertenburg,  G.  Janssen,  S.  Waasen  Van,  R.  Reuter,  F.  J.  Tegude,  Gerhard-Mercator-Univ. ,  Germany  ...215 

*ThAl-3:  Info-Theoretic  Approach  to  ADC  Circuits 

A.  M.  Dighe,  Visvesvaray  Regional  Coll,  of  Eng.,  India . * 

ThAl -4:  High  Electron  Mobility  Transistor  as  Electronic  Flute 

—M.  1.  Dyakonov.  M  Shur,  Univ.  of  Virginia,  USA . 219 

ThAl -5:  An  Algorithm  for  Quaternary  ADCs 

A.  S.  Gandhi,  A.  M.  Dighe,  VRCE,  India . 223 


SESSION  ThBl:  Microwave  Systems  Components 

Convener:  J.  Henaff,  CNET,  France 
Chairman:  O.  Lloas,  LAAS-CNRS,  France 


ThBl-T  Tutorial:  Time  Domain  Identification  of  Non-Linear  Systems 

M.  1.  Sobhy,  E.  A.  Hosny,  M.  W.  R.  Ng,  A.  Bakkar  E-S,  Univ.  of  Kent  at  Canterbury,  UK . 227 

ThBl-1:  A  Computer-Aided  Impedance  Characterization  for  Microwave  Resonant-Cap  Circuits 

S.  Kar,  Inst,  of  Radio  Phys.  &  Elec,  India . 231 

ThBl-2:  New  Current  Mode  Universal  Filter 

R  .  Nandi,  Jadavpur  Univ.,  India . 235 

ThBl -3:  A  New  Type  CPW-Resonator  Bandpass  Filter  with  Widley  Suppression  Characteristics  of 
Harmonic  Resonance  Responses 

Y.  Noguchi,  K.  Wada,  T.  Nakamine,N.  Okamoto,  Kinki  Univ.,  Japan . 237 

ThBl -4:  A  Linear  Sinewave  voo  Design  and  its  Digital  Control 
S.  K.  Sanyal,  R.  Nandi,  S.  K.  Debroy,  D.  Pal,  Jadavpur  Univ.  India . 241 

*ThBl-5:  Optimum  Large-Signal  Performance  of  Microwave  Field  Effect  Transistors 

M.  C.  E.  Yagoub,  H.  Baudrand,  E.N.S.E.E.I.H.T.,  France . 


SESSION  ThCl:  Time-Frequency  Analysis 

Convener:  H.  Ling,  Univ.  of  Texas,  Austin,  USA 


ThCl  Tutorial:  Joint  Time-Frequency  Analysis  of  Electromagnetic  Backscattered  Data 

H.  Ling,  Univ.  of  Texas,  Austin,  USA . 243 

*ThCl-l  Invited:  Phase-Space  Processing  of  Electromagnetic  Scattering  Data 

L.  Carin,  NY  Poly  Tech,  USA  . . * 

The  1-2:  Region-of-Interest  Tomography  Using  Biological  ZImage  Perception  Concepts 

A.  Moghaddar,  Ohio  State  Univ.,  USA  ( withdrawn ) . * 

ThCl-3:  Joint  Time-Frequency  ISAR  Algorithm  for  Imaging  Targets  with  Non-Point  Scattering  Features 

L.  C.  Trintinalia,  H.  Ling,  Univ.  of  Texas,  Austin,  USA . 247 
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The  1-4:  Reconstruction  of  ISAR  Image  by  Time-Frequency  Distribution  Series 

V.  C.  Chen,  S.  Qian,  National  Instruments,  USA . 251 

SESSION  ThDl:  Personal  and  Mobil  Communications 

Convener:  A.  Acampora,  UCSD,  USA 

ThDl  Tutorial:  An  Adaptive  High-Speed  Wireless  Transceiver  for  Personal  Communications 

G.  Pottie,  UCLA,  USA . 255 

ThDl-1:  Homodyne  Receiver  Technology  for  Small  and  Low-power  Consumption  Mobile  Communications 
Equipment 

M.  Hasegawa,  T.  Fukagawa,  M.  Mimura,  M.  Makimoto,  Matsushita ,  Japan . 259 

ThDl -2:  System  Design  for  Continuous  Cell  Coverage  in  High  Density  Microcelluar  Networks 

B.  C.  Jones,  DJ.  Skellern,  Macquarie  Univ.,  Australia . 263 

ThDl-3:  The  Effect  of  Receiver  Collisions  on  Throughput  Performance  of  Multichannel  Multiaccess 
Protocols 

I.  E.  Pountourakis,  National  Technical.  Univ.  of  Athens,  Greece . 267 

SESSION  ThA2:  New  Device  Technologies  for  Future  DSP  Systems 

Convener:  E.  Martinez,  Wright  Lab.,  USA 

ThA2  Tutorial:  Silicon-on-Insulator  (SOI)  Technology  for  Low  Power  Electronics 

Z  Lemnios,  ARPA,  USA . 271 

ThA2-l  Invited:  Complementary  GaAs  Technology  and  Applications 

J.  Hallmark,  J.  Abrokwah,  B.  Bernhardt,  B.  Crawforth,  D.  Foster,  J-H.  Huang,  M.  LaMacchia,  R.  Lucero,  B. 

Mathes,  B.  Ooms,  Motorola,  USA . 275 

ThA2-2  Invited:  Manufacturable  GaAs  BiFET  Technology  for  High  Speed  Digital  Processing 

F.  Chang,  Rockwell,  USA . 279 

*ThA2-3  Invited:  InP  HBT  Technology  for  High  Speed  Signal  Processing  in  Electronic 

Y.  K.  Chen,  AT&T  Bell  Labs,  USA .  * 

*ThA2-4  Invited:  Title  will  be  announced  at  the  conference 

A.  Old,  TRW,  USA . .  * 

SESSION  ThB2:  Photonics:  Devices,  Links  and  Applications 

Convener:  C.  Cox,  MIT  Lincoln  Lab.  USA 

ThB2  Tutorial:  Techniques  and  Applications  of  Analog  Fiber  Optic  Links 

C.  Cox,  MIT  Lincoln  Lab.  USA . 283 

ThB2-l  :  Novel  Photonic  Oscillator  for  Signal  Generation  and  Distribution 

L.  Maleki,  JPL,  USA .  * 

ThB2-2:  Photonic  Systems  for  Antenna  Control 

M.  VanBlaricam,  C.  Swann,  Toyon,  USA . 287 

*ThB2-3:  External  Modulators  for  Cable  TV  Distribution  Applications 

G.  Gopalakrishnan,  General  Instrument,  USA . * 

*ThB2-4:  Insertion  of  Photonic  Technology  in  RF  Phased-Array  Antennas 

L.  Lembo,  TRW,  USA .  * 

*  Papers  not  available  at  the  time  of  printing. 
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SESSION  ThC2:  Optical  Communication  Networks 

Convener;  E,  Yablonovitch,  UCLA,  USA 


*ThC2  Tutorial:  A  Model  Architecture  for  A  Management-Scaleable  All-Optical  Telecommunications 
Infrastructure 

A.  Acampora,  UCSD,USA .  * 

ThC2-l:  Performance  Analysis  of  Optical  Fiber  TDMA  Systems 

J-G  Zhang,  Asian  Institute  of  Technology,  Thailand . 291 

ThC2-2:  An  Error  Floor  Reduction  Scheme  for  DPSK  Receivers  Impaired  by  Phase  Noise 

K.  Kiasaleh,  Univ.  of  Texas,  Dallas,  USA . 295 

*ThC2-3:  On  The  Placement  of  Optical  Amplifiers  in  Digital  Links 

H.  Waldman,  R.  N.  Ribeiro,  A.  A.  Surerus,  DECOM/FEE/UNICAMP,  Brazil . * 

ThC2-4:  Multi-Wavelength,  Optical  Code-Division-Multiplexing  Based  on  Passive,  Linear,  Unitary,  Filters 

C  F.  Lam,  E.  Yablonovitvh,  UCLA,  USA . 299 


SESSION  ThD2:  Image  Processing 

Convener:  W.  Sander,  ARO,  USA 


ThD2  Tutorial:  Application  of  Antenna  Array  Processing  to  Problems  in  Image  Understanding 

T  Kailath,  H.  Aghajan,  Stanford  Univ.,  USA . 303 

ThD2-l :  Phase  Retrieval  Algorithm  Based  on  Maximum  Entropy  Method 

A.  Bajkova,  Institute  of  Applied  Astronomy,  Russia . 307 

ThD2-2:  A  Method  to  Decrease  Nonlinear  Image  Reconstruction  Distortions  Caused  by  Measurement 
Errors 

Bajkova  A,  Institute  of  Applied  Astronomy,  Russia . 31 1 

ThD2-3:  Synthesis  of  Data-Dependent  Filters  for  Digital  Image  Processing  and  Recognition 

A.  V.  Kovalenko,  V.  N.  Kurashov,  N.  G.  Nakhodkin,  Taras  Shevchenko  Univ.,  Russia . 315 


SESSION  ThA3:  Application  of  Signal  Processing  Techniques  for  Efficient  Device  Modeling  and 
Characterization 

Convener:  E.  K.  Miller,  Los  Almos  Lab.,  USA 
Chairman:  T.  Sarkar,  Syracuse  Univ.,  USA 


ThA3  Tutorial:  The  Principle  of  Analytic  Continuation  -  How  To  Use  It? 

R.  S.  Adve,  T.  K.  Sarkar,  S.  M.  Rao,  E.  K.  Miller,  Syracuse  Univ.  USA . 319 

ThA3-l:  Time-Domain  Signal  Restoration  and  Parameter  Reconstruction  on  An  LCRG  Transmission  Line 

J.  Lundstedt,  S.  Stroem,  S.  He,  Royal  Inst,  of  Tech.  Sweden . 323 

ThA3-2:  On  Incorporating  Signal  Processing  and  Neutral  Network  Techniques  with  The  FDTD  Method  for 
Solving  Electromagnetic  Problems 

J.  Litva,  E.  Navarro,  McMaster  Univ.,  Canada . 327 

ThA3”3:  Simulated  Images  for  Wide  Band  Appellation,  An  Alternative  to  Complex  Images  of  Multilayer 
Media 

Y.  L.  Chow,  A.  Torabian-Esfahani,  N.  Hojjat,  S.  Safavi-Naeini,  Univ.  of  Waterloo,  Canada . 331 

ThA3-4:  Interpolation/Extrapolation  of  Frequency  Domain  Responses  Using  The  Hilbert  Transform 

S.  M.  Narayana,  G.  S.  Rao,  R.  S.  Adve,  T.  K.  Sarkar,  M.  Wicks,  E.  K.  Miller,  Syracuse  Univ.  USA . 335 
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SESSION  ThB3:  Microwave  Systems  Components 

Convener:  E.  Kollberg,  Chalmers  Univ.  Sweden 


ThB3  Tutorial:  Cryogenically  Cooled  HEMT’s  From  The  Device  Towards  The  Applications 

O.  Llopis,  L.  Escotte,  LAAS  CNRS,  France . 339 

ThB3-l:  High  Temperature  Microwave  Modeling  and  Circuit  Design  with  MESFET’s  and  HBT’s 

V.  Krozer  V,  M.  Schuessler,  K.  Fricke,  W.  Y.  Lee ,  H.  L.  Hartnagel,  Tech.  Univ.  Darmstadt,  Germany . 343 

ThB3-2:  AIGaAs/GaAs  and  Galn/GaAs  HBT  for  High  Temperature  Microwave  Operation 

M.  Schuessler,  V.  Krozer,  W.  Y.  Lee,  H.  L.  Hartnagel,  Tech.  Univ.  Darmstadt,  Germany . 347 

ThB3-3:  Heteroj unction  Bipolar  Phototransistors  for  Analog  Microwave  Photonic  Systems 

A.  Paolella,  L.  E.  M.  deBarro,  P.  Herczfeld,  P.  Enquist,  W.  VanMeerbeke,  US  Army  Research  Lab.,  USA . 351 

ThB3-4:  Modeling  for  Dispersion  of  Shielded  Microstrip  Linr  Using  SLR  Technique 

A.  K.  Verma,  R.  Kumar,  Univ.  of  Delhi,  India . 355 

SESSION  ThC3:  Adaptive  Algorithms  and  Systems 

Convener  :  E.  Deprettere,  Tech.  Univ.  of  Delft,  Netherland 

ThC3  Tutorial:  Adaptive  Asymptotic  Optimal  Algorithms  for  Detecting  Signals  in  Autoregressive  Noise 

B.  B.  Shishkov,  Y.  P.  Georgiev,  SN.  Stoyanor,  Tech.  Univ.  of  Sofia,  Bulgaria . 359 

ThC3-l:  A  Fuzzy-Rule-Based  Phase  Error  Detector 

F.  Daffara,  Phillips,  France . 363 

*ThC3-2:  An  Adaptive  Spatial  Processing  Algorithm  Using  Special  Parametric  Form  of  Reference  Signals 

P.  V.  Gorev,  Radiophys.  Research  Inst. ,  Novgorod,  Russia .  * 

*ThC3-3:  Fast  Switching  Frequency  Synthesizer  Employing  Adaptive  Phase  Locked  Loop  Operating  in  The 
Pseudo  Linear  Region 

SR.  Al-Araji,  A.  J.  Al-Dweik,  M.  S.  AbuRajab,  Univ.  of  Yarmouk,  Jordan .  * 

SESSION  ThD3:  Recent  Advances  in  Speech  Processing 

Convener:  A.  Alwan,  UCLA,  USA 

ThD3  Tutorial:  Speech  Production  and  Perception  Models  and  their  Applications  to  Synthesis,  Recognition, 
and  Coding 

A.  Alwan,  S.  Narayanan,  B.  Strope  and  A.  Shen,  UCLA,  USA . 367 

ThD3-l  Invited:  Microphone  Arrays  for  Speech  Processing 

E-E.  Jan,  J.  L.  Flanagan,  Rutgers  Univ.,  USA . 373 

ThD3-2  Invited:  Vocal-Tract  Acoustics  and  Speech  Synthesis 

S.  Maeda,  CNRS,  France . . ^77 

ThD3-3  Invited:  Digital  Signal  Processing  for  An  Ear  Level  Binaural  Hearing  Aid 

S.  Soli,  House  Ear  Institute,  USA . 

ThD3-3  Invited:  Recent  Advances  and  Current  Challenges  in  Speech  Recognition 
P.  Price,  SRI,  USA .  * 

*  Papers  not  available  at  the  time  of  printing. 
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SESSION  FAl:  Modeling  and  Simulation  I 

Convener:  M.  Stroscio,  ARO,  USA 


*FA1  Tutorial:  Confined  Phonons 

M  Stroscio,  ARO,  USA . * 

FAl-1:  Topological  Pulse  Modulation  of  Field  and  New  Microwave  Circuits  Design  for  Superspeed 
Operating  Computers 

G.  A.  Kuzaev,  V.  /.  Gvozdev,  Moscow  Inst.  ofEle.  and  Math,,  Russia . 383 

*FA1“2:  Modeling  of  Improvement  Self-Protected  Microwave  Detector 

L.  M.  Drozdovskaia,  Special  Research  Bureau  of  Moscow,  Russia . * 

*FAl-3:  Optimization  Criteria  of  A  Microwave  Generator  Circuit  by  An  Appropriate  Choice  of  The  E.F. 
Transmission 

G.  Alquie,  S.  Aidel,  A.  Said,  Univ.  of  Constantine,  Algeria . * 

FAl-4:  Low-power  PSA  transistors  for  Microwave  Communication  Systems:  Comparison  Among 
Bias-Dependent  Models  Extracted  form  Measured  Noise  and  Scattering  Parameters 

A.  Caddemi,  M.  Sannino,  Univ.  of  Palermo,  Italy . 385 

SESSION  FBI:  Modeling  and  Simulation  II 

Convener:  M.  Shur,  Univ.  of  Virginia,  USA 

FBI  Tutorial:  Recent  Advances  in  Time-Domain  Simulation 

B.  Houshmand,  T.  Itoh,  UCLA,  USA . 389 

FBl-1  Solution  of  EM  Fields  by  Asymptotic  Waveform  Techniques: 

M.  Li,  Q.  J.  Zhang,  M.  S.  Nakha,  Carleton  Univ.,  Canada . . 393 

FBI -2:  Design  and  Simulation  of  Microwave  Filter 

A.  M.  Nassar,  A.  M.  Moselhy,  M.  A.  EUGazzar,  Egyptian  Air  Academy,  Egypt . 397 

FBI -3:  Statistical  Modeling  of  Idd  Testing  Efficiency  of  Analogue  Integrated  Circuits 

V.  Musil,  Univ.  of  Brno,  Czech  Republic . 401 

FBI -4:  A  Simulated  Annealing  Algorithm  for  The  Optimization  of  Communication  Networks 

E.  Costamagna,  A.  Fanni,  G.  Giacinto,  Univ.  of  Pavia,  Italy . 405 

SESSION  FCl:  Array  Processors,  Modulation  and  Coding 

Convener:  R.  L.  Pickholtz,  George  Washington  Univ.,  USA 

FCl  Tutorial:  On  Optimization  of  Broadband  Beamformer  for  Hearing  Aid  Applications 

A.  Wang,  F.  Lorenselli,  D.  Korompis,  R.  E.  Hudson,  K.  Yao,  UCLA,  USA . 409 

FCl-1:  Optimum  Detection  of  The  Discrete  Spectrum  Signals,  with  Applications  to  Array  Beamforming  in 
A  Waveguide 

A.  I.  Malekhanov ,  Russian  Academy  of  Science,  Russia . * 

FCl-2:  Pseudo-Random  Turbo-Codes 

G.  Battail,  ENST,  France . 419 

FCl-3:  An  Automatic  Procedure  to  Construct  The  Optimal  Soft-Decision  Quantizers  and  Branch  Metrics 
for  Viterbi  Decoders 

T.  F.  Wong,  D.  J.  Skellern,  L.  H.  Lee,  Macquarie  Univ.,  Australia . 423 
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SESSION  FDl:  Digital  Filters  and  Filter  Banks 

Convener:  G.  Pottie,  UCLA,  USA 


FDl  Tutorial:  State  Space  approach  in  Time-Varying  Biorthogonal  Filter  Banks 

A.  N.  Lemma,  E.  Deprettere,  Tech.  Univ.  of  Delft,  Netherland . 427 

FDl-1:  Piecewise-Linear  Characteristics  of  MSLMS  Filters 

H.  Q.  Xu,  J.  N.  Lin,  R.  Unbehauen,  Univ.  Erlangen-Nuernberg,  Germany . . 435 

FDl -2:  Variance  of  Quantization  Error  at  The  Output  of  Recursive  Digital  Filter 

G.  Mollova,  Higher  Inst,  of  Arch,  and  Civil  Engr.,  Bulgaria . 439 


SESSION  FA2:  Packaging  and  Amplifiers 

Convener:  M.  Levinshtein,  A.  F.  Ioffe  Institute,  Russia 


FA2  Tutorial:  RF  Characterization  of  A  Low  Cost  Multichip  Packaging  Technology  for  Monolithic 
Mircrowave  and  Millimetenvave  Integrated  Circuits 

K.  Jayarai,  T.  E.  Noll,  Foster-Miller,  Inc.  And  D.  Singh,  Honeywell  Tech.  Ctr.  USA . 443 

FA2-1:  Novel  Approach  Using  Tapers  for  High  Power  FET  Chips  Characterization 

M.  Zoyo,  C.  Galy,  A.  Darbandi,  L.  Lapierre,  J.  F.  Sautereau,  ALCATEL  ESPACE,  France . 447 

FA2-2:  High  Temperature  Operational  Amplifier  with  Low  Offset  Voltage 

J.  Bergmann,  A.  Have  ten,  Inst,  for  Electronics,  Bochum,  Germany . . 451 

FA2-3:  Design  of  A  High  Gain  Power  Amplifier  Using  A  Bias  Defendent  Large  Signal  Mestet  Model  and 
The  Describing  Function  Technique 

S.  Munoz,  J.  L.  Sebastian,  J.  L.  Sebastian,  Univ.  of  Complutense,  Spain . 455 

FA2-4:  Simulation  and  Characterization  of  A  High  Temperature  SIMOX-Operational  Amplifier 

P.  Gehse,  C.  Eisenhut,  Inst,  for  Electronics,  Bochum,  Germany . 459 


SESSION  FB2:  Modeling  and  Simulation  HI 

Convener:  K.  Weiser,  Technion,  Israel 


FB2  Tutorial:  Conformal  Mapping  Analysis  of  Microshield  Transmission  Lines 

E.  Costamagna,  A.  Fanni,  Univ.  of  Pavia,  Italy . 463 

FB2-1 :  Device  Models  in  FDTD  Analysis  of  Microwave  Circuits 

C.  N.  Kuo,  B.  Houshmand,  T.  Itoh,  UCLA,  USA . 467 

FB2-2:  A  Simple  Technique  for  Determining  Transmission  Losses  in  Saw  Delay  Lines 

T.  E.  Taha,  Faculty  of  Electrical  Engineering,  Egypt . 471 

FB2-3  :  Comparison  Study  Between  A  Computer  Alied  Analysis  Program  Using  State  Space  Technique  and 
Spice  Program 

A.  M.  Nassar,  A.  M.  Moselhy,  M.  A.  El-Gazzar ,  Egyptian  Air  Academy,  Egypt . 477 

*FB2-4:  Nonlinear  Emissions  Dynamic  of  Heterolaser  Caused  by  Nonlinear  Waveguiding  in  The  Active 
Layer 

V.  V.  Tchernyi,  G.  A.  Zhuravely,  Volgograd  State  Univ.  Russia .  * 


*  Papers  not  available  at  the  time  of  printing. 


SESSION  FC2:  Signal  Processing  and  System  Theory 
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ABSTRACT 

Active  integrated  antennas,  in  which  solid-state 
devices  and  antennas  are  integrated  together  on  a 
single  substrate,  offer  significant  advantages  in 
performance,  size  and  cost  for  microwave  wireless 
systems.  Recent  developments  include  elements  for 
integrated  transmitters,  heterodyne  receivers, 
transceivers  and  transponders  operating  at 
frequencies  from  UHF  through  millimeter-wave. 
These  elements  can  be  used  alone  or  in  phased 
arrays  for  wireless  communication,  identification 
and  sensing. 

1.  INTRODUCTION 

In  conventional  microwave  and  millimeter  wave 
wireless  system  design,  the  signal  processing 
circuitry  and  the  antenna  are  often  treated  as 
separate  subsystems.  Combining  these  elements  to 
form  integrated  active  antennas  can  result  in  a 
high-performance  circuit  with  the  minimized  size, 
complexity  and  cost  demanded  by  modern 
commercial  wireless  applications  [1].  In  this 
approach,  a  radiating  structure  serves  as  both  an 
antenna  and  a  circuit  element  such  as  a  filter, 
diplexer,  or  resonator.  This  allows  a  wide 
assortment  of  printed  antennas  to  be  used  to  form 
efficient  compact  oscillators,  mixers,  and  receivers 
which  interact  directly  with  the  radiated  fields  [2]. 

2.  TRANSMITTERS 

A  simple  radio  transmitter  is  formed  when  an 
oscillating  active  device,  such  as  a  Gunn  diode  or 
MESFET,  is  coupled  to  a  printed  antenna.  In  this 
situation,  the  antenna  may  function  as  both  the 
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load  and  the  resonant  element  which  provides 
positive  feedback  for  oscillation  at  a  desired 
frequency.  An  example  is  the  circuit  shown  in 
Figure  1,  which  comprises  a  GaAs  MESFET 
coupled  to  a  microstrip  slot  antenna.  The  FET  is 
destabilized  via  reactive  loading  with  mistrostrip 
stubs,  to  present  a  negative  resistance  to  the  slot 
anteima.  This  results  in  oscillation  at  25  GHz, 
producing  a  10  mW  signal  which  is  directly 
radiated  [3].  Many  similar  circuits  have  been 
designed,  employing  resonant  antennas  such  as 
rectangular  and  circular  microstrip  patches,  CPW 
slots  and  printed  dipoles.  Traveling-wave  type 
antennas  such  as  the  exponentially-tapered  slot 
have  also  been  used  for  wideband  operation  with 
tunable  sources  [4].  Much  of  the  recent  interest  in 
the  area  of  active  antennas  has  been  in  using 
arrays  of  oscillators  in  spatial  power  combining 
schemes,  where  the  radiated  power  from  many 
low-power  devices  is  combined  in  free  space  to 
form  a  powerful  beam.  This  is  especially  useful  at 
millimeter- wave  frequencies,  where  the  power 
available  from  each  active  device  is  relatively  low. 
For  example,  with  a  large  array  of  n  sources,  the 
total  radiated  power  as  well  as  the  array 
directivity  both  increase  with  n,  resulting  in  an 
effective  isotropic  radiated  power  (EIRP)  which 
increases  as  n^.  This  EIRP  may  exceed  that  which 
is  possible  from  a  conventional  waveguide  power¬ 
combining  network  feeding  an  antenna  of  equal 
directivity,  due  to  relatively  lossless  free  space 
propagation.  A  recent  example  is  a  6-element 
linear  array  of  oscillators,  with  elements 
comprising  GaAs  MESFETs  coupled  to 
rectangular  microstrip  patches  [5].  The  array 
produces  a  broadside  beam  at  10.8  GHz,  with  an 
EIRP  exceeding  100  W. 
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Figure  1.  25  GHz  active  antenna  oscillator  with  FET  and 
microstrip  slot  antenna. 


Figure  2.  Transmitting  phased  array  using  the  unilateral 
injection-locking  technique.  A  reference  signal  is  applied  to 
the  first  element  only. 


The  oscillation  frequency  of  an  active  antenna 
can  be  controlled  by  injection-locking  it  to  a  low- 
power  reference  signal.  This  allows  a  modulated 
carrier  to  be  effectively  amplified  and  radiated  by 
an  element.  In  an  array,  injection-locking  allows 
the  radiated  power  from  all  elements  to  add 
coherently  and  form  a  beam.  This  is  done  by 
distributing  a  reference  signal,  or  through 
electromagnetic  coupling  between  the  oscillators 
that  allows  the  array  to  self-synchronize.  By 
controlling  this  coupling  and  the  tuning  of  selected 
oscillators  in  the  array,  the  transmitted  beam  can 
be  made  to  scan  in  various  directions.  In  contrast 
to  conventional  phased  arrays,  this  method  does 
not  require  phase  shifters.  As  the  RF  source  is 
distributed  over  the  array,  the  conventional  feed 
network  is  eliminated  as  well.  This  makes  possible 
inexpensive  phased  arrays  for  such  applications  as 
portable  satellite  uplink  stations,  as  well  as  other 
transmitters  that  are  thin  and  conformally 
mountable.  One  example  is  an  X-band  array  which 
uses  strong  coupling  between  oscillator  elements, 
in  conjunction  with  detuning  the  free-running 
frequencies  of  the  end  elements  of  the  array.  A 
progressive  interelement  phase  shift  results  when 
the  elements  injection-lock  each  other  to  the  same 
frequency,  resulting  in  a  continuously  steerable 
radiated  beam  [6] .  Another  recent  array  uses 
unilateral  injection-locking  at  6  GHz,  where  the 
phase  reference  signal  is  passed  from  one  element 
to  the  next  in  one  direction  (Figure  2).  This  results 
in  greater  control  over  the  phase  progression  in  the 
array,  allowing  the  scanning  of  both  sum  and 
difference  patterns 


[7].  Applications  include  automotive  radar,  for 
road  sensing  and  collision  avoidance. 

3.  RECEIVERS 

Receivers  with  integrated  antennas  include  quasi- 
optical  mixers  and  detectors,  which  are  highly 
compact  frequency-converting  circuits.  The  input 
RF  and  local  oscillator  (LO)  signals  are  coupled 
into  the  structure  via  radiation  and  the  mixing 
product  (IF),  usually  the  difference  frequency,  is 
extracted  through  a  printed  waveguide.  The 
mixing  devices  are  typically  Schottky  diodes  or 
MESFETs.  At  millimeter-wave  frequencies  and 
above,  the  entire  antenna  structure  including  the 
devices  may  be  fabricated  monolithically  on  a 
single  semiconductor  substrate.  In  quasi-optical 
mixers,  the  active  devices  are  mounted  directly  in 
the  antenna  or  feed  structure  where  the  impedance 
is  desirable.  Antenna  types  include  various  slots 
(slot  ring,  tapered  slot,  and  coupled  rectangular 
slots),  patches,  bowtie  dipoles,  and  log  spirals. 
Since  the  LO  signal  must  be  radiated  in,  such 
mixers  may  be  impractical  as  down-converters  in 
microwave  wireless  systems  (although  they  can  be 
used  “in  reverse”  as  upconverters,  such  as  for  a 
transponder).  Instead,  self  oscillating  quasi-optical 
mixers  can  be  employed  which  generate  their  own 
LO  signal,  often  with  the  same  nonlinear  active 
device  used  for  frequency  conversion  [8] . 

Quasi-optical  mixers  can  also  be  arrayed  to 
form  more  complex  integrated  receivers,  such  as 
for  coherent  monopulse  tracking  (four- quadrant 
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Figure  3.  A  planar  6  GHz  Doppler  radar  module  with 
azimuth  tracking  capability. 
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Figure  4.  A  microwave  noncontact  identification  card, 
using  6  GHz  interrogation. 


sum  and  difference  patterns).  One  such  monopulse 
receiver  operates  at  35  GHz,  using  a  2x2  array  of 
slot  ring  mixers  fabricated  monolithically  on  a 
GaAs  substrate  [9].  The  mixer  element  uses  diodes 
placed  at  a  A,/4  interval  within  a  lAr-circumference 
annular  slot,  which  is  etched  into  metal  that  clads 
the  dielectric  substrate.  This  slot-ring  acts  as  both 
antenna,  hybrid  and  diplexer  to  provide  balanced 
mixing  between  orthogonally-polarized  RF  and  LO 
beams.  Feeding  a  mixer-antenna  array  with  in- 
phase  LO  signals  results  in  coherently  down- 
converted  IF,  such  that  the  array  “stares”  in  a 
fixed  direction.  By  varying  the  LO  phases,  it  is 
possible  to  steer  the  receiving  beam  to  a  desired 
angle.  This  technique  has  been  used  in  an 
adaptively-steered  receiver  array,  for  reception  of 
Japanese  DBS  satellite  television  at  12  GHz  [10].  A 
216-element  rectangular,  patch  array  is  fabricated 
in  a  multilayer  structure,  which  includes  low  noise 
downconverters  in  coplanar  waveguide  underneath 
each  9-element  subarray.  The  IF  output  signals  are 
applied  to  self-phasing  circuits,  which  combine  the 
subarrays  with  phasing  such  that  the  receiving 
beam  tracks  the  satellite  automatically.  This 
planar  circuit  replaces  a  33  cm.-diameter  parabolic 
dish  and  its  feedhorn  electronics. 

4.  TRANSCEIVERS  &  TRANSPONDERS 

The  active  antenna  approach  is  especially  well 
suited  to  wireless-system  components  where  both 


transmit  and  receive  capabilites  are  required.  In 
this  case,  a  whole  system  can  be  integrated  onto  a 
single  substrate.  An  example  is  the  Doppler  radar 
transceiver  shown  in  Figure  3.  This  2-element 
microstrip  patch  array  uses  GaAs  MESFETs  as 
transmitting  oscillators  and  as  receiving  mixers,  by 
virtue  of  their  nonlinearity.  A  sum  pattern  is 
transmitted  at  6.5  GHz,  and  both  sum  and 
difference  patterns  are  available  at  the  IF  (Doppler 
shift  frequency)  to  obtain  the  range-rate  and  rough 
azimuth  tracking  of  a  target  [llj.  This  information 
can  be  used  by  intrusion-detection  sensors  and 
process  control  systems.  The  circuit  is  a  compact 
and  efficient  equivalent  to  the  bulky  Guim 
oscillator /horn  antenna  conventionally  used.  Other 
transceivers  have  been  designed  for  point-to-point 
microwave  radio  communications.  A  recent 
example  is  the  polarization-duplexed  6  GHz 
transceiver  based  on  an  active  inverted  circular 
patch  [12].  An  FET  is  embedded  in  a  circular 
microstrip  patch  to  form  an  oscillator,  whose  signal 
is  used  as  the  transmitted  carrier  as  well  as  the 
local  oscillator  for  downconverting  incoming 
signals  from  another  transceiver.  For  receiving,  a 
Schottky  diode  is  mounted  in  the  patch  at  an 
optimum  impedance  point  to  form  a  microwave 
mixer.  The  incoming  and  outgoing  signals  are 
orthogonally  polarized  as  a  result  of  the  two 
resonant  modes  used  for  transmitting  and  mixing 
by  the  patch  antenna.  These  signals  are  also 
frequency  spaced  to  provide  a  200  MHz  IF  output 
at  each  station. 
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A  transponder  is  a  component  that  transmits  a 
signal  only  in  response  to  an  interrogation  signal. 
The  use  of  small,  inexpensive  microwave  “tag” 
transponders  for  non-contact  identification  (RFID) 
such  as  entry  systems,  toll  collection,  and  inventory 
control  has  attracted  interest  in  recent  years.  An  ID 
card  was  recently  developed  which,  when 
illuminated  by  a  6  GHz  signal,  transmits  a  unique 
digital  identification  code  at  a  response  frequency 
near  12  GHz.  The  coded  response  is  readable  from  a 
distance  of  10  feet  away  using  a  simple  transceiver, 
and  no  microwave  source  is  required  on  the  card 
(Figure  4).  This  circuit  is  based  on  a  quasi-optical 
mixer  originally  designed  for  mm-wave  arrays,  and 
demonstrates  active-anteima  techniques.  Only  three 
Schottky  diodes  are  needed  for  the  microwave 
processing,  including  interrogation  detection  for 
activating  the  card  from  a  power  conserving  state 
[13].  Other  transponders  have  been  developed  which 
translate  a  12  GHz  interrogation  signal  to  7  GHz  to 
provide  a  unique  response  signature,  using 
microstrip  patch  anteimas  integrated  with  two 
microwave  FETs  on  a  single  substrate  [14].  For 
long  range  operation,  em  8-element  array  type 
transponder  was  recently  demonstrated  which  uses 
diode  mixers  with  integrated  patch  anteimas  to 
retro-reflect  a  6  GHz  interrogation  signal  back  to  its 
source.  This  circuit  provides  the  function  of  a  radar 
corner-reflector  in  a  planar,  conformal  circuit  which 
can  be  electronically  modulated  [15]. 

5.  CONCLUSION 

Active  integrated  antennas  are  compact,  efficient 
circuits  that  can  provide  the  essential  functions 
required  in  microwave  wireless  systems.  The 
advantages  of  this  integrated  approach  have  been 
demonstrated  by  several  recently  developed  circuits. 
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ABSTRACT 

Unilateral  and  bilateral  injection-locked  oscillator  ar¬ 
rays  show  promise  for  cost-effective  implementation 
of  intelligent  scanning  antenna  systems,  especially  for 
emerging  millimeter- wave  applications.  This  paper  re¬ 
views  some  of  these  techniques  and  some  remaining 
technical  challenges  for  practical  implementation  of  the 
concepts. 

INTRODUCTION 

A  number  of  commercial  opportunities  are  emerging  in 
the  wireless  communications,  imaging,  and  automotive 
electronics  markets  for  active,  “smart”  antenna  array 
systems.  These  applications  have  brought  a  number  of 
new  constraints  that  could  make  conventional  phased 
and  adaptive  array  techniques  unattractive.  Most  no¬ 
tably  these  are  cost,  size,  and  efficiency. 

It  is  anticipated  that  automotive  collision  avoidance  or 
blind  spot  indicators  should  cost  less  than  US$200  for 
market  viability.  Auto  manufacturers  are  also  reluctant 
to  allocate  more  than  a  few  square  inches  of  surface  area 
for  possible  antenna  aperture,  which  not  only  makes  sys¬ 
tem  integration  difficult  but  also  forces  the  use  of  higher 
frequencies  in  order  to  satisfy  antenna  beamwidth  re¬ 
quirements.  Initial  systems  are  targeting  24  GHz  and 
77  GHz,  but  there  is  already  talk  of  94-150  GHz  sys¬ 
tems  for  further  miniaturization.  Such  high  frequencies 
stress  current  device  technology. 

In  the  wireless  sector,  compact  scanning  arrays  could  be 
useful  in  hand-held  or  mobile  systems  to  reduce  power 
consumption  through  “adaptive  power  control”,  where 
the  array  alternates  between  a  low  gain  “search”  mode 
and  a  high  gain  “acquisition”  mode  much  like  a  track¬ 
ing  radar  system.  In  the  high  gain  mode  the  array  has 
a  high  directivity  which  requires  less  power  for  effective 
communication,  and  also  reduces  the  susceptibility  to 
interference  when  receiving.  Compact  scanning  systems 
could  also  be  used  for  mobile  satellite  links  which  also 
require  high  directivity  antennas.  These  wireless  ap¬ 
plications  will  likely  suffer  similar  constraints  to  those 
mentioned  above. 


Principles  of  conventional  phased- arrays  are  well  known. 
Typically  coherence  is  maintained  between  array  ele¬ 
ments  using  a  single  source  distributed  to  every  array 
element  through  a  corporate  feed  network.  Separate 
phase  shifters  are  required  at  each  element,  each  of 
which  requires  biasing  and  multiple  control  wires  (fig  1). 
It  is  usually  difficult  to  integrate  such  systems  success¬ 
fully  into  a  small  package  since  the  antennas,  feed  net¬ 
works,  phase-shifters,  and  control  signals  occupy  a  lot  of 
space.  Phase  shifters  can  also  add  considerable  expense 
to  a  system,  depending  on  the  size  of  the  array  and 
scanning  requirements.  At  millimeter- wave  frequencies 
it  is  also  difficult  to  generate  the  required  output  power 
with  a  single  solid-state  source,  and  this  problem  is  com¬ 
pounded  by  large  losses  in  the  feed  network  and  phase 
shifters. 


Figure  1  -  Conventional  approach  for  an  electronically 
scanned  transmitter. 

This  has  led  a  number  of  researchers  to  explore  new 
techniques  for  beam  control  that  either  eliminate  or 
combine  the  functions  one  or  more  components  in  con¬ 
ventional  phased- arrays  systems.  A  number  of  interest¬ 
ing  techniques  have  been  realized  using  injection-locked 
or  mutually  sjoichronized  oscillator  arrays,  and  depend 
on  the  unusual  phase  dynamics  of  nonlinear  oscillators 
to  achieve  the  scanning  objective.  The  common  feature 
of  these  alternatives  is  that  they  eliminate  the  phase- 
shifter  circuitry  at  each  array  element,  and  in  some  cases 
the  RF  and  control  signal  distribution  networks.  The 
proposed  techniques  appear  to  have  some  advantages 
over  conventional  phased  arrays,  but  as  with  any  real 
system  this  typically  comes  at  the  expense  of  some  other 


0-7803-2516-8/95  $4.00  ©  1 995  IEEE  5 


performance  criterion;  whether  these  are  acceptable  al¬ 
ternatives  must  be  considered  on  a  case-by-case  basis. 

2.  INJECTION-LOCKING  TECHNIQUES 

The  practical  use  of  nonlinear  oscillators  for  phase  con¬ 
trol  is  based  on  the  phenomenon  of  injection  locking, 
sometimes  called  frequency  entrainment.  This  effect  has 
a  long  history  dating  back  to  observations  of  synchro¬ 
nized  mechanical  pendulums  by  Huygens,  and  has  since 
been  observed  in  a  wide  variety  of  self-sustained  (non¬ 
linear)  biological  and  physical  oscillators. 


Figure  2  -  Injection-locked  oscillator  and  plot  show¬ 
ing  behavior  of  oscillator  phase  as  injection  frequency  is 
varied  relative  to  the  free-running  frequency. 


Using  a  simple  single-tuned  oscillator  model  it  can  be 
shown  [1]  that  the  output  phase  of  an  oscillator  with  an 
injected  signal  is  given  by  Adler’s  equation 

where  9  =  and  ^inj  —  are  the  instan¬ 

taneous  phases  of  the  oscillator  and  the  injected  signal, 
respectively,  Amj  and  A  are  the  amplitudes,  similarly, 
and  Q  is  the  quality-factor  of  the  oscillator  resonant  cir¬ 
cuit.  When  the  oscillator  locks  onto  the  injected  signal, 
d6/dt  =  cjinj  in  the  steady-state,  and  (1)  becomes 

^inj  sin  A9  (2) 

where  Aum  =  Ainju;o/2QA.  Auju  is  called  the  locking 
bandwidth  of  the  oscillator,  and  is  the  steady-state 
phase  difference  between  the  oscillator  and  injected  sig¬ 
nal.  Equation  (2)  indicates  that  as  the  injected  signal 
frequency  is  tuned  over  the  locking  range  of  the  oscilla¬ 
tor,  'ujQ  ±  AuJmi  tliG  phase  difference  will  vary  between 
-90°  <  A9  <  90°  (see  figure  2). 

The  most  straightforward  application  of  the  injection¬ 
locking  technique  to  phased  arrays  is  shown  in  fig¬ 
ure' 3.  Each  array  element  is  a  self-contained,  voltage- 
controlled  oscillator  (VCO)  that  delivers  its  energy  to 
an  antenna.  The  oscillators  are  all  slaved  to  a  common 
signal  (the  desired  output  signal)  which  is  distributed 
using  a  standard  feed  structure  like  a  corporate  feed 
network.  The  phase  of  each  oscillator  can  be  changed 


relative  to  the  reference  signal  (and  hence  the  other  os¬ 
cillators)  by  adjusting  the  oscillator  tuning  voltage  (the 
free-running  frequency)  according  to  figure  2  above.  It 
should  be  noted  that  oscillator  arrays  like  this  also  serve 
as  efficient  power  combiners;  in  fact,  most  of  the  tech¬ 
niques  described  in  this  paper  have  evolved  from  re¬ 
search  groups  investigating  quasi-optical  power  combin¬ 
ing  methods  [2]. 


Figure  3  -  Injection-locked  oscillator  array.  Output 
can  be  scanned  through  electronic  control  of  each  oscil¬ 
lator  tuning  voltage. 


A  4  X  4  array  using  the  topology  of  figure  3  has  been 
reported  [3],  although  the  emphasis  was  on  the  power¬ 
combining  aspect  of  the  array  and  not  the  possible  scan¬ 
ning  feature.  The  technique  shown  in  figure  3  will  be 
limited  in  bandwidth  by  the  locking  range  of  the  os¬ 
cillators,  which  in  turn  is  related  to  the  injected  signal 
strength.  This  array  topology  should  degrade  gracefully. 


antenna 


Tuning  ports 


Figure  4  -  Cascade  of  injection-locked  VCOs  slaved 
to  a  low- noise  master  oscillator,  after  Itoh  [4].  Voltage¬ 
tuning  each  VCO  can  create  a  progressive  phase  shift. 

A  variation  of  this  idea  that  eliminates  the  corpo¬ 
rate  feed  structure  involves  an  injection-locked  cascade 
whereby  each  array  element  is  slaved  to  the  preceding 
element  in  the  array.  This  is  shown  in  figure  4,  and 
has  been  tested  experimentally  at  UCLA  [4].  Ampli¬ 
fiers  are  used  between  the  oscillators  for  isolation  and 
also  help  boost  the  signal  strength  for  a  better  lock¬ 
ing  range.  This  does  increase  the  circuit  complexity. 
The  extension  to  two-dimensional  arrays  is  straightfor¬ 
ward.  One  feature  of  all  injection- locked  arrays  is  that 
the  oscillators  will  assume  the  noise  properties  of  the 
master  oscillator  or  reference  signal,  even  if  the  oscilla¬ 
tors  themselves  are  quite  noisy.  Therefore  the  array  in 
figure  4  will  collectively  have  a  low  phase  noise  as  long 
as  the  injected  signal  is  derived  from  a  “quiet”  source. 
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3.  COUPLED  OSCILLATOR  AND 
PHASE-LOCKED  LOOP  ARRAYS 

A  less  obvious  extension  of  the  injection-locking  con¬ 
cept  is  an  array  of  mutually  synchronized  oscillators, 
such  as  illustrated  in  figure  5  below.  Each  oscillator  is 
bilaterally  coupled  to  neighboring  array  elements.  This 
system  was  first  proposed  by  Stephan  [5].  Mutual  co¬ 
herence  is  achieved  via  the  inject  ion- locking  process,  but 
the  steady-state  phase  relationships  are  more  difficult  to 
calculate  since  each  oscillator  depends  nonlinearly  on  its 
neighbors. 


tuning  ports 


Figure  5  -  Coupled-oscillator  array 


vantage  of  this  technique  over  the  unilateral  injection- 
locked  oscillators:  only  peripheral  array  elements  need 
to  be  voltage-controlled. 

Although  (4)  seems  to  imply  that  any  phase  shift  A0 
can  be  obtained,  a  stability  analysis  of  (3)  puts  limits 
on  this  quantity  [6].  For  the  special  case  of  ^  =  0°,  the 
limits  are 

-90°  <  A0  <  90° 

In  a  typical  array  with  element  spacing  of  d  =  Ao/2, 
this  phase  shift  is  sufficient  to  scan  the  beam  over  a 
±30°  range.  This  simplified  analysis  serves  to  intro¬ 
duce  the  beam-scanning  concept,  but  in  reality  the  dy¬ 
namics  are  complicated  by  many  factors  that  have  been 
left  out  of  (1),  such  as  amplitude  dynamics  and  non¬ 
uniformities,  frequency-dependent  coupling  networks, 
non-nearest-neighbor  interactions,  non-uniform  tuning 
profiles  of  the  VCOs,  and  frequency-dependent  device 
characteristics  [7]. 


Analysis  of  the  pheise  dynamics  can  be  simplified  by 
assuming  nearest- neighbor  interaction  and  identical  os¬ 
cillators.  The  mutual  interaction  between  adjacent  os¬ 
cillators  is  described  using  a  complex  coupling  coeffi¬ 
cient  e:exp(-j$),  which  can  be  related  to  the  y-  or 
z-parameters  of  the  coupling  circuit  that  connects  the 
oscillators  [7].  The  phase  dynamics  of  a  system  of  N 
coupled  oscillators  can  then  be  derived  as  [6,7] 


E  +  z  =  l,2,...,Ar 

j^i  ■ 

(3) 

where  u)i,  and  9i  are  the  frequency  and  instantaneous 
phase,  respectively,  of  oscillator  z,  and  Q  is  the  Q-factor 
of  the  oscillator  embedding  circuits.  This  is  obviously 
just  a  generalization  of  Adler’s  equation  (1).  When  the 
free-running  frequencies,  cj^,  are  similar  enough,  then 
the  oscillators  can  lock  to  the  same  frequency  so  that 
dOi/dt  =  cj  in  the  steady-state.  For  beam-scanning 
a  constant  progressive  phase  shift  of  A0  is  required, 
represented  mathematically  as  —  9i-i  —  A0  where 
i  =  2 . . .  iV.  Substituting  this  condition  into  (3)  and 
assuming  $  =  0°  leads  to  a  set  of  conditions  on  the 
free-running  frequencies  [6] 


'  cjo/  [1  -  Au;m  sin  A0] 

CJi  =  <  CJo 

^a;o/[l±  Aa;m  sinA0] 


if  z  =  1 
if  1  <  z  <  N 
ifz  =  iV 


(4) 


where  a;o  is  the  desired  steady-state  synchronized  fre¬ 
quency,  and  Au;m  =  eu)^l2Q  is  the  locking  range.  This 
indicates  that  a  constant  phase  shift  can  be  programmed 
by  slightly  detuning  only  the  end  elements  of  the  array 
in  opposite  directions.  This  appears  to  be  the  key  ad¬ 


Several  coupled  oscillator  arrays  have  been  constructed 
during  the  past  two  years,  using  packaged  MESFET 
devices  and  patch  antennas.  The  oscillators  are  typi¬ 
cally  coupled  with  resistively- loaded  transmission  lines 
[7] .  A  recent  version  was  built  in  collaboration  with  re¬ 
searchers  at  the  Jet  Propulsion  Laboratory,  operating  at 
8.4  GHz  [8].  It  used  a  novel  VCO  based  on  a  varactor- 
tuned  patch  antenna,  and  produced  over  1  Watt  with  an 
8-element  array,  and  had  a  total  scanning  range  of  45"^ 
(-15°  to  ±30°  around  broadside). 


Figure  6  -  An  improved  version  of  the  scanning  oscilla¬ 
tor  concept  uses  frequency  doublers  to  increase  the  scan 
range,  and  a  low-noise  source  to  stabilize  the  array. 


A  limitation  of  both  the  injection-locked  and  coupled- 
oscillator  topologies  is  the  limited  range  of  phase  shifts 
that  can  be  synthesized.  This  can  be  solved  using  the 
topology  shown  in  figure  6  below.  A  frequency  doubler 
circuit  is  used  at  each  array  element  which  effectively 
doubles  the  inter-element  phase  shift,  which  extends  the 
theoretical  scanning  range  to  nearly  full  hemispherical 
coverage  [9]. 

This  technique  has  other  potential  benefits:  the  oscilla¬ 
tors  are  designed  at  half  the  desired  output  frequency, 
which  reduces  the  cost  of  the  device  technology  and  sim- 
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plifies  design.  The  range  of  oscillator  tuning  required  to 
achieve  a  given  scan  range  is  also  significantly  reduced. 
For  ±45°  scanning  in  a  frequency-doubled  array  (half¬ 
wavelength  spacing),  the  relationship  between  oscillator 
tunings  and  interelement  phase-shift  (fig.  2)  is  quite  lin¬ 
ear,  which  simplifies  calibration  and  scan  control  circuit 
design.  Doublers  (and  possibly  amplifiers)  following  the 
oscillators  also  provide  a  desirable  measure  of  isolation 
between  the  antenna  and  oscillator. 


tion.  The  technology  is  still  relatively  new,  and  it  is  not 
clear  whether  the  benefits  of  the  new  techniques  out¬ 
weigh  the  new  difficulties  that  are  introduced.  In  the 
near  term,  these  approaches  could  find  use  in  low  cost, 
low  to  mdeium  performance  mm- wave  imaging  systems, 
as  they  are  inherently  suited  to  the  FMCW  technique. 
As  the  understanding  of  coupled-oscillator  systems  im¬ 
prove,  it  may  well  be  possible  to  achieve  high  perfor¬ 
mance  with  the  proposed  methods. 


Phase  noise  in  a  firee-running  coupled-oscillator  array  is 
typically  poor,  and  this  is  exacerbated  by  the  frequency 
doublers.  Low  noise  can  be  achieved  by  injecting  a  sta¬ 
ble  reference  signal  (which  could  also  contain  modula¬ 
tion)  into  the  center  element  of  the  array,  as  suggested 
in  the  figure  6.  Dramatic  improvement  in  phase  noise 
has  been  measured,  suitable  for  sensitive  receiver  appli¬ 
cations  [10]. 


Figure  7  -  The  coupled-oscillator  technique  can  be  used 
for  scanning  receivers  by  using  the  array  as  a  distributed 
local  oscillator. 


The  scanning  oscillator  configurations  can  also  be  used 
for  receiving  applications  [11].  This  is  accomplished  by 
using  the  scanning  oscillator  array  as  the  local  oscillator 
for  a  set  of  mixers,  as  shown  in  figure  7.  Using  one  of 
our  early  array  prototypes  and  some  commercial  pack¬ 
aged  mixers,  this  concept  was  tested  by  first  measuring 
the  scanning  properties  in  transmit  mode  (oscillators 
coupled  directly  to  antennas),  followed  by  the  receive 
mode  as  shown.  Identical  scan  ranges  and  patterns  ob¬ 
served  in  each  case,  as  expected.  It  may  be  possible  to 
merge  the  transmit  and  receive  functions,  especially  for 
FMCW  imaging  arrays,  by  making  each  array  element 
a  self-contained  FMCW  transmitter  and  receiver,  where 
each  array  element  is  coupled  to  its  neighbors.  This  has 
not  yet  been  tested. 

4.  CONCLUSIONS 

Several  interesting  new  techniques  using  injection  locked 
and  coupled  oscillators  are  currently  under  investiga¬ 
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ABSTRACT 

The  direct  modulation  of  oscillators  in  an  active  ar¬ 
ray  provides  an  efficient  means  of  communicating  over 
millimeter-wave  carriers.  This  paper  describes  how  var¬ 
ious  forms  of  digital  modulation  can  be  applied  and 
gives  details  of  the  development  of  a  28  GHz  transmis¬ 
sion  system  designed  to  operate  at  2Mbit /s  using  fre¬ 
quency  shift  keying. 

1.  INTRODUCTION 

Millimeter- wave  wireless  local  and  wide  area  networks 
offer  the  promise  of  high-bandwidth  communications 
within  small  to  medium  sized  cells.  However,  even 
for  modest  data  rates  and  coverage  areas,  the  power 
requirements  exceed  the  levels  possible  with  existing 
semiconductor  based  amplifiers.  Several  devices  must 
therefore  be  used  in  conjunction  with  power  combin¬ 
ers  to  generate  adequate  power  levels  for  most  appli¬ 
cations.  These  power  combiners  can  be  quite  lossy 
though,  especially  when  large  arrays  of  devices  are  in¬ 
volved.  One  solution  to  this  problem  is  to  use  quasi- 
optical  power  combining  [1],  in  which  the  power  gen¬ 
erated  by  each  element  of  an  array  of  solid-state  oscil¬ 
lators  or  amplifiers  is  radiated  into  free-space,  where 
power-combining  losses  are  negligible. 

Most  of  the  array  power  combining  research  done 
thus  far  has  focused  on  creating  arrays  with  future  ap¬ 
plication  in  radar  and  point-to-point  communications. 
Recently  a  quasi-optical  array  which  could  be  used  in 
a  multipoint  distribution  system  was  presented  [2] . 

Although  active  antennas  capable  of  both  receiving 
and  transmitting  have  been  created  [3,  4],  separation 
of  these  functions  into  different  arrays  has  a  number 
of  benefits.  It  allows  greater  isolation,  which  decreases 
coupling  of  the  strong  transmitted  signals  into  the  sen¬ 
sitive  receivers,  and  it  simplifies  full-duplex  operation. 
Additionally,  separating  the  receive  and  transmit  func¬ 
tions  gives  more  freedom  in  optimizing  the  different 
elements. 


fi  f2  fi  f2 


Figure  1:  Comparison  of  power  spectra  for  ASK,  FSK, 
and  MSK. 

Because  most  wireless  services  are  migrating  to¬ 
wards  digital  modulation  techniques,  this  paper  will 
discuss  the  application  of  active  antenna  elements  to 
digital  communication  links.  Digital  signals  are  more 
secure  and  easier  to  multiplex,  switch,  and  compress. 
Also,  they  can  be  broadcasted  over  large  distances  with¬ 
out  degradation,  using  repeaters  which  can  faithfully 
reconstruct  the  original  signal  [5] . 

2.  FREQUENCY-SHIFT  KEYING 

Aside  from  amplitude-shift  keying  (ASK),  which  is  the 
modulation  scheme  of  turning  the  carrier  on  and  off, 
frequency-shift  keying  (FSK)  is  the  simplest  digital 
modulation  scheme  to  implement  with  active  antenna 
elements.  In  the  basic  binary  FSK,  the  array  frequency 
is  shifted  synchronously  between  two  frequencies,  fi 
and  /2,  which  represent  the  two  data  states.  Since 
the  operating  frequencies  of  oscillator  arrays  are  sen¬ 
sitive  to  bias  voltage,  this  modulation  is  easily  imple¬ 
mented  by  switching  the  bias  voltage  between  two  val¬ 
ues.  Alternatively,  varactors  can  be  incorporated  into 
the  circuits  to  provide  frequency  control  [6].  FSK  is 
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A  practical,  inexpensive  noncoherent  receiver  can 
be  constructed  for  use  with  a  quasi-optical  array  FSK 
transmitter  fairly  easily  utilizing  an  MCI 3055  “Wide¬ 
band  FSK  Receiver”  IC  [8],  This  integrated  circuit 
contains  the  detectors  and  comparator,  as  well  as  an 
input  amplifier /limiter.  Fig.  3  shows  a  block  diagram 
for  a  complete  28  GHz  receiver  using  this  IC. 


Figure  2;  (a)  Coherent  binary  FSK  receiver  [4].  (b)  Figure  3:  Practical  implementation  of  a  noncoherent 

Noncoherent  receiver  [3] .  The  switches  are  used  to  sam-  28  GHz  binary  FSK  receiver, 

pie  at  intervals  of  To . 


preferred  to  ASK,  even  though  it  generally  occupies  a 
larger  bandwidth  (see  Fig.  1),  because  it  has  fixed  op¬ 
timum  states  (/i  &  /2),  unlike  the  states  of  ASK  which 
are  sensitive  to  fading. 

The  separation  between  the  two  frequencies.  A/  = 
/2  ”  /15  should  be  made  as  small  as  possible  to  reduce 
the  transmission  bandwidth.  In  order  to  maintain  or¬ 
thogonality  between  the  two  states,  the  minimum  value 
of  the  separation  should  be 


where  To  is  the  time  allocated  per  bit  (reciprocal  of 
data  rate).  Modulation  using  this  A/  is  referred  to  as 
minimum-shift  keying  (MSK)  [5]. 

FSK  receivers  can  either  be  coherent  or  noncoher¬ 
ent.  The  coherent  configuration  (Fig.  2a)  uses  a  pair  of 
phase-locked  loops  (PLL)  to  generate  sinusoids  at  fre¬ 
quencies  /i  and  /2  which  are  in  phase  with  the  received 
signal.  Each  of  these  is  then  mixed  with  the  received 
signal,  and  the  resultant  is  integrated.  The  data  value 
(0  or  1)  is  determined  by  comparing  the  outputs  of  the 
integrators  sampled  at  intervals  of  Tq.  The  noncoher¬ 
ent  receiver  (Fig.  2b)  instead  uses  a  pair  of  bandpass 
filters  followed  by  envelope  detectors  to  generate  the 
signals  which  are  fed  to  the  output  comparator. 

While  noncoherent  receivers  are  substantially  easier 
and  cheaper  to  build,  the  coherent  versions  yield  better 
performance.  The  theoretical  difference  in  the  proba¬ 
bility  of  errors  between  the  two  is  a  factor  ranging  from 
two  to  ten,  depending  upon  the  signal-to-noise  ratio  of 
the  received  signal. 


The  automatic  frequency  control  (AFC)  block  ad¬ 
justs  the  frequency  of  the  150  MHz  LO  to  compensate 
for  any  drift  in  the  transmitter  array.  It  senses  the 
frequency  deviations  from  the  center  frequency  set  in 
the  MCI 3055  and  adjusts  to  maintain  a  zero  average 
deviation.  This  circuit  has  a  capacity  of  2Mbit/s,  but 
more  recent  ICs,  such  as  the  MC13155,  could  be  used 
for  higher  speeds. 

Work  is  currently  progressing  on  implementation  of 
this  receiver  to  demonstrate  FSK  of  a  28  GHz  endfire 
active  antenna  element  [9] . 


3.  LINEAR  MODULATION 

Frequency  shift  keying  is  a  nonlinear  modulation 
scheme  that  is  best  suited  for  millimeter-wave  systems 
with  carriers  that  have  insufficient  phase  stability  to 
perform  carrier  phase  estimation [7].  In  contrast,  lin¬ 
ear  modulation  schemes,  such  as  amplitude  and  phase 
modulation,  place  more  stringent  requirements  on  the 
carrier  stability  to  perform  phase-coherent  detection. 
Most  microwave  digital  radio  systems  use  linear  mod¬ 
ulation  schemes  to  achieve  optimal  spectral  efficiencies 
(Bits/Sec/Hz)  with  good  power  efficiency  [10]. 

Phase  modulation  can  be  achieved  in  several  differ¬ 
ent  ways.  One  approach  is  to  place  the  active  oscillator 
element  into  a  phase  locked  loop  [11]  (see  Fig.  4).  The 
oscillator  acts  as  a  voltage  controlled  oscillator.  When 
the  loop  is  locked,  the  average  VCO  frequency  equals 
the  stable  reference  frequency.  The  baseband  data  is 
fed  into  the  low-frequency  section  of  the  PLL. 
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Figure  4:  Phase-locked  loop  for  phase  modulation  of 
an  oscillator  element  (VCO). 

Four- level  phase-shift  keying  (4-PSK),  which  is 
identical  to  four  level  quadrature  amplitude-shift  key¬ 
ing,  has  a  spectral  efficiency  of  2  Bits/Sec/Hz.  Higher 
order  modulation  schemes,  for  example  16-PSK  and 
16-QAM,  afford  higher  data  rates  but  require  larger 
carrier-to-noise  ratios  (C/N).  For  these  higher  order 
schemes,  n-QAM  is  preferable  to  n-PSK  because  it  re¬ 
quires  a  lower  C/N  than  n-PSK.  However,  the  stan¬ 
dard  rectangular  constellation  n-QAM  (see  Fig.  5)  is 
difficult  to  implement  in  an  active  antenna  for  n>16 
because  it  requires  both  a  quadrature  and  an  in-phase 
oscillator.  The  circular  variation  of  QAM,  on  the  other 
hand,  can  be  more  easily  achieved,  by  separately  phase 
and  amplitude  modulating  the  oscillator  element.  An¬ 
other  benefit  of  the  circular  scheme  is  that  it  requires 
slightly  less  peak  power  to  achieve  the  same  probability 
of  error. 

Constant-phase  amplitude  modulation  can  be 
achieved  using  a  class-C  oscillator  of  the  type  described 
in  reference  [12].  Furthermore,  this  element  can  be 
phase  modulated  using  a  PLL  circuit  which  drives  a 
varactor  at  the  gate  of  the  transistor,  allowing  genera¬ 
tion  of  circular  constellation  n-QAM. 


Figure  5:  In-phase  and  quadrature  constellation  for  16- 
PSK  and  rectangular  and  circular  implementations  of 
16-QAM. 
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ABSTRACT 

In  this  paper,  we  consider  an  inverted  stripline 
active  antenna  configuration  for  system 
applications.  Solid-state  devices  can  be  directly 
integrated  with  the  antenna  to  perform  various 
functions.  A  two-way  communication  system 
using  two  integrated  active  antenna  transceivers 
was  demonstrated. 

1.  INTRODUCTION 

Integrated  and  active  integrated  antennas  receive 
a  great  deal  of  attention  because  they  can  reduce 
the  size,  weight  and  cost  of  many  transmit  and 
receive  systems.  Passive  and  active  solid-state 
devices  can  be  configured  to  provide  various 
functions  with  several  components  at  the 
terminals  of  the  antenna.  Two-and  three- 
terminal  solid-state  devices  have  been  integrated 
with  various  antennas  to  form  switches,  tuners, 
detectors,  mixers,  amplifiers,  oscillators, 
Doppler  sensors  and  receivers.  The  choice 
between  transistors  and  diodes  is  not  clear  cut. 
It  depends  on  the  type  of  microwave  component 
required,  operating  frequency  range,  RF  power 
output  desired,  available  DC  power  input  and 
other  considerations.  For  integrated  antennas, 
the  choice  of  solid-state  device  also  involves  its 
effect  on  the  radiation  characteristics.  Although 
integration  can  reduce  the  size,  weight  and  cost 
of  many  microwave  systems,  active  integrated 
antennas  have  shown  a  deterioration  of  both  the 
antenna  and  component  performance.  Unlike  a 
passive  antenna,  an  integrated  antenna  includes 
the  effect  of  device  packages  and  bias  lines  on 
the  electromagnetic  fields  and  currents  which 


flow  on  it.  These  effects  often  cause  changes  in 
operating  frequencies,  lower  conversion 
efficiencies  and  higher  cross-polarization  levels 
(CPL).  If  an  integrated  antenna  can  maintain 
reasonable  component  specifications  ■vrith  little 
degradation  in  the  radiation  characteristics,  the 
approach  would  be  attractive  for  many 
commercial  and  military  systems. 

To  meet  the  performance  requirements,  an 
inverted  stripline  active  antenna  was  developed. 
The  configuration  has  many  advantages  over 
other  microstrip  antennas  such  as  low  cross 
polarization  levels,  small  disturbances  from 
solid-state  device  mounting  and  bias  lines,  ease 
of  hermetically  sealing,  and  excellent  radiation 
patterns.  Solid-state  devices  such  as  PIN  diodes 
[1],  varactors  [1],  Gunn  [2],  FETs  [3],  ^d 
mixers  [4]  have  been  successfully  integrated  with 
the  antennas  to  perform  switching,  tuning, 
oscillation,  beam  steering,  and  transceiving 
functions. 

To  demonstrate  the  system  applications,  a  two- 
way  communication  system  using  two  integrated 
active  antenna  transceivers  was  built.  The 
transmitter  provides  63  mW  output  power  with 
an  estimated  maximum  link  distance  of  4.8  km 

[4]. 

2.  ANTENNA  AND  CIRCUIT 
CONFIGURATIONS 

Figure  1  shows  the  inverted  stripline  antenna 
integrated  with  a  Gunn  diode.  A  single  diode 
exhibited  an  output  power  of  60-70  mW  at  X- 
band  [2].  Power  combining  efficiencies  of  over 
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89%  have  been  demonstrated  for  a  four-element 
square  array.  Beam  steering  was  demonstrated 
by  varying  bias  voltage  to  the  individual  antenna 
element  [5], 

The  inverted  stripline  antenna  was  successfiilly 
integrated  with  an  FET  as  shown  in  Figure  2  [3] 
The  FET  provides  57  mW  at  5.69  GHz  with  a 
DC-to-RF  efficiency  of  over  50%.  The  cross¬ 
polarization  levels  are  less  than  19  dB  compared 
to  the  co-polarization  levels. 

PIN  diodes  or  varactors  can  be  integrated  with 
the  inverted  stripline  to  perform  switching  or 
tuning  functions.  Figure  3  shows  the  integration 
of  these  devices  at  the  radiating  edges  of  the 
antenna  [1].  By  varying  the  bias  voltages  of  the 
PIN  diodes,  the  antenna  reflection  coefficient  at 
the  operating  frequency  varies  from  a  VSWR  of 
1.014  to  22.87.  For  varactor  diodes,  the 
operating  fi-equency  was  tuned  over  31% 
centered  at  3.4  GHz. 

The  integration  of  an  FET  oscillator  and  a  mixer 
diode  with  the  antenna  forms  a  transceiver.  Part 
of  the  FET  output  power  can  be  used  as  the 
local  oscillator  pump  to  the  mixer.  Figure  4 
shows  the  circuit  configuration  [4].  The  mixer 
location  was  optimized  for  impedance  matching. 
The  transmitter  provides  63  mW  and  the  mixer 
has  an  isotropic  conversion  loss  of  5.5  dB. 

3.  TWO-WAY  COMMUNICATION 
SYSTEM 

Two  transceivers  described  in  the  previous 
section  were  used  to  build  a  two-way 
communication  system.  The  arrangement  is 
shown  in  Figure  5  [4].  The  receive  pattern  is 
perpendicular  to  the  transmit  pattern  for  LO  to 
RF  isolation.  The  maximum  distance  that  the 
two  transceivers  can  be  separated  was  found  to 
be  4.8  km  for  a  bandwidth  of  6  MHz. 
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Figure  1.  A  Gunn  diode  integrated  with 
an  inverted  stripline  antenna 


fm' 

wWi 

hfM 

» 
Imi 

y^i 

hs. 
» 


Figure  2.  An  FET  device  integrated  with 
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Figure  5. 


A  LIQUID-CRYSTAL  OPTOELECTRONIC 
PHASED  ARRAY  BEAM  FORMING  NETWORK 

Osamu  KOBAYASHI  and  Hiroyo  OGAWA 

NTT  Wireless  Systems  Laboratories 
1-2356  Take,  Yokosuka,  Kanagawa  238-03  JAPAN 

Phone:  +81  (468)  59  3352  Fax:  +81  (468)  59  3351 


ABSTRACT 

An  optoelectronic  beam  forming  network  (BFN)  is 
presented  for  a  3-element  phased  array  antenna  that 
utilizes  electrically  controllable  birefringence  mode 
nematic  liquid-crystal  cells  (ECB  mode  NLC  cells)  for 
phase  shifting  and  amplitude  control.  In  the  circuit,  a 
microwave  signal  is  carried  by  a  pair  of  orthogonal 
linearly  polarized  lightwaves  (signal  and  reference 
lightwaves)  using  the  optical  heterodyning  technique.  A 
prototype  circuit  is  built  and  its  performance 
experimentally  examined.  With  small  deviations  among 
the  three  cells,  phase  shifts  of  up  to  240  degrees  are 
achieved  for  MW  signals  from  0.9  GHz  to  20  GHz  with 
good  stability;  attenuation  of  more  than  18  dB  was 
achieved. 


1.  INTRODUCTION 

We  are  planing  to  realize  the  advanced  personal 
communication  service  by  using  the  multiple  beam 
geostationary  communication  satellite  system  [1].  It 
offers  direct  links  between  the  low-cost  customer- 
premises  earth  stations/handy  terminals  and  the 
geostationary  satellite  as  depicts  in  Fig.  1 .  For  realization 
of  that  service,  the  satellite  must  offer  large  effective 
isotropic  radiated  power  (e.  i.  r.  p.).  Large  deployable 
reflector  antennas  with  diameters  exceeding  ten  meters 
[2]  fed  by  active  phased  array  antennas  will  be  adopted 
as  the  onboard  antennas. 

Active  phased  array  antenna  systems  are  suitable 
and  will  be  widely  used  not  only  for  satellite  onboard 
antenna  systems,  but  also  for  microwave  (MW)/ 
millimeter-wave  (MMW)  systems  because  of  the 
following  advantages.  First,  radiated  MW/MMW  power 
level  can  be  increased  by  the  spatial  power  combining 
technique  (simply  increasing  the  number  of  antenna 
elements).  The  spatial  power  combining  technique  is 
inevitable  in  MMW  transmitting  systems  where  efficient, 
high-power  active  devices  are  hard  to  obtain  by  existing 
technologies,  as  well  as  communication  satellite  onboard 
antenna  systems  which  must  offer  very  large  e.  i.  r.  p. 
Second,  antenna  beam  shaping/steering  can  be  achieved 
by  changing  the  amplitude  and  phase  distributions  of  the 
antenna  array  elements.  Adaptive  antenna  beam  shaping/ 
steering  is  regarded  as  a  promising  countermeasure  to 
reduce  unnecessary  and  undesired  electromagnetic 
interference. 

Unfortunately,  the  configuration  of  the  BFNs 
becomes  very  complex  for  large  scale,  multiple  beam 
phased  array  antenna  systems.  The  number  of  the  input/ 
output  ports  of  the  BFNs  must  increase  to  achieve  large 
e.  i.  r.  p.  The  number  of  the  output  ports  will  increase 


Fig.  1  Multibeam  geostationary  communication  satellite 

because  the  BFN  must  fed  MW/MMW  signals  to  a  large 
number  of  active  elements.  As  the  diameter  of  the 
reflector  antenna  increases,  antenna  spot  beam  size 
reduces,  and  the  required  number  of  antenna  beams 
increases  to  cover  the  main  islands  of  Japan  and  its 
archipelago.  It  will  increase  the  number  of  the  input  ports. 
Furthermore,  it  requires  a  significant  number  of  phase 
shifters  and  attenuators,  and  the  level  of  interconnection 
between  these  components  will  be  very  complex. 

The  approach  that  utilizes  photonic  technologies 
is  promising  to  build  large  scale  phased  array  antenna 
BFNs  due  to  the  massively  parallel  signal  processing 
capability  of  optical  systems  [3].  Various  optoelectronic 
phased  array  beam  forming  networks  have  been  proposed 
and  studied  [4].  The  BFNs  that  utilize  Fourier  optics  are 
reported  in  [5]  and  [6].  The  BFN  that  gives  true  time 
delay  for  antenna  elements  is  presented  in  [7].  Twisted 
nematic  liquid-crystals  were  used  to  switch  the  optical 
path  to  give  different  time  delay.  Experiments  on  the 
phase  shifting  of  a  single  radio  frequency  (RF)  signal  by 
using  a  liquid-crystal  cell  were  reported  in  [8]. 
Unfortunately,  the  processed  signal  was  limited  to  the 
RF  band  by  the  restriction  of  the  acoustic-optical 
modulator  and  laser  source  (He-Ne  laser  of  633  nm)  used 
in  the  circuit. 

The  authors  have  developed  a  BFN  for  a  3-element 
phased  array  antenna  that  deals  with  MW  signals.  In  the 
circuit,  a  microwave  signal  is  carried  by  a  pair  of 
orthogonal  linearly  polarized  lightwaves  using  the  optical 
heterodyning  technique.  ECB  mode  NLC  cells  are  used 
to  control  the  phase  and  amplitude  of  the  signals  in  an 
analogue  manner  in  the  optical  domain.  The  phase  shifts 
and  attenuation  of  the  MW  signals  of  0.9  GHz  to  20  GHz 
were  measured.  The  configuration  and  performance  of 
the  circuit  is  described  in  this  paper. 
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2,  ATTENUATION  AND  PHASE  SHIFT  OF 
MICROWAVES  WITH  POLARIZATION 
MULTIPLEX  OPTICAL  HETERODYNING 

We  adopted  ‘polarization  multiplex  optical  heterodyning’ 
to  transmit  and  process  MW  signals  in  the  BFN.  Fig.  2 
shows  the  scheme  that  realizes  amplitude  or  phase  control 
of  a  MW  signal  with  this  technique.  The  MW  signal 

cos  co^t  is  fed  to  the  E/0  converter,  and  the  optical  signal 

cos(m^p, +  £0^)f  is  obtained.  Here,  co^  and  denote 

the  angular  frequencies  of  the  input  MW  signal  and  the 
laser  source,  respectively.  The  E/0  conversion  scheme 
is  optical  frequency  shifting.  Therefore,  the  output  optical 
signal  of  the  E/0  converter  has  the  same  spectrum  as  the 
input  MW  signal,  but  the  frequency  is  different.  The  E/ 

O  converter  also  provides  the  optical  signal  coso)^^/. 
The  signal  lightwave  cos((0„p,  +  and  the  reference 

lightwave  cos  are  assigned  to  the  orthogonal  linear 

polarizations  and  combined  for  transmission  through  the 
BFN  circuit. 

The  lightwaves  are  input  to  a  homogeneous  type 
ECB  mode  nematic  liquid-crystal  cell.  The  LC  cell  is 
made  by  filling  liquid-crystals  between  two  parallel  glass 
plates  on  which  transparent  electrodes  are  formed. 
Nematic  liquid-crystal  molecules  lay  parallel  to  the  glass 
plate  when  no  voltage  is  supplied,  and  the  direction  of 
the  major  axes  are  the  same  throughout  the  liquid-crystal 
layer.  An  ECB  mode  NLC  cell  with  local  Cartesian 
coordinate  xyz  is  illustrated  in  Fig.  2.  The  glass  plates 
are  placed  in  the  plane  parallel  to  the  xy  plane,  and  the 
direction  of  the  major-axes  of  liquid-crystals  become 
parallel  to  the  y  axis.  Lightwaves  propagate  along  the  z 
axis.  The  liquid-crystal  molecules  begin  to  raise  respect 
to  the  glass  plate  as  the  voltage  applied  to  the  cell 
increases.  The  refractive  indices  of  an  ECB  mode  NLC 
cell  for  the  linear  polarization  parallel  to  the  x  and  y  axes 
are  written  as  follows. 


Here,  and  n^.  denote  the  refractive  indices  for  the 
respective  polarizations,  is  the  refractive  index  for 
the  polarization  parallel  to  the  major  axes  of  the  liquid- 


liquid-crystal  cell 


Fig.  2  Polarization  Multiplex  Optical  Heterodyning 


crystal  molecules,  and  is  for  the  polarization  parallel 

to  the  minor  axes.  0^  is  the  declination  angle  of  the  liquid- 
crystal  molecules  with  the  respect  to  the  glass  plate. 

The  ECB  mode  NLC  cell  is  placed  on  the  global 
Cartesian  coordinate  XYZ.  We  set  to  the  z-axis  to  match 
the  Z-axis,  and  the  cell  is  rotated  by  e  around  the  Z- 
axis.  Behind  the  cell,  an  analyzer  is  placed  so  that  it 
passes  the  tilted  linear  polarization  component  of  6^ ,  In 
that  configuration,  the  complex  amplitude  of  the  output 
lightwaves  for  the  polarization  parallel  to  the  X  and  Y 
axes  are  respectively  written  as  follows. 

Ef,  =  [cos'  6  +  sin^  9  exp{-jkAn  r/)]cos  0^, 

-i-[sin  9  cos  0  {]  -  exp(“j/:Aw  J)}]sin  9^ 

Ejf  =  [sin  9  cos  9  cxp{-jkAn  r/)}]  cos 

(3) 

+[sin^  0  +  cos'  9  e\p{-jkAnd)\sin  9^, 

Here,  superscripts  X  and  Y  denote  the  input  polarizations, 
and  subscript  //  denotes  the  output  component  of  the 
analyzer. 

The  microwave  signal  is  retrieved  as  the  beat  of 
the  two  components.  The  amplitude  and  phase  changes 

as  kAnd  ,  i.e.  the  LC  cell  applied  voltage,  is  changed. 
When  9  =  ±k/  A  and  0^,  =  0,  ;r  /  2 ,  the  circuit  works  as 
an  MW  signal  attenuator.  When  0  =  0,  ;r/ 2  and 
0^  =  ±;r/4,  only  the  phase  changes,  and  the  circuit 
works  as  a  phase  shifter.  The  advantages  of  polarization 
multiplex  optical  heterodyning  are  as  follows: 

(i)  The  phased  array  beam  forming  network  requires  many 

optical  interferometers  for  phase  shifting.  Because 
an  interferometer  is  formed  on  a  single  signal  path, 
the  complexity  of  the  optical  circuit  is  reduced. 

(ii)  Because  signal  and  reference  lightwaves  propagate 
over  the  same  path,  the  phase  stability  of  the  output 
MW  signal  is  very  good. 

(iii)  Because  the  reference  lightwave  is  provided  with 
the  signal  lightwave,  the  0/E  conversion  block 
becomes  very  simple.  Only  a  photodiode  and  an  MW 
amplifier  are  required. 


3.  THE  EXPERIMENTAL  SETUP 

The  optoelectronic  phased  array  beam  forming  network 
built  for  the  experiments  is  illustrated  in  Fig.  3.  Two 

laser  diode  pumped  Nd:YAG  lasers  of  1.32  //m  were 
used  for  the  transmission  of  MW  signals  in  the  circuit. 
Lasers  were  single  mode  and  single  frequency  type,  and 
were  controlled  by  a  phase  locked  loop  (PLL)  so  that 
their  optical  frequency  difference  remained  constant  and 
corresponded  to  the  MW  carrier  frequency.  One  laser 
output  was  considered  as  the  signal  lightwave,  and  the 
other  the  reference  lightwave. 

The  outputs  of  the  two  lasers  were  input  to  a  2  by 
4  polarization  maintaining  fiber  (PMF)  coupler  tree.  The 
polarization  states  of  the  two  input  lightwaves  were  set 
to  be  linearly  orthogonal,  i.e.,  the  polarization  plane  of 
the  signal  lightwave  is  parallel  to  one  of  the  two  optical 
axes  of  the  PMF;  that  of  the  reference  lightwave  is  parallel 
to  the  other  optical  axis.  The  signal  and  reference 
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lightwaves  are  combined  and  divided  into  two,  to  be 
maintained  the  polarization  states  by  the  first  PMF 
coupler.  Further  division  is  performed  by  the  PMF 
couplers  of  the  next  stage.  The  signal  and  the  reference 
lightwaves  propagate  the  same  path  through  the  circuit 
but  with  orthogonal  polarization  state. 

The  outputs  of  the  PMF  coupler  tree  were  input 
to  a  pair  of  collimator  arrays.  A  1  by  3  ECB  mode  NLC 
cell  array  was  installed  between  the  pair  of  collimator 
arrays.  The  LC  cells  were  set  so  that  the  direction  of  the 
major-axes  of  the  LC  molecules  when  no  voltage  was 
supplied  was  vertical.  In  that  configuration,  only  the 
refractive  index  for  y-polarization  (vertical  polarization) 
is  changed  as  the  LC  molecules  incline  from  the  vertical, 
while  the  refractive  index  for  x-pol.  (horizontal 
polarization)  remains  constant.  Therefore,  we  can  set 
the  phase  difference  between  the  signal  and  the  reference 
light  using  the  ECB  mode  NLC.  The  amount  of  the  phase 
shift  is  determined  by  the  voltage  applied  to  the  LC  cell. 
The  two  orthogonal  components  of  the  outputs  of  the  LC 
cells  were  combined  by  an  analyzer  whose  direction  was 
set  to  45  degrees.  For  the  attenuation  experiments,  the 
LC  cells  were  rotated  by  45  degrees,  and  the  analyzer 
was  set  so  that  y-pol.  (or  x-pol.)  lightwave  passed  through 
it. 

The  combined  lightwaves  were  input  to  the 
photodiodes  (PDs)  to  retrieve  the  MW  signals  as  the  beats 
of  the  signal  and  reference  lightwaves.  The  microwave 
signals  preserved  the  results  of  phase  shifts  and  amplitude 
change  made  in  the  optical  domain.  The  microwave 
signals  output  by  the  receivers  were  input  to  a  vector 
network  analyzer  in  order  to  measure  amplitude  and  phase 
difference. 


4.  EXPERIMENTAL  RESULTS  AND  DISCUSSIONS 

We  measured  phase  shifts  and  amplitude  change  of  the 
MW  signals  for  the  three  signal  paths  as  functions  of  the 
LC  cell  applied  voltages.  Two  cases  were  examined;  the 
case  when  LCs  were  used  for  phase  shifts,  and  the  case 
when  LCs  were  used  for  amplitude  control.  The  results 
of  phase  control  experiments  of  the  MW  signals  of  0.9 
GHz  are  illustrated  in  Fig.  4(a),  and  those  of  the  amplitude 
control  experiments  are  illustrated  in  Fig.  4(b).  Fig.  4(a) 
shows  that  phase  shifts  of  about  240  degrees  were 
observed  for  the  three  cells.  It  is  also  observed  that 
amplitudes  remained  constant.  The  deviations  among 
the  three  cells  were  small.  Note  that  the  identical  PD 
and  the  MW  amplifier  were  used  when  measuring  the 
amplitude  and  phase  of  the  MW  signals.  From  Fig.  4(b), 
it  is  observed  the  amplitude  changed  with  the  LC  cell 
applied  voltages.  More  than  30  dB  attenuation  was 
achieved  for  the  best,  and  more  than  18  dB  attenuation 
was  possible  even  in  the  worst  case.  The  phase  shifts 
changed  with  the  MW  signals.  Phase  shifts  of  k  were 
observed  at  the  maximum  attenuation  level. 

Phase  control  experiments  of  MW  signals  of  0.9 
GHz,  5  GHz,  and  20  GHz  were  performed.  Results  for 
the  same  cell  are  illustrated  in  Fig.  5.  Phase  shifts  of 
about  240  degrees  were  achieved  in  all  cases.  The  results 
were  independent  of  MW  signal  frequency.  Small 
fluctuations  in  amplitude  were  observed  as  the  LC  applied 
voltages  increased.  It  was  about  ±  0.9  dB  for  the  worst 
case.  The  fluctuation  may  be  improved  by  more  precisely 


aligning  the  orientation  of  the  LC  cells  and  polarization 
planes  of  the  lightwaves.  Wideband  phase  controllability 
of  the  circuit  was  confirmed  from  Fig.  5. 

Fig.  6  shows  the  phase  stability  of  the  output  MW 
signal  of  0.9  GHz.  Because  the  signal  and  reference 
lightwaves  propagate  through  the  same  path,  disturbances 
that  commonly  affected  the  two  lightwaves  were 
suppressed,  and  excellent  phase  stability  of  the  beat  signal 
was  achieved.  The  fluctuation  was  about  a  degree.  This 
includes  the  phase  fluctuation  of  the  laser  sources,  the 
fluctuation  of  the  orientations  of  LC  molecules,  and  the 
thermal  instability  of  PMF  length. 

5.  CONCLUSION 

A  3-element  optoelectronic  phased  array  antenna  BFN 
that  utilizes  ECB  mode  NLC  cells  for  both  phase  shifting 
and  amplitude  control  was  presented.  A  microwave 
signal  is  carried  by  a  pair  of  orthogonal  linearly  polarized 
lightwaves  using  the  optical  heterodyning  technique. 
Therefore,  the  configuration  of  the  optical  circuit  is  very 
simple,  and  the  phase  stability  of  the  microwave  signals 
was  improved.  Phase  shifts  of  up  to  240  degrees  were 
achieved  for  MW  signals  of  0.9  GHz,  5  GHz  and  20  GHz 
with  good  phase  stability.  Attenuation  of  more  than  18 
dB  was  achieved.  Deviations  among  the  three  cells  was 
small.  The  described  system  can  be  expanded  to  2D 
phased  array  beam  forming  network  by  using  a  2D  array 
of  LC  cells,  wherein  the  parallel  signal  processing 
capability  of  the  optical  signal  processing  technique  can 
be  fully  utilized. 
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.  4(a)  Phase  shift  as  a  function  of  the  liquid-crystal  Fig.  4(b)  RF  attenuation  as  a  function  of  the  liquid-crystal 
cell  applied  voltage  at  0.9  GHz  cell  applied  voltage  at  0.9  GHz 
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Fig.  5  Phase  shift  of  MW  signals  of  0.9  GHz,  5  GHz,  and  Fig.  6  Phase  stability  of  the  output  MW  signal  of  0.9 
20  GHz  GHz 
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ABSTRACT 

A  dielectric  leaky-wave  antenna  with  a 
new  perturbation  scheme  is  developed.  The 
attenuation  constant  with  different  gap  width 
between  metal  patches  is  determined 
experimentally.  The  dielectric  waveguide  used 
in  this  paper  is  an  anisotropic  Sapphire  rod. 
The  conventional  effective  dielectric  constant 
(EDC)  method  is  modified  to  calculate  the 
unperturbed  propagation  constant  and  then  used 
to  predict  the  main  beam  position. 
Experimental  data  agree  quite  well  with  that  of 
simulation. 

I.  INTRODUCTION 

Dielectric  antennas  provide  superior 
properties  in  the  low  guided- wave  loss,  simple 
design,  and  light  weight  in  the  millimeter- wave 
frequency  range.  A  dielectric  leaky-wave 
antenna  also  possesses  the  advantage  of 
electronic  scanning  of  the  main  beam  by  simply 
varying  the  operating  frequency.  Directivity  or 
beamwidth  of  the  leaky-wave  antenna  can  be 
easily  controlled  by  properly  designing  the 
antenna  length. 

The  complex  propagation  constant  of  a 
uniform  leaky-wave  antenna  with  a  finite 
transverse  substrate  was  analyzed  by  Mittra  and 
Kastner  based  on  the  spectral  domain  approach 

[1] .  Radiation  characteristics  of  antennas  with 
perturbation  strips  on  either  the  broad  or 
narrow  side  of  the  dielectric  rod  were  studied 

[2] .  The  directivity  at  the  transverse  plane  of  an 
antenna  was  improved  using  a  pair  of  metal 
flares  along  the  longitudinal  direction  [3].  A 
series  of  studies  based  on  a  waveguide  model 
for  analyzing  several  dielectric  leaky-wave 
structures  was  published  by  Ghomi  et  al  [4]- 


[7].  The  Ritz-Galerkin  variational  approach 
was  applied  to  analyze  an  inset  dielectric 
waveguide  antenna  [8].  A  theory  based  on  a 
novel  multimode  transverse  equivalent  network 
was  proposed  by  Guglielmi  and  Oliner  to 
calculate  the  dispersion  behavior  of  a  metal- 
strip-loaded  image  leaky-wave  antenna  [9]. 
Moreover,  mode-matching  and  point-matching 
techniques  were  applied  to  the  analysis  of 
metal-strip-loaded  dielectric  antennas  [10]. 

It  is  noticed  that  metal  strips  or  patches 
of  all  pervious  references  were  deposited  on  the 
dielectric  rod  directly.  These  designs  have  the 
advantage  of  the  most  antenna  elements  in  the 
transverse  direction  if  a  dense  planar  array  is 
needed.  For  applications  of  a  very  narrow 
beamwidth,  the  dielectric  antenna  is  very  long 
electrically.  The  attenuation  constant  of  above 
directly  metal-deposited  techniques  is, 
however,  too  high  for  such  a  long  antenna 
design.  Furthermore,  the  fabrication  process 
also  becomes  very  difficult  for  a  physically 
long  antenna.  In  this  paper,  we  use  the  field 
coupling  mechanism  as  shown  in  Fig.  1  to 
perturb  the  guided  surface  wave.  Then  an 
appropriate  attenuation  constant  can  be  obtained 

for  the  design  of  a  nearly  300  Xq  long  leaky- 
wave  antenna. 

II.  ANTENNA  DESIGN 

Fig.  1  depicts  a  dielectric  leaky-wave 
antenna  with  the  proposed  perturbation  scheme 
which  is  suitable  for  a  very  long  antenna 
design.  The  metal  patches  are  placed  on  both 
sides  of  a  high  dielectric  constant  rod  so  that  a 
proper  attenuation  constant  can  be  obtained. 
The  dielectric  waveguide  used  here  is  a  low- 
loss  Sapphire  rod  with  an  anisotropic  dielectric 
constant  tensor. 
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r  9.3  0  O' 

[e]  =  eo  0  9.3  0 

-  0  0  11.5. 


(1) 


(3) 


where  e©  is  the  free  space  permittivity.  The 
cross  dimension  of  the  Sapphire  rod  is  chosen 
so  that  only  the  dominant  mode  exists  in  the 
operating  frequency  range. 

Due  to  its  anisotropy,  a  rigorous  full- 
wave  analysis  is  necessary  to  accurately 
calculate  the  propagation  constant  of  Fig.  1. 
However  the  conventional  effective  dielectric 
constant  ^DC)  method  can  be  modified  to 
quickly  give  us  the  approximated  but  good 
enough  data.  In  the  formulation  of  the  EDC 
along  the  y-direction  for  a  multilayered  slab 
waveguide  with  an  anisotropic  layer,  we 
assume  that  only  TM  modes  propagate  and 
there  is  no  x  variation.  The  field  is 
concentrated  in  the  Sapphire  region.  Derived 
from  Maxwell's  equations  and  compared  with 
the  isotropic  case,  one  can  have  the  anisotropic 
propagation  constant  as, 

p»i = v?-  ® 

V 

where  Pig©  is  calculated  from  the  conventional 
EDC  method  by  replacing  the  Sapphire  layer 

with  a  dielectric  constant  isotropic  material. 
The  conventional  EDC  method  is  then  used  to 
formulate  the  eigenvalue  equation  along  the  x- 
direction  since  there  is  no  Ex  component  in  our 
assumption.  Fig.  2  shows  dispersion  curves  of 
Fig.  1  without  perturbation  metal  patches  and 
with  dimensions:  w  =  24  mil,  t  =  23.7  mil,  and 
h  =  10  mil.  It  can  be  seen  that  only  the 
dominant  mode  exists  for  frequencies  less  than 
100  GHz.  The  phase  constants  of  Fig.  1  with 
and  without  metal  patches  are  assumed  to  be  the 
same  since  the  leaky  constant  is  usually  very 
small  in  the  leaky-wave  antenna  design. 
Therefore,  one  can  use  the  unperturbed  data  to 
predict  some  important  antenna  parameters. 
For  example,  the  main  beam  position  of  the  - 1 
space  harmonic  wave  is  simply  decided  by 
changing  the  perturbation  period  d  according  to 
the  following  equation. 


where  0  is  the  angle  with  respect  to  the  broad 

side  direction,  p/kq  is  the  normalized  phase 
constant  of  the  dominant  mode. 

The  attenuation  constant  of  Fig.  1  is 
studied  experimentally  by  taking  the  ratio  of 
IS21I  with  and  without  unifomaly  perturbed 
patcheSj  respectively.  The  metal  patches  are 
fabricated  on  a  custom-made  10- mil  thick 
Teflon  sheet.  Each  metal  patch  has  the 
dimension:  20  mil  x  50  mil.  The  parameter 
used  to  control  radiation  is  the  gap  width  g 
between  a  pair  of  patches.  The  measured 
attenuation  constant  versus  gap  width  is  shown 
in  Figs.  3  and  4  for  two  different  frequencies. 
Solid  lines  are  measured  data  and  dash  lines  are 
curve-fitting  results.  Based  on  these  curves,  a 
new  tapering  method  by  changing  the  gap 
width  between  patches  is  employed  and  a  36- 
inch  long  antenna  with  linearly  tapered  gap 
width  is  designed.  The  initial  gap  width  is  62 
mil  and  the  final  one  is  40  mil. 

m.  ANTENNA  PERFORMANCE 

The  main  beam  positions  of  two  leaky- 
wave  antennas  with  different  perturbation 
periods,  80-mil  and  50-mil,  are  assembled  and 
measured.  Fig.  5  depicts  measured  and 
calculated  antenna  performance.  Although  the 
modified  EDC  method  is  an  approximated 
method,  this  efficient  technique  can  predict  the 
antenna  performance  quite  well.  The  deviation 
between  theoretical  and  measured  data  is  about 
5°  for  the  80-mil  period  antenna  and  is  3°  for 
the  50-mil  one.  The  measured  scanning  range 
from  91  to  97  GHz  is  about  16.5*  and  24.5"  for 
the  80-mil  and  50-mil  perturbation  periods, 
respectively. 


IV.  CONCLUSION 

A  millimeter-wave  leaky-wave  antenna 
with  an  anisotropic  Sapphire  rod  and  a  new 
perturbation  scheme  is  analyzed  and  designed. 
The  conventional  EDC  method  is  modified  to 
include  the  anisotropy  of  the  Sapphire  dielectric 
waveguide.  The  calculated  main  beam  position 
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and  the  scan  range  agree  well  with  measured 
data. 
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Sapphire  waveguide 


Fig.  1  A  dielectric  leaky-wave  antenna,  (a)  S 


Fig.  2  Dispersion  curves  of  Fig.  1  without  perturbation 
metal  patches. 


*  *  Curve  Ri 


Fig.  4  Attenuation  constant  versus  gap  width 
at  96  GHz. 


Main  beam  position  (Degree) 


Fig.  3  Attenuation  constant  versus  gap  width 
at  90  GHz. 


Fig.  5  Measured  and  calculated  main  beam  position 
of  antennas  with  different  pertubation  periods. 
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Abstract 

The  demand  for  mobile  and  personal  communications 
systems  is  expected  to  produce  a  huge  market  for  handy 
phone  sets  and  their  MMICs  in  coming  years.  A  number  of 
ICs  in  silicon  bipolar,  silicon  Bi-CMOS,  GaAs  MESFET, 
HBT,  and  HEMT  are  now  becoming  available  for  system 
application.  There  are  tradeoffs  among  the  competing 
technologies  with  regard  to  performance,  cost,  reliability, 
and  time-to-market.  This  paper  describes  process  selection 
and  requirements  of  cost  and  RF  performances  to  MMICs 
for  digital  cellular  and  cordless  telephones.  Furthermore, 
new  circuit  techniques  which  were  developed  by  NTT  are 
presented. 

1.  Introduction 

Many  cellular  and  cordless  systems  are  presently  being 
planned  and  operated  in  the  world  Trends  for  system 
standards  are  moving  towards  a  higher  frequency  and  from 
analog  to  digital  modulation  techniques.  Cordless  systems 
are  being  planned  as  the  next  generation  systems  of  personal 
communications.  Such  systems  are  called  the  Personal 
Hancfy  phone  System,  or  PHS  in  Japan,  the  Personal 
Communication  Service,  or  PCS  in  the  US,  and  the  Digital 
European  Cordless  Telecommunication,  or  DECT  in  Europe. 
The  concept  of  the  cordless  system  is  to  combine  mobile 
communication  and  public  telephones,  where  hancfy  phone 
sets  can  be  used  in  the  home  as  cordless  telephones  and  in 
town  as  cellular  telephones.  Since  the  base  stations  of 
cordless  systems  cover  a  very  small  area,  only  about  200 
meters  in  diameter,  the  systems  are  called  microcellular 
systems. 

The  1.9  GHz-band  Jc^anese  PHS  employs  7r/4-shift 
QPSK  digital  modjlation.  Time  Division  Multiple  Access 
(TDMA),  and  the  Time  Division  Duplex  (TDD)  scheme. 
The  bit  rate  and  the  channel  spacing  are  set  at  384  kbps  and 
300  kHz,  respectively. 

Han(fy  phone  sets  should  be  small,  light,  and  provide 
extended  communication  service  time.  In  the  PHS,  talking¬ 
time  of  over  8  hours  and  stancfcy-time  of  one  week  are 


expected  A  typical  block  diagram  of  the  PHS  han<^  phone 
set  is  shown  in  Fig.  1.  The  RF  unit  consists  of  a  single 
conversion  transmitter  and  a  double  conversion  receiver.  The 
first  and  second  intermediate  frequencies  are  1.66  GHz  band 
and  240  MHz  band  respectively.  Since  the  transmission 
frequency  and  receiving  frequency  are  the  same  in  the  PHS, 
a  T/R  switch  is  required. 

2.  Process  Selection  for  MMICs 
In  selecting  PHS  component  fabrication  processes,  there 
are  tradeoffs  to  consider  with  regard  to  performance,  cost, 
reliability,  and  time-to-market.  According  to  the  present 
market  trend  price  is  the  most  important  factor  in 
employing  components.  As  a  result,  manufacturers  tend  to 
employ  a  silicon  process,  as  long  as  silicon  circuits  can 
provide  acceptable  performance.  In  the  RF  unit,  GaAs  is 
used  in  the  power  amplifier,  T/R  switch,  and  local  oscillator 
switch,  while  silicon  is  used  in  all  the  other  circuits,  i.e., 
mixers,  QPSK  mockilators,  PLLs,  IF  ICs,  and  so  on.  In  the 
baseband  unit,  all  circuits  are  silicon-based. 

3.  MMIC  Requirements 
3.1  Cost  and  price  requirements 

MMICs  for  the  PHS  are  commercially  available  now 
from  various  Japanese,  American,  and  European 
manufacturers,  for  example,  Matsushita,  NEC,  Fujitsu, 
Mitsubishi,  and  Sony  in  Japan,  National  Semiconductor, 
Pacific  communication.  Analog  Devices,  AT&T,  and  HP  in 
the  US,  and  Philips  in  Europe.  The  present  market  prices 
are  very  low;  in  the  neighborhood  of  1  to  8  dollars.  Just 
last  year,  the  market  prices  were  4  to  15  dollars.  With  the 
advance  of  mass  production,  our  guess  is  that  market  prices 
will  be  rediced  to  from  1  to  5$.  For  example,  the  price  of 
PHS  power  amplifiers  was  around  15  $  in  1994,  but  it  has 
dropped  to  about  5$.  For  each  RF  IC,  only  one  or  two 
manufacturers  can  provide  satisfactory  service  in  this  severe 
cost  race.  This  is  because  orders  tend  to  be  concentrated  to 
specific  manufacturers  who  supply  lower-priced  or  special- 
performance  devices. 
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3*2  RF  Performance  Requirements 

Digital  systems  require  special  performance  from 
microwave  components.  A  transmitter  unit  for  digital 
systems  such  as  the  PHS  requires  low  adjacent  channel 
interference  and  low  modulation  vector  error.  The  ideal 
output  spectrum  of  71/ 4-shift  QPSK  modulation  has  a 
narrow  occupied  bandwidth  of  about  200  kHz.  However,  the 
output  spectmm  of  a  transmitter  spreads  out  because  of 
microwave  component  distortion,  as  shown  in  Fig.  2.  Here, 
the  PHS  specifications  require  an  adjacent  channel 
interference  level  of  better  than  -50  dBc  at  a  192-kHz 
bandwidth  at  a  600-kHz  off-carrier,  and  modulation  vector 
error  of  better  than  12.5%rms.  Phase  noise  from  VCOs,  and 
phase  and  amplitude  distortion  from  a  mixer,  driver 
amplifier,  power  amplifier,  and  T/R  switch  degrade  both 
adjacent  channel  interference  and  modulation  vector  error. 
Therefore,  the  transmitter  components  are  required  to  have 
sufficient  linearity.  This  linearity  requirement  strongly 
limits  the  low-power  consumption  designs  of  microwave 
components.  For  example,  power-added  efficiency  of  a 
commercially  available  power  amplifier  is  normally  20%. 
These  requirements  differ  from  those  of  analog  cellular 
systems. 

4,  Example  of  MMICs 

NTT  has  developed  new  RF  circuit  techniques  for  the 
PHS,  which  overcome  the  distortion  problems  that  occur 
with  low  supply  voltages  such  as  2  V.  For  example,  several 
MMICs  will  be  presented. 

4.1  Low  noise  amplifier  (LNA)  [1] 

High  performance  LNAs  can  be  constmcted  by  suitably 
combining  enhancement-mode  FETs,  or  EFETs,  with 
depletion-mode  FETs,  or  DFETs.  An  EFET  is  characterized 
by  low  noise  and  high  gain  under  a  low  operation  current. 
However,  a  DFET  is  characterized  by  low  distortion. 
Therefore,  a  series  connection  of  the  two  types  of  FETs 
provides  low  noise,  high  gain,  and  low  distortion 
performance  under  low-voltage  and  low-current  conditions. 
The  developed  LNA  is  shown  in  Fig.  3.  It  has  a  noise 
figure  of  2.0  dB  and  a  gain  of  12  dB  at  1.9  GHz  using  a 
drain  voltage  of  2  V  and  a  drain  current  of  1  mA. 

4.2  Power  amplifier  [2] 

For  low  voltage  and  high  efficiency  operation,  we  tried 
to  reduce  the  phase  cfistortion.  It  was  discovered  that,  as  the 
input  power  increases,  the  phase  of  the  common  source 
FET  advances,  but  that  of  the  common  gate  FET  is  delayed 


This  led  us  to  believe  that  a  combination  of  these  FETs 
could  compensate  for  the  phase  distortion.  Indeed,  this  is  the 
case,  the  Cascode  FET,  which  is  a  cascode  connection  of  a 
common  source  FET  and  common  gate  FET,  thoroughly 
suppresses  the  phase  deviation.  Thanks  to  the  phase 
distortion  compensation,  the  limit  of  the  efficiency  for  the 
new  power  amplifier  increases  up  to  50%  at  3  V  operation. 
The  new  linear  power  amplifier  is  shown  in  Fig.  4.  This 
amplifier  has  excellent  performance  with  a  power-added 
efficiency  of  45%,  an  output  power  of  20.5  dBm,  and  a  gain 
of  30  dB  in  spite  of  a  low  supply  voltage  of  2.4  V.  At  a 
supply  voltage  of  3  V,  the  performance  of  the  PA  increases. 

It  achieves  record  performance  with  a  power  added  efficiency 
of  50%,  an  output  power  of  21.5  dBm,  and  a  gain  of  31.5 
dB.  The  chip  size  of  the  PA  is  2.0  x  1 .5  mm. 

4.3  Quadrature  modulators  [3] 

The  factor  preventing  low  voltage  operation  for 
modulators  is  the  use  of  Gilbert  Cell  mixers.  The  supply 
voltage  of  less  than  3  V  of  the  conventional  Gilbert  Cell 
mixer  degrades  its  linearity  and  modulation  accuracy, 
because  the  Gilbert  Cell  mixer  employs  three  series- 
connected  transistors  and  each  transistor  requires  a  supply 
voltage  of  more  than  1  V,  as  shown  in  Fig.  5(a).  Figure 
5(b)  shows  our  new  Gilbert  Cell  mixer  which  can  operate  at 
a  supply  voltage  of  2  V.  The  essence  for  low  voltage 
operation  lies  in  reducing  the  number  of  series-connected 
transistors.  Two  cross-coupled  single-balanced  mixers 
consisting  of  differential  pairs  form  a  double-balanced  mixer 
core  similar  to  the  Gilbert  cell.  A  local  signal  is  ^plied  to 
the  differential  pairs  in  the  Gilbert  Cell  core.  Each  baseband 
voltage  signal  is  converted  into  a  differential  current  signal 
using  the  current  mirror.  The  sum  of  the  single-balanced 
mixer  tail  currents  is  held  constant  because  the  differential 
current  mirror  has  current  sources.  Thus,  low-distortion 
mixing  is  obtained  Since  the  new  double-balanced  mixer 
includes  only  two  series-connected  transistors  in  each  circuit 
branch,  it  can  operate  at  the  low  voltage  of  2  V.  The  new 
quadrature  modulator  is  shown  in  Fig.  6.  The  active  area  is 

2.4  X  0.7  mm.  At  the  1.9  GHz  specified  for  the  PHS,  the 
vector  error  is  6.9%rms.  The  power  consumption  of  68  mW 
at  2  V  is  about  1/3  of  the  minimum  consumption  of 
previously  reported  modulator  ICs. 

5.  Conclusion 

In  selecting  MMIC  fabrication  processes  for  handy 
phones,  there  are  tradeoffs  to  consider  with  regard  to 
performance,  cost,  reliability,  and  time-to-market. 
According  to  the  present  market  trend,  price  is  the  most 
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important  factor  in  employing  microwave  components.  As 
a  result,  manufacturers  tend  to  employ  a  silicon  process,  as 
long  as  silicon  circuits  can  provide  acceptable  performance. 
Therefore,  GaAs  MMICs  must  distinguish  themselves  by 
their  special  performances. 
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Fig.  3  1 .9  GHz  low  noise  amplifier. 

Fig.  4  1 .9  GHz  power  amplifier. 
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Fig.  5  New  Gilbert  cell  mixer  and  conventional  one. 


Fig.  6  1.9  GHz  quadrature  modulator. 
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ABSTRACT 

The  R/D  activity  on  MMICs  in  Europe  is  presented 
through  three  representative  aspects.  The  first  one 
consists  in  an  overview  of  recent  power  circuits.  The 
second  one  shows  the  developments  in  millimeter-wave 
circuits.  The  third  one  describes  an  activity  which 
becomes  specially  interesting  with  monolithic  circuits: 
the  multifunction  chips. 

1.  INTRODUCTION 

Although  the  activities  in  MMICs  are  still  changing 
in  Europe,  the  main  sources  of  MMICs  available  at 
present  time  or  the  research  centres  working  on  MMICs, 
are  listed  in  Table  1  (F  is  for  foundries  and  R  for  research 
centres): 


Company  or  Institution 

Country 

IMEC(R) 

Belgium 

CNET  (R);  lEMN  (R);  Philips  Microwave 
Limeil  (F);  Thomson-CSF  (R,F) 

France 

Daimler-Benz  (R,F);  F-IAF  (R); 

Siemens  (F) 

Germany 

Alcatel-Telettra  (F);  Alenia  (F) 

Italy 

GEC  Marconi  Materials  Technology  (F), 
DRA  (R),  GMMT  (F) 

UK 

Table.  1 :  Different  sources  of  MMICs  in  Europ 

The  monolithic  circuits  activities  are  also  based  on 
different  research  centres  working  on  epitaxy  or  discrete 
transistors  like  lEMN  (France),  F-IAF  (germany)  and 
DRA  (UK)  or  suppliers  of  epitaxial  products  like 
Picogiga  (France)  or  Epitaxial  Products  International 
(UK).  Several  research  and  development  programmes  from 
European  Community,  from  European  Space  Agency, 
from  european  military  programmes  or  from  the  different 
national  governments  helped  to  develop  these 
technologies  [1]. 

The  seven  major  foundries  are  grouped  in  an  association 
called  EUROGaAs  [2]. 


These  recent  developments  make  possible  the  many 
applications  which  are  emerging  in  telecommunications 
or  in  consumer  applications  [3].  The  state  of  the  art  of 
these  technologies  is  presented  by  the  description  of  three 
aspects  of  these  integrated  circuits  :  the  power  circuits, 
the  millimeter-wave  circuits  and  the  multifunction  chips. 

2.  POWER  AMPLIFIERS 

According  to  the  frequency,  different  kinds  of 
power  technologies  are  available  [4]: 

MESFET  technologies  with  0.7  pm  or  0.5  p.m 
gatelengths  give  power  circuits  at  0.9  or  1.8  GHz  for 
cordless  and  cellular  telecommunications  or  at  5-6  or 
10  or  6-18  GHz  for  phased  array  radars.  With  these  well 
stabilized  technologies,  the  research  efforts  concern  the 
design  of  the  circuits.  It  consists  in  increasing  the  power 
added  efficiency  (PAE)  by  presenting  short  or  open 
circuits  on  2nd  or  3rd  harmonics.  For  instance,  when 
working  in  class  F,  it  is  possible  to  increase  the  PAE  of 
an  X-band  power  amplifier  of  5%  [6]. 

During  last  years,  a  big  effort  has  been  done  in 
Europe  on  HBT  technologies  [9][10][1 1].  Power  HBTs  for 
digit^  mobile  communication  systems  were  developed  by 
CNET  and  Siemens.  As  for  Thomson-CSF,  GEC- 
Marconi,  Defense  Research  Agency,  DASA  and  Daimler- 
Benz  Research,  they  developed  also  high  efficiency  power 
HBTs  but  for  X  band  applications.  A  special  cooperation 
is  currently  developed  between  Thomson-CSF  and 
Daimler-Benz  to  get  fully  compatible  HBT  technologies. 
This  gives  a  GaInP/GaAs  technology  with  a  carbon  doped 
base.  A  basic  cell  for  the  power  device  was  optimized 
with  the  help  of  electrical  and  thermal  modelling.  It  gives 
a  transistor  having  a  cutoff  frequency  of  60  GHz  and  a 
maximum  frequency  of  90-1 10  Ghz. 

Typical  results  of  power  amplifiers  are  shown  in 

table  2. 

When  the  frequency  increases,  another  important 
technology  is  based  on  P-HEMT.  Some  typical  circuits 
obtained  with  a  P-HEMT  power  technology  with  a  gate 
length  of  0.25  pm  or  0.16  pm  are  also  presented  in 
table  2. 


0-7803-2516-8/95  $4.00©  1995  IEEE 


29 


Technology 


Institution 


1.7 

4.8 
5-  6 
9-11 
9-11 

10 

9 

18-23 

35-39 

62-70 

71-77 

70-80 


3  MESFET  38 

2  MESFET 

3  MESFET  30 

2  MESFET  30 

2  MESFET  38 

2  HBT  34 

1  HBT  43 

2  P-HEMT  20 

2  P-HEMT 

2  P-HEMT 

1  P-HEMT 

2  I  P-HEMT  I 

Table  2 :  Examples  of  power  amplifiers 


Siemens  [5] 
Alcatel-Telettra  [7] 
TH-CSF  [8] 
Dassault  +  Siemens 
TH-CSF  [8] 
Daimler-Benz  and 
TH-CSF  [12][28] 
TH-CSF  [13] 


F-IAF  [291 


3.  MILLIMETER  WAVE  CIRCUITS 

Within  the  power  circuit  shown  in  the  proceeding 
section,  some  of  them  work  in  millimeter  wave. 

Above  20  GHz,  several  tens  of  circuits  have  been 
realized  in  different  technologies.  At  these  frequencies, 
three  European  Community  programmes  pushed  the 
research.  In  the  frame  of  AIMS  programme,  technologies 
and  circuits  were  developed  to  work  between  20  and  40 
GHz  while  in  the  frame  of  CLASSIC  programme  for  the 
technology  and  MBS  for  the  study  of  a  mobile  broadband 
system,  circuits  between  40  and  80  GHz  were  developed. 
TTie  main  technologies  used  at  these  frequencies  are: 

•  a  well  stabilized  MESFET  technology  with  a 
0.5  |im  gate  length  which  gives  circuits  like  mixers, 
doublers,  switches,  oscillators  and  buffer  amplifiers  [14], 

•  a  0.25  |im  and  0.15  pm  gate  length  technology 
developed  at  Daimler-Benz,  at  Thomson-CSF,  at  GMMT, 
for  MMICs  and  at  lEMN  for  discrete  components. 


•  a  double  heterostructure  P-HFET  technology 
for  power  circuits  with  a  0.25  pm  gatelength  which  was 
introduced  in  the  preceding  section, 

•  the  same  type  of  technology  was  also 
developed  by  F-IAF  with  a  0.16  pm  gatelength, 

•  the  HBT  technology  presented  in  the  last 
section  for  X  band  power  amplifiers  can  also  be  used  for 
oscillators  thanks  to  its  good  1/F  performance  and  also 
because  of  the  availability  of  a  good  vertical  varactor  diode 
betwen  base  and  collector  electrodes  [1 1][17]. 

Table  3  shows  some  examples  of  distributed  and 
low  noise  amplifiers  realized  with  these  technologies.  At 
these  frequencies,  coplanar  technologies  present  some 
advantages  and  some  of  these  circuits  are  realized  in 
coplanar  lines  [23]. 

Some  research  centres  like  lEMN,  F-IAF  and 
IMEC  work  on  InP  based  (lattice  matched  or  PM) 
technologies.  A  typical  result  is  shown  on  table  3. 


F 

GHz 

Gain 

db 

stage 

numb. 

NF 

dB 

niH 

Institution 

2  -  52 

9 

matrix 

7 

0.25  p  HFET 

F-IAF -t- Alcatel  SEL  [21] 

2  -  28 

13 

distribut. 

6 

0.25  p  HFET 

Dassault  Electronique  +  TH-CSF 

5  -  80 

9 

cascode 

3.5 

0.25  p  HFET 

F-IAF  [16] 

28.6-  38 

13 

2 

3.5 

0.25  p  P-HFET 

Thom-EMI  +  GMMT  [22] 

24  -  30 

16 

3 

4 

0.2  pP-HEET 

Alcatel  -I-  PML  [19] 

28.5 

13.5 

2 

3.2 

0.25  p  HFET 

Daimler-Benz 

34.7 

18 

3 

3 

0.25  p  HFET 

+  TH-CSF  [15] 

62  -  69 

10 

3 

- 

0.15  p  HFET 

TH-CSF 

5  -  50 

11.5 

dual-gate 

- 

0.25  p  HFET  InP 

IMEC  -h  ESAT/TELEMIC  [32] 

30 


Table  4  shows  some  examples  of  oscillators. 


F 

Gtts 

Tuning 

range 

GHz 

Phase 
noise  at 
lOOkHz 
dBc/Hz 

Techno. 

Institution 

30.9 

1 

WM 

HBT 

Daimler-Benz  +  DASA  [20] 

34.5 

1.3 

MESFET 

TH-CSF  [15] 

28.5 

6.5 

-75 

HBT 

TH-CSF  [11] 

Table  4  :  Examples  of  VCOs 


4.  MULTIFUNCTION  CHIPS 

Another  important  aspect  of  the  evolution  of  the 
MMIC  activity  is  the  appearance  of  multifunction  chips. 
This  type  of  integrated  circuits  are  worth  noting  because 
they  result  from  the  association  of  technological  progress 
and  of  design  effort 

Moreover,  they  have  some  very  important 
characteristics: 

•  As  all  the  connections  are  made  on  the  chip 
itself,  they  allow  the  realization  of  complex  circuits 
which  could  not  be  realized  with  a  hybrid  technology. 

•  With  a  mature  technology,  they  bring  an 
important  reduction  in  cost 

As  frequency  is  increasing,  the  interconnection  of  several 
chips  brings  incertainties  which  are  avoided  with  a  single 
chip.  Moreover,  these  interconnections  can  be  taken  into 
account  in  the  simulation  of  the  circuits.  This  fact  is  very 
important  for  millimeter  wave  circuits. 

•  The  multifunction  technique  also  improves  the 
reliability  of  the  final  circuit. 


Circuit 

F  GHz 

Modulator 

1  -  2.8 

4,  16  QAM 

Modulator/Demod 

1.25  -  3 

•  The  design  of  multifunction  circuits  implies 
the  development  of  circuit  cells  which  are  introduced  in  a 
library.  The  performances  of  these  blocks  can  be  changed 
by  the  variation  of  some  parameters. 

Some  examples  of  such  circuits  are  given  in  table  5 
which  shows  several  modulators  or  demodulators,  for 
telecommunications  from  1  to  65  GHz. 

Another  example  is  given  by  a  56.8  GHz  phase  lock  loop 
chip  realized  by  Thomson-CSF  [31].  On  this  chip,  several 
elementary  circuits  are  found:  a  doubler  oscillator,  a  band¬ 
pass  filter,  a  power  divider  and  an  amplifier. 

5.  CONCLUSION 

Some  samples  of  recent  results  obtained  with 
different  MMIC  technologies  available  in  Europe  have 
been  shown.  These  results  implies  a  good  cooperation 
between  technological  and  designers  teams. 

A  common  trend  can  be  observed  for  all  the 
categories  of  circuits  which  have  been  presented:  a 
constant  increasing  in  frequency.  All  the  examples 
showed  that  the  MMIC  technology  is  now  ready  for  the 
many  applications  which  are  emerging,  specially  at 
millimeter  wave  frequencies 


Institution 


/LN  A/Medium 
power  amp./4  PSK 

Vector  modul. 

Modulator/ 
demodulator 
4,  16,  64  QAM 

Modulator 
4,  16,  64  QAM 


5  -  5.6  MESFET  0.7  Jim  1.9  x  2.9 

5.9-  6.8  MESFET0.5Mm  2.7x3.65 

35  -  65  HFET0.2pm  2x4 


PML  [24] 


Alcatel  Standard 
Electrica/GMMT  [30] 


PML  +  TNO  [25] 
TH-CSF  [26] 


ANT  +  F-IAF[27] 


Table  5  :  Examples  of  modulators  /  demodulators 
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ABSTRACT 

The  inlroduclion  of  new  microwave-based  technologies 
such  as  personal  communications  s>stcms  (PCS), 
wireless  local  area  networks  (WLANs)  and  the 
prov  ision  to  the  home  and  ofiicc  of  broadband  services, 
is  focusing  the  attention  of  the  Australian 
communications  industry  on  gallium  arsenide  (GaAs)- 
based  monolithic  microwave  integrated  circuits 
(MMlCs).  In  order  to  provide  a  lower  risk  entry  point 
into  MMIC-based  applications,  the  CSIRO  has 
established  a  design  and  fabrication  capability  at  its 
Division  of  Radiophysics.  This  paper  will  discuss 
millimetre-wave  MMICs  developed  using  this 
capability. 


I,  INTRODUCTION 

A  significant  Australian  GaAs  MMIC  program  is 
being  carried  out  at  the  CSIRO  Division  of 
Radiophysics.  MMIC  design,  fabrication  and  testing 
services  arc  now^  available  to  both  Australian  and 
overseas  companies  for  prototype  development,  and,  in 
the  near  future,  limited  production  capability  will  be 
offered.  In  addition  to  this,  the  Division  has  established 
relationships  with  several  US  foundries  to  provide  a 
convenient  path  to  large-scale  production. 

The  CSIRO  provides  three  MMIC  processes,  0.25 
micron  gate,  pscudomorphic,  high-elcctron-mobility 
transistor  (PHEMT)  providing  amplifier  performance 
to  --70  GHz,  planar  Schottky  diode  (mixer  and 
varactor)  suitable'  for  circuits  operating  to  beyond  100 
GHz.,  and  combined  PHEMT/Schottky  diode.  Principal 
emphasis  thus  far  has  been  on  low-noise,  high- 
frequency  applications  and  establishing  accurate 
models  to  facilitate  the  objective  of  one-pass  design. 

The  paper  will  describe  circuits  that  have  been 
designed  and  fabricated  using  these  processes,  together 
w  ith  their  packaging  for  application  to  millimetre-wave 


PCS  systems  and  wireless  distribution  of  wideband 
customer  services. 


2.  MMIC  FABRICATION 

The  principal  process  offered  by  the  CSIRO  GaAs 
MMIC  fabrication  facility,  the  PHEMT  process,  used 
0.25  micron,  mushroom-cross-section  gates  and  offers 
performance  to  -70  GHz.  An  improved  process  under 
development  will  reduce  the  gale-length  to  <0.15 
micron  enabling  performance  to  be  extended  beyond 
100  GHz.  Two-inch  wafers  with  epitaxial  layers  grown 
in  the  Division's  molecular-beam  epitaxy  (MBE) 
system  are  used.  The  process  incorporates  the  usual 
mctal-insulator-mctal  (MIM)  capacitors  and  thin-film 
resistors,  together  with  air-bridge  cross-overs.  Dry 
recess-etching  is  used  for  the  gale  process.  Wafers  are 
thinned  to  either  50  or  100  microns  and  through- 
substrate  vias  are  used  to  provide  grounding.  This 
process  has  been  used  to  fabricate  amplifiers  operating 
to  -70  GHz.  The  example  shown  in  figure  I,  a  2-stage 
amplifier,  covers  the  frequency  range  of  50  to  67  GHz 
with  -10  dB  gain,  as  showm  in  Figure  2. 


Figure  I  60  GHz  2-stagc  amplifier 
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Figure  2  S]  I  and  S2 1  measured  and  predicted 
for  a  2-stage  60  GHz  MMIC  amplifier. 


coupler  providing  a  compact  chip  of  3  x  3  mm.  Upper 
or  lower  sideband  operation  can  be  selected  by 
connecting  to  the  appropriate  terminal  of  the  spiral 
coupler. 


In  addition  to  the  development  of  shorter  length  gates 
(<0.15  micron),  research  is  being  done  into  epitaxial 
material  structures  for  increased  linearity  and  power 
handling.  Results  of  this  work  to  date  arc  encouraging 
with  some  materials  having  saturation  currents  of  --1 
A/mm. 


Figure  4  Conversion  gain  and  S|  |  (RF  return  loss) 
for  a  60  GHz  mixer.  Circles  show 
measured  performance  for  the  lower 
sideband,  crosses  show  uppe  sideband 
performance. 


Figure  3  60  GHz  image-reject  mixer 


The  Schottky  diode  process  [1]  provides  both  varactor 
and  mixer  diodes.  This  process  has  been  used  to 
produce  mixers  fl,2|  and  frequency  multipliers 
operating  to  100  GHz  and  beyond.  Figure  3  shows  an 
example  of  a  60  GHz  image-reject  mixer.  Its 
performance  is  illustrated  in  Figure  4  showing 
conversion  loss  of  ~10  dB  and  image  rejection  of  ~20 
dB.  The  mi.xcr  is  laid  out  inside  a  spiral  2.5  GHz  IF 


The  combined  PHEMT/Schottky  diode  process  has 
been  used  for  voltage-controlled  oscillator  (VCO) 
circuits.  A  12.5  GHz  VCO  gave  a  tuning  range  of  1 1.5 
to  13  GHz  and  an  output  power  of  +14  dBm.  The 
phase-noise  for  this  circuit  was  -95  dBc/Hz  at  100 
kHz.  Higher  frequency  oscillators  at  20  and  30  GHz, 
with  a  tuning  ranges  of  --2  GHz  and  output  powers  of 
~14  dBm,  have  recently  being  fabricated. 

Other  MMICs  designed  and  fabricated  in  the  CSIRO’s 
GaAs  facility  include: 

1-25  GHz  distributed  amplifiers 
6-20  GHz  amplifiers 
25-50  GHz  amplifiers 
35-45  GHz  image-reject  mixers. 


4.  DESIGNS  FOR  EXTERNAL  FOUNDRIES 

The  Radiophysics  Division  has  established 
relationships  with  external  foundries  to  enable  it  to 
provide  a  design  ser\  ice  for  commercial  customers  that 
can  lead  to  large-scale  production.  One  such 
relationship,  with  the  TRW  foundiy  in  the  USA,  is 
being  used  to  produce  a  chip  set  for  38  GHz 
transceivers  for  wireless  backhaul  applications  in  PCS 
systems.  In  addition  to  low-noise  amplifiers  (LNAs), 
medium-powder  amplifiers  and  mixers  (both  up  and 
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down  comcrlcrs)  for  this  program,  oilier  designs 
fabricalcd  at  TRW  include: 

38  GH/  receiver  (LNA  plus  mixer)  [2] 

60  GHz  amplifiers 

60  GHz  mixers 

1-20  and  20-45  GHz  switches. 

All  design  work  and  prototype  testing  for  this  program 
has  been  carried  out  at  the  Radiophysics  Division. 


5.  DEVICE  TESTING 

MMIC  testing  and  active  device  characterisation  is 
carried  out  in  the  Division’s  MMIC  test  laboratories. 
The  dc  laboratory  includes  facilities  for  evaluating 
materials,  using  step-and-elch  Hall  effect  and  photo- 
rcflcclancc  systems.  The  microwave  test  laboratory 
includes  four  microwave  probe  stations  (two  with  auto- 
stepping  capability)  and  two  HP85I0C  50  GHz 
automatic  network  analysers,  one  of  which  is  equipped 
with  waveguide  extensions  enabling  on-wafer  probing 
from  40  MHz  to  110  GHz  for  device  characterisation 
and  MMIC  testing,  as  well  as  measurement  of 
waveguide  and  coaxial  packaged  devices. 


6.  DEVICE  MODELLING 

A  significant  effort  has  gone  into  device  measurement 
and  the  refinement  of  active  and  passive  device  models 
with  a  view  to  accurate  performance  prediction  and, 
ultimately,  one-pass  design.  The  measured  and 
predicted  linear  responses  for  60  and  40  GHz 
amplifiers  shown  in  Figures  2  and  5  indicate  the 
progress  that  has  been  made  in  this  aspect  of  the  work. 
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Figure  5  Sj  j  and  S2 1  measured  and  predicted  for  a  2- 
stage  40  GHz  MMIC  amplifier. 


A  physics-based,  non-linear  HEMT  model  has  been 
developed  in  conjunction  with  Macquarie  University, 
NSW,  Australia  [3,4].  This  model  has  been  coded  into 
HP-EEsof  Libra  [5]  to  facilitate  accurate  non-linear 
MMIC  design.  Software  tools  have  also  been  written  to 
extract  circuit  elements  from  measured  I-V 
characteristics  and  bias-dependent  S-parameters  for  the 
Libra  implementation  of  the  Curtice  non-linear  FET 
model  and  the  CSIRO  non-linear  HEMT  model. 


7.  MMIC  APPLICATIONS 

The  MMIC  work  at  the  CSIRO  is  part  of  a  larger 
program  of  system  design,  and  complements  other 
work  on  antenna  design,  telecommunications  and 
signal  processing.  Work  in  progress  at  the  Division  of 
Radiophysics  includes  designing  MMIC-based 
transmitter  and  receiver  elements  for  38  GHz  wireless 
backhaul  applications  in  personal  communications 
systems,  investigating  systems  for  deliveiy  to  the  home 
and  office  of  wideband  customer  services,  and 
developing  a  60  GHz  WLAN  system  which  is  capable 
of  100  M  bit  s“^  operation  16].  Each  of  these  tasks 
includes  packaging  MMICs,  together  with  designing 
circuits  with  higher  levels  of  integration. 

Packaging  efforts  have  included  optimising  transitions 
between  waveguide  and  other  transmission  media 
using  the  HP  HFSS  (high-frequency  stnicture 
simulator)  software,  miniaturising  packages,  and 
developing  planar  antennas  for  integration  with  MMIC 
transceiver  elements. 


8.  CONCLUSION 

While  the  scale  of  MMIC  design  and  fabrication  in 
Australia  is  small  compared  to  the  work  being  carried 
out  in  the  major  GaAs  foundries  of  the  world,  the 
levels  of  quality  and  innovation  are  high.  Devices  and 
circuits  arc  produced  with  high  performance  at 
millimetre  wavelengths,  and  designs  performing  at,  or 
close  to,  specification  arc  produced  with  a  single 
design  cycle.  The  prototyping  services  being  offered 
are  increasing  the  use  of  MMICs  in  Australian 
products. 
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Abstract 

This  paper  summarizes  the  general  requirements  of  the 
monolithic  microwave  integrated  circuit  (MMIC)  for 
satellite  communication  applications.  It  also  describes  a 
space  qualification  procedure  being  implemented  on  a 
0.25  pm  MESFET  MMIC  process  as  well  as  a  0.15  pm 
InP  HEMT  process  and  a  few  examples  of  the  MMIC's 
developed  using  these  processes. 


1.  Introduction 

Communication  satellites  rely  heavily  on  microwave  and 
millimeter-wave  carriers  for  signal  transmission. 
Therefore,  microwave  and  millimeter-wave  components 
play  an  important  role  in  communication  satellites. 
Because  of  the  unique  environment  in  space,  design 
requirements  of  microwave  components  are  also  unique. 
The  expensive  operation  of  transporting  the  satellite  to 
space  demands  that  payloads  be  small  and  light.  The 
limited  amount  of  energy  from  solar  cells  demands 
efficient  operation  of  the  electronics  while  radiating 
sufficient  microwave  power  for  signal  transmission.  The 
10  to  15  years  of  service  requirement  without  the 
possibility  of  field  repair  demands  that  every  component 
be  extremely  reliable. 

The  reliability  must  not  be  compromised  for  any  other 
performance  requirement.  By  employing  MMIC's  as  the 
functional  building  blocks,  the  size  and  weight  of  the 
electronic  payload  will  be  dictated  by  the  other  electronic 
circuits  and  their  packages.  Therefore,  reducing  the  size 
of  the  MMIC's  themselves  is  not  a  key  issue.  Higher 
functional  levels  of  integration  are  desirable  to  reduce  the 
size/weight  of  the  MMIC  circuit  packaging.  In  addition, 
having  fewer  components  and  wirebonds  in  the 
manufacturing  process  will  also  improve  the  system 
reliability.  In  summary,  the  design  parameters  of  the 


MMIC's  for  satellite  communications  in  order  of  their 
importance  are:  (1)  reliability,  (2)  performance,  (3) 
power  consumption,  and  (4)  size  and  weight. 

In  this  paper,  we  first  summarize  a  space  qualification 
procedure  that  we  developed  for  the  design  and 
manufacture  of  MMIC's  for  satellite  communications. 
Then,  we  will  present  examples  of  MMIC's  that  we 
developed  for  microwave  and  millimeter-wave 
applications. 


2.  Space  Qualiflcation  Procedure 

Hughes  Space  and  Communications  is  qualifying  for 
space  a  0.25  pm  MESFET  MMIC  process  for 
communication  payload  applications.  Over  23  individual 
MMIC's  have  been  designed,  fabricated,  and  tested  for 
engineering  model  (EM)  development.  These  MMIC's 
are  now  in  flight  production.  The  qualification  plan  for 
these  MMIC's  is  based  on  a  tailored  version  of  MIL-I- 
38535B  which  has  been  adopted  in  part  due  to  specific 
program  requirements  and  the  timing  of  the  Hughes 
GaAs  Operation  (GAO)  foundry  QML  certification.  The 
qualification  process  is  broken  into  generic  process 
qualification  and  specific  MMIC  product  qualification. 
Additionally,  there  are  specific  lot  acceptance  test 
requirements.  This  procedure  can  be  applied  to  any 
process  or  foundry. 

Generic  process  qualification  includes  step  stress  and 
biased  life-test  on  technology  characterization  vehicles 
(TCV)  and  standard  evaluation  circuits  (SEC).  The 
TCVs  include  standard  FET's,  Ohmic  contact  TLM's, 
electromigration  test  patterns,  etc.  For  the  process  SEC, 
an  X-band  two  stage  LO  driver  circuit  was  chosen.  The 
MMIC's  for  the  communication  payload  can  be  divided 
functionally  into  five  classes:  low  noise,  driver,  mixer, 
multiplier,  and  macrocell.  DC  and/or  RF  burnin  and  life 
tests  are  performed  as  part  of  a  one  time  functional 
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qualification.  Representative  MMIC's  from  each 
Actional  class  are  chosen  to  qualify  the  entire 
functional  class.  Additional  design  validation  tests  are 
performed  on  all  MMIC’s  to  verify  tlie  reliability  of  all 
designs. 

In  MMIC  product  qualification,  each  MMIC  product  will 
be  individually  analyzed  to  determine  the  extent  of  the 
qualification  testing.  Product  qualification  addresses  the 
required  tests  to  demonstrate  that  the  MMIC  meets  the 
performance  requirements  over  the  expected  bias 
variations,  temperature  range,  and  life  of  the  satellite. 
The  accelerated  product  life-tests  described  here  will 
demonstrate  the  acceptability  of  specific  MMIC  products 
for  use  in  space  applications.  Successful  completion  of 
performance  and  reliability  tests  will  indicate  that  the 
specific  MMIC  product  is  qualified  for  space 
applications.  Within  each  fiinctional  class  at  least  one 
MMIC  product  will  receive  more  extensive  testing.  This 
will  typically  involve  a  two  or  three  temperature 
accelerated  life-test.  The  remainder  of  the  MMICs  will 
have  more  limited  life-testing  and  will  be  qualified  in 
part  by  similarity  to  the  other  MMIC  products  in  its  class. 

The  MMIC  performance  requirements  are  clearly 
documented.  The  MMIC  FET  DC  performance 
requirements  are  defined  by  a  MMIC  general 
specification  whereas  the  MMIC  RF  performance 
requirements  are  defined  by  the  MMIC  specification 
control  drawing  (SCD)  for  each  product.  Each  MMIC 
product  will  have  a  full  functional  electrical  test 
consisting  of  DC  and  RF  performance  measured  over  bias 
and  temperature  ranges.  This  test  will  ensure  that  the 
specific  MMIC  product  design  will  meet  performance 
over  all  bias  and  temperature  extremes  it  is  likely  to  see 
in  operation.  The  nominal  operating  bias  is  defined  in 
the  individual  MMIC  product  SCD.  This  test  will  vary 
the  drain-source  bias  voItageCVjg)  and  gate-source  bias 
(Vgs)  within  five  percent.  DC  and  RF  performance  for 
ea^  product  must  be  met  for  these  test  conditions.  The 
MMIC  products  will  have  DC  and  RF  performance 
measured  at  three  temperatures  covering  the  nominal 
operating  temperature  and  the  upper  and  lower  extremes. 

Supplemental  accelerated  life-tests  are  performed  on 
individual  MMIC  products  because  the  SEC  life-test 
described  above  in  the  MMIC  generic  process 
qualification  does  not  cover  all  functional  operating 
modes  of  the  MMIC's  to  be  qualified  by  this  plan.  The 
MMIC's  to  be  qualified  fall  into  a  number  of  functional 
classes  differentiated  by  the  operating  mode  of  the  FET's. 
These  functional  classes  include  small  signal  or  low 
noise  amplifiers,  large  signal  power  devices,  dual  gate 
variable  gain  amplifier,  FET  mixers,  complex 


macrocells,  voltage  controlled  oscillators  (VCO),  FET 
multipliers,  and  phase  and  amplitude  modulators. 

The  DC  or  RF  accelerated  life-test  and  electrical  stress 
for  the  MMIC  product  is  selected  based  upon  the 
functionality  of  each  product.  All  tests  include  DC  and 
RF  characterization.  The  two  temperature  life-test  will 
continue  until  at  least  50%  of  the  devices  fail  at  least  one 
parameter.  If  time  is  available  the  devices  will  be  taken 
well  beyond  50%  failure.  The  log  normal  distribution  of 
percent  failures  versus  failure  time  will  be  plotted  for 
each  parameter  and  temperature.  The  median  failure 
point  for  each  parameter  will  then  be  plotted  versus 
temperature  as  an  Arrhenius  plot  to  determine  MTTF  at 
mean  mission  baseplate  temperature  of  65®C  and  the 
activation  energy  for  the  degradation  mechanism. 

Lot  Acceptance  Test  (LAT)  is  the  steady-state  life-test 
which  is  performed  on  a  minimum  of  5  each  of  the 
MMIC's  utilizing  tlie  MMIC  product /die  from  each 
product  wafer  lot.  Test  conditions  \yill  be  at  a  case 
temperature  of  125°C  for  1000  hours  with  bias  as 
specified  in  the  detailed  drawing,  Lifetest  pass/fail 
criteria  are  clearly  defined  in  the  detailed  drawing. 


3.  Ku-band  MMIC  Examples 

Using  the  0.25  pm  MESFET  process  described  above,  we 
developed  a  family  of  MMIC  microcells  and  macrocells 
for  Ku-band  communication  satellite  applications.  Figure 
la  shows  a  voltage-controlled  attenuator  (VC A)  which 
consists  of  two  identical  shunt  FET's  sandwiched 
between  two  Lange  couplers.  At  zero  bias,  the  drain- 
source  resistance  is  low  (<5  Ohms)  to  reflect  most  of  the 
power  and  set  the  maximum  insertion  loss.  As  the 
negative  bias  increases  in  magnitude,  the  insertion  loss 
decreases  because  the  FET  resistance  increases  and 
allows  more  power  through  the  50Q  line.  The  balanced 
configuration  using  Lange  couplers  directs  the  reflected 
power  into  a  50Q  load  to  achieve  low  return  loss  (>20 
dB).  Since  the  insertion  loss  is  mainly  due  to  the  reflected 
power,  only  a  small  amount  of  power  is  dissipated  in  the 
FET.  This  results  in  a  low  voltage  swing  across  the  FET 
junction  and  a  high  IP3  of  the  attenuator.  For  the 
bandwidth  of  10.7  to  12,7  GHz,  the  VCA  has  about  1  dB 
insertion  loss,  10  dB  dynamic  range,  17  dB  return  loss, 
and  greater  than  15  dBm  of  IPS  over  the  dynamic  range. 

Combining  the  VCA  circuit  with  single-stage  gain 
block(s)  constitutes  the  variable  gain  attenuators  (VGA) 
as  shown  in  Figures  Ib-ld.  The  VGA  circuits  have  10-dB 
of  dynamic  range  per  VCA  and  6-dB  of  gain  per  gain 
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block.  Figures  le  and  If  are  examples  of  Ku-band 
macrocells.  Figure  le  is  a  17-GHz  downconverter 
consisting  of  an  RF  amplifier,  LO  amplifier,  and  a 
passive  FET  mixer.  The  IF  signal  is  at  1  GHz.  Figure  If 
is  a  singly-balanced  up-converter  taking  a  2  Hz  signal 
through  a  buffer  amplifier  and  an  active  splitter  to  a  pair 
of  passive  FET  mixers.  The  LO  drive  is  amplified  by  a 
two-stage  amplifier. 


4.  V-band  MMIC  Examples 

Future  satellite  payloads  have  a  need  for  size  and  weight 
reductions  in  the  EHF  payload  without  sacrificing  the 
system  perfonnance,  InP  HEMT's  can  play  a  major  role 
in  meeting  those  needs.  The  InP  HEMT  stnicture  has 
produced  the  lowest  device  noise  figure  performance  at 
60  GHz  (about  1  dB)  with  9  dB  of  associated  gain  and 
the  highest  cutoff  frequency  (>  300  GHz).  This  section 
summarizes  the  circuit  design  and  measured  results  of 
four  MMIC  chips  shown  in  Figure  2. 

The  V-band  LNA  design  uses  a  100  pm  device,  4  x  25 
pm,  biased  for  low  noise  (Vds=+1.0V  at  14  niA)  in  the 
first  stage  and  a  50pm  device  biased  for  high  gain 
(Vds=+1.5V  at  10  mA)  in  the  second  stage.  The  100pm 
device  was  chosen  to  minimize  the  matching  network 
required  for  the  best  noise  match  and  return  loss  at  V- 
band.  The  device  were  stabilized  with  a  double  tuned 
quarter-wavelength  structure  on  the  gate  bias  for 
optimum  stability.  Chip  size  is  2  x  2.5  mm^  and 
consumes  29  mW  of  power  (Figure  2a).  The  measured 
results  show  a  noise  figure  between  2.2  and  2.4  dB  with 
an  associated  gain  of  14.7  to  16.7  dB  from  58  to  62  GHz. 

The  mixer  design  uses  a  single-ended,  passive  HEMT  as 
the  nonlinear  element  to  yield  a  small  mixer  with  a  low 
LO  drive.  The  drain  is  biased  at  zero  volts  and  the 
mixing  function  is  achieved  by  creating  a  linear,  time 
varying  channel  resistance  produced  by  the  LO  signal 
driving  the  gate.  This  linear  resistance  minimizes 
intermodulation  products.  The  design  uses  a  single  100 
pm  device.  An  LO  signal  of  +4  dBm  is  sufficient  to  drive 
the  gate  which  is  biased  near  pinch-off.  The  drain  is 
designed  with  a  bandpass  diplexer  network  which 
receives  the  RF  input  signal  (58  to  64  GHz)  through  one 
port,  mixes  with  the  LO  frequency  (48  to  58  GHz),  and 
outputs  the  IF  frequency  (6  to  10  GHz)  through  the 
second  port.  The  chip  size  is  2  x  3  mm^  (Figure  2b)  and 
consumes  no  power.  The  measured  result  shows  a 
conversion  loss  from  8  to  10  dB  for  the  IF  frequency  from 
4.5  to  10  GHz  with  Vds  =  OV  and  Vgs  =  -0.4V.  The  2IF 
spur  is  at  -32  dBc. 


The  three-stage  LO  amplifier  design  uses  a  50  pm  device 
in  the  first  stage  and  a  150  pm  device  in  the  second  and 
third  stages.  All  stages  are  biased  for  high  gain  with 
Vds=+l.5V  at  10  mA  for  the  50  pm  device  and  30  mA 
for  each  of  the  150  pm  devices.  The  chip  size  is  2  x  3 
mm^  and  consumes  105  mW  of  power  (Figure  2c). 
Measurement  at  55  GHz  shows  21.7  dB  of  linear  gain 
and  +9.65  dBm  of  output  power  at  1  dB  gain 
compression. 

We  also  developed  a  V-band  downconverter  macrocell  by 
combining  the  LNA  and  mixer  building  blocks  with  a  50 
Ohm  line  (Figure  2d).  The  measured  result  shows  7  dB 
of  gain  and  3  dB  noise  figure  for  RF  signals  between  61 
to  62  GHz. 


5.  Conclusion 

In  this  paper,  we  have  summarized  the  general 
requirements  of  the  MMIC's  for  satellite 
communications.  We  have  also  described  the  space 
qualification  procedure  being  implemented  on  a  0.25  urn 
GaAs  MESFET  process  as  well  as  a  0. 15  um  InP  HEMT 
process.  A  few  examples  of  the  MMIC’s  that  were 
developed  using  these  processes  are  also  given. 
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ABSTRACT 

It  has  been  demonstrated  recently  that  the 
technique  of  direct  electro-optic  probing  can 
successfully  be  applied  to  study  two- 
dimensional  electric  field  distributions  in 
MMICs.  As  a  result,  each  MMIC 
component  can  be  characterized  by  on-wafer 
and  in-circuit  measurements  up  to  millimeter 
wave  fi-equencies.  In  this  paper, 
experimental  results  performed  on  a 
traveling-wave  amplifier  are  presented  as  an 
example.  Furtheron,  different  MMIC 
components  are  tested  in  detail  and  a 
quantitative  comparison  with  theoretical 
results  is  carried  out. 


1.  INTRODUCTION 

Rapid  progress  has  recently  been  made  in  the 
simulation,  design  and  processing  of 
monolithic  microwave  integrated  circuits 
(MMICs)  on  ni-V  semiconductors. 
However,  the  high-frequency  performance 
test  is  usually  made  by  conventional  network 
analysis  at  the  input  and  output  ports  which 
only  allows  an  integrated  overview  of  the 


electrical  behavior  of  the  whole  structure. 
On  the  other  hand,  the  direct  electro-optic 
measurement  technique  [1]  has  been  shown 
to  provide  a  circuit-internal  and  contactless 
test  with  high  spatial  and  frequency  or 
temporal  resolution  by  two-dimensional  field 
mapping  of  the  device  under  test  (DUT)  [2]. 
In  this  paper,  an  overview  is  given  on  recent 
results  performed  on  a  traveling-wave 
amplifier  (TWA)  and  on  different  MMIC 
components.  In  particular,  the  two- 
dimensional  field  distributions  on  the  drain 
transmission  line  of  a  three  stage  amplifier 
and  inside  a  high-speed  MESFET  and 
interdigitated  capacitor  are  presented  and 
discussed.  Additionally,  the  properties  of 
different  coplanar  waveguide  (CPW)  modes 
are  studied  in  detail.  Moreover,  the  paper 
summarizes  preliminary  steps  towards 
quantitative  comparisons  with  theorectical 
results. 


2.  EXPERIMENTAL  SETUP 

The  fundamental  experimental  setup  (Fig.l) 
consists  of  a  polarized  picosecond  Nd:YAG 
laser  system.  The  optical  pulses  are  focussed 
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onto  the  DUT  from  the  backside,  the 
reflected  signal  being  analysed  by  a  polarizer 
and  a  photodiode.  The  output  electrical 
signal  is  a  low-frequency  replica  of  the 


ti-anslation  stage  +  DUT 


phase 
stabilisation 


miccoscope- 
objective 

photodiode 

I  11  ^  """W. 


lensc 


beam¬ 

splitter 


(PLL) 


)74-plate  C 
polarizer  c 


lasers  pulse 


Fig.l  Experiment  setup  [2,3] 

microwave  signal  at  the  point  of  observation, 
generated  by  harmonic  mixing  in  the 
semiconductor  as  a  result  of  the  electro¬ 
optic  Pockels  effect  [2,3], 

3.  EXPERIMENTAL  RESULTS 

In  Fig.2  the  1-12  GHz  traveling-wave 


Fig.l  Microwave  coplanar  TWA  [2] 


amplifier  under  test  is  shown.  It  consists 
transmission  lines,  terminating  resistors,  of 
three  MESFETs,  gate  and  drain  air  bridges, 
capacitors  and  dc  bias  ports.  The  field 
distribution  inside  the  circuit  has  been 
studied  in  detail  [2].  Two  examples  are 
shown  in  Figs.3  and  4.  From  the  field 
distribution  in  Fig.3 


Fig.  3  Two-dimensional  field  map  of  a 
MESFET  of  the  TWA  in  Fig.l,  frequency  1 
GHz 


Fig.  4  Two-dimensional  field  map  of  the 
drain  transmission  line  of  the  TWA  in  Fig.l, 
frequency  9  GHz 

the  circuit-internal  amplification  can  directly 
be  determined.  The  measurement  gives  also 
an  insight  into  the  potential  and  phase 
distribution  on  the  metallization  structure  at 
different  frequencies.  The  traveling-wave 
characterization  of  the  TWA  can  be 
obtained  from  the  results  in  Fig.4,  where 
field  maxima  and  minima  reveal  the  influence 
of  standing  wave  effects  inside  the  structure. 
Fig.  5  is  an  example  of  the  high  spatial 
resolution  which  has  been  obtained  using 
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conventional  optical  components.  Clearly, 
the  field  distributions  in  a  2|im  structure  can 


Fig.  5  Two-dimensional  field  map  of  an 
interdigitated  capacitor  (2pm  structure) 

easily  be  detected. 

In  Fig.  6  the  measured  field  distribution  of 
different  modes  on  a  CPW  are  plotted  [4]. 
The  standing  waves  clearly  show  the 
characteristics  of  the  even  and  odd  mode  and 
the  so  called  microstrip  mode  [5]  which  give 
rise  to  novel  coupling  effects,  especially  at 
inhomogeneities  such  as  bends,  air  bridges, 


4.  QUANTUAVIE  COMPARSION 
WITH  THEORY 

Preliminary  quantitative  comparisons  of 
experimental  results  with  those  of  numerical 
simulations  have  been  carried  out. 
Particularly,  microwave  propagation 
properties  on  CPWs  have  been  studied  in 
detail  including  the  phase  and  attenuation 
coefficient  and  the  characteristic  impedance 
[6],  In  Ref  [7]  the  signal  traces  in  a  coplanar 
18-21  GHz  amplifier  have  been  measured 
and  compared  with  the  predictions  of  a 
microwave  CAD.  From  the  experimental 
results,  the  dispersion  properties  can  easily 
be  determined  [5].  Additionally,  the  two- 


dimensional  electro-optic  probing  has 
recently  been  combined  with  field  threoiy 
based  ultimode  wave  amplitude  extraction 
technique  to  study  mode  conversion 
phenoena  in  MMICcomponents  [3]. 


Fig.  6  Two-dimensional  field  map  of  even 
(a),  odd  (b),  and  microstrip  (c)  mode  on  a 


CPW 
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5.  SUMMARY  AND  CONCLUSIONS 

It  is  shown  by  several  examples  that  two- 
dimensional  electro-optical  field  mapping 
can  successfully  be  used  to  study  and 
characterize  MMICs.  The  method  provides 
circuit-internal,  contactless  measurements  of 
electrical  signals  at  arbitrary  nodes.  In 
addition,  a  good  agreement  between 
experimental  and  theoretical  result  has  been 
obtained  and  it  is  shown  that  the  electro¬ 
optic  field  mapping  is  a  unique  method,  not 
only  for  the  measurement  of  mode 
conversion  or  wave  propagation  effects  but 
also  for  a  detailed  and  spatially  resolved  test 
of  MMICs  and  relevant  devices.  This  is 
especially  important  for  frequencies  close  to 
and  above  100  GHz. 

6.  ACKNOWLEDGEMENTS 

This  work  has  been  financially  supported  by 
the  Deutsche  Forschungsgemeinschaft.  The 
authors  would  like  to  thank  the  Fachgebiet 
Halbleitertechnik  /  Halbleitertechnologie,  the 
Fachgebiet  Allgemeine  und  Theoretische 
Elektrotechnik,  both  Universitat  Duisburg, 
and  the  IMST,  Kamp  Lintfort,  for  providing 
the  traveling-wave  amplifiers. 

REFERENCES 

[1]  B.H.  Kolner  and  D.M.  Bloom, 
"Electrooptic  sampling  in  GaAs  integrated 
circuits",  IEEE  J.  Quantum  Electron.,  vol. 
QE-22,  (1986),  79-93. 

[2]  G.  David,  S.  Redlich,  W.  Mertin,  R.M. 
Bertenburg,  S.  KoBlowski,  F.J.  Tegude,  and 
D.  Jager,  "Two-dimensional  direct  electro¬ 
optic  field  mapping  in  a  monotithic 
integrated  GaAs  amplifier",  Proc.  23rd 
EuMC,  September  6-9,  1993,  Madrid, 
Spain,  497-499. 

[3]  G.David,  W.  Schroder,  D.  Jager,  and  I. 
Wolf,  "2D  electro-optic  probing  in 
combination  with  field  theory  based 


multimode  wave  amplitude  extraction:  A 
new  approach  to  on-wafer  measurement", 
1995  IEEE  MTT-S  Int.  Symposium,  May 
15-19,  1995,  Orlando,  USA. 

[4]  G.  David,  P.  Schmitz,  S.  Redlich,  J. 
Buschke,  R.  Kremer,  and  D.  Jager,  "Mode 
analysis  of  wave  propagation  on  coplanar 
transmission  lines  using  electro-optic  field 
mapping",  Proc.  Asia-Pacific  Microwave 
Conference,  Tokyo,  December  6-9,  1994, 
149-152. 

[5]  H.  Shigesawa,  M.  Tsuji,  and  A.A.  Oliner, 
"A  new  mode  coupling  effect  on  coplanar 
waveguide  of  finite  width",  1990  IEEE 
MTT-S  Digest,  1990,  1063-1066. 

[6]  G.  David,  S.  Redlich,  W.  von  Wendorff, 
and  D.  Jager,  "Electro-optic  probing  of 
coplanar  transmission  lines  and 
optoelectronic  microwave  devices  up  to  40 
GHz",  Proc.  of  the  MIOF93,  May  25-27, 
Sindelfingen,  1993,  492-496. 

[7]  G.  David,  A.  Ising,  Y.  Kalayci,  R. 
Tempel,  I.  Wolf,  and  D.  Jager,  "Circuit- 
internal  characterization  of  MMICs  using 
two-dimensional  electro-optic  field  mapping 
in  combination  with  microwave  CAD", 
Proc.  24th  EuMC,  September  5-8,  1994, 
Cannes,  France,  1378-1383. 


44 


High  Power  Amplifiers  for  Mobile  Commumcatioo  Systems 

Yasushi  Itoh 

Mitsubishi  Electric  Corporation 
lectro-Optics  &  Microwave  Systems  Laboratory 

Tel:.8M67-4N2l«;Z 

2519,  E-mail;sei@eml.melco.co.jp 


Abstract 

results  of  GaAs 

^SFET  high  power  amplifier  modules  with  low  drain 

nc  S'and  fT  ^sterns. 

MMTP  tS.  f  fabncation  techniques  using  HMIC  and 

presented,  addressing  a  design 
method  of  high  power  amplifier  modules  with  tL 
ngent  requirements  for  cost,  size,  and  performance 

^ules.  UHF-band  high  efficiency,  high  power 

antf  employing  a  combination  of  HMIC 

and  MMIC  technologies,  a  UHF-band  high  efficiency 

^Ta^ToH®  h  technologiS; 

d  a  1.9GHz  high  efficiency,  linear  MMIC  amplifiCT. 

1.  Introduction 

Si,?'  “  ■"<*“«  and 

^llne  coiMiuncation  systems,  a  number  of  high 
po  er  amplifier  modules  have  been  developed  [1-31 
l2  c^f  requirements  for  high  power  amplifiers  an 
s^n  •  voltage,  low  power  consumption 

cell  ’  linearity  especially  for  digital 

Phone  System)  m  Japan.  In  this  paper,  our  current 
resu  ts  of  GaAs  I^SFET  high  power  amplifier  modules 
wth  low  dram  bias  voltage  for  use  in  analog/digital 
handheld  telephones  and  the  PHS  in  Japan  are  presented 
fo^sing  on  the  design,  fabrication  and  peifomance  of 

effi  amplifiers  demonstrate  high 

efficiency  high  power,  or  high  linearity  under  low  drain 
bias  operation  of  3 . 3  V. 

2.  High  Power  Amplifiers  Employing  a 
Combination  of  HMIC  and  MMIC 

A  photograph  of  the  UHF-band  three-stage  high 

r,L  f  peripheries  of  the  1st,  2nd,  and 

final  stage  FETs  were  determined  to  0.2,  1.6,  and 


In 

are 


18.2mm,  respectively,  to  achieve  a  saturated  outnnf 
Pr“  "f  ream,  to  3,gBm  aud  a 
efficiency  of  greater  to  50%  at  UHFtod  ^Tlte 
mpiffier  ts  opetated  with  low  drain  bias  voltage  of  3?v 
for  rrinctng  battety  size  of  handheld  teleph^fL  .v 
b;^  unde,  class  AB  operation  for  acCt“g^„g 

oJ^lmtotaped^ees““tef“f 

ey  making  the  fimdamental  and  second  harmonic  load 

S  "T”  ‘’1  »  <l“  resnr„fTte 

load-pull  measurements,  the  output  matching  circuit  was 
designed  by  employing  the  second  harmonic  reflection 
circuit  to  show  the  optimum  impedancesir  ^th  foe 
fundamental  and  second  harmonic  waves. 

The  amplifier  is  comprised  of  three  chips:  active  circuit 
chjp,  passive  circuit  chip,  and  output  Ltching  circuit 
chip.  The  active  and  passive  circuits  were  designed  bv 

suDstotes  for  achieving  a  miniaturized  size  The 
onto  matching  circuit  was  designed  h  nsl^^ 
tb^bnted  elements  and  fabricated  on  a  high  Lectri? 

These  three  ehips  are  mounted  on  a  ceramic  leadless 


Fig.  1  UHF-band  three-stage  high  efficiency,  high 
power  amplifier  module 
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nackage  with  the  dimensions  of  9.5  x  lOnun  .  The 
measvLl  output  powet  and 

plottod.nFig.2.  Asatunttedontput^ronlA^ 
Md  a  power-added  efficiency  of  57.3 /«  have  be 
achieved  at  UHF. 


Fig.2  Measured  output  power  and  efficiency 

Another  example  of  a  UHF-band  four-stage,  high 
efficiency,  high  power  amplifier  module  ^ 

combination  of  HNflC  and  MMIC  is  shown  in  Fig.3  [  ]. 
It  consists  of  three-stage  driver  MNHC  "" 

single-stage  power  HMIC 

peripheries  from  the  1st  and  final  stag  . 

optimized  to  maximize  the  efficiency  for  an  averaged 
power  of  31dBm  and  a  samrated  power  g^^ater^ 
34dBm  for  zr /4-shift  QPSK  modulated  signals  at  Ulffi 
Thfl“arimpedance  of  the  final  stage  FET  prov^mg  the 
eLiency  was  obtained  by  the  ffindamental 
and  second  harmonic  load-pull  measurements. 


The  source  impedance  providing  the  minimum  insertion 
phLe  variatiL  was  obtained  by  the  source-pu^^ 
ras— .  Based  on  the  — s  of  the  ^urc^H 
and  load-pull  measurements,  the  input  and  outp 
matching  circuits  of  the  final  stage  amplifier  were 
designed  to  achieve  high  efficiency  and  low  insertion 

phase  variation. 

The  amplifier  consists  of  four  chips;  a  three-stage  NMC 
amplifier  chip,  a  final-stage  FET  chip,  input  and  ou^ut 
matohing  cirL  chips.  The  input  and  output  matckng 
circuits  were  fabricated  on  high  dielectnc  substrates  ( 

=9  8  and  38)  to  reduce  loss  and  size  of  the  ampli^^ 
These  chips  were  mounted  on  a  ceramic  package.  1  he 
^l^i^ed'^output  power,  power-added  efficiency,  ^d 
insertion  phase  variation  are  plotted  in  Fig.4.  The 
amplifier  demonstrates  a  saturated  output  power  of 
34dBm,  a  drain  efficiency  of  38. 1%  and  an 
phase  variation  of  7degs  at  an  averaged  power  of  31® 
at  950MHz.  The  measured  output  spectrurn  for 
shift  QPSK  modulated  signals  at  an  averaged  power  of 
3^m  is  plotted  in  Fig.5.  An  adja^nt  channel 
leakage  power  (50kHz  offset)  of  -48.5®c  has  be 
obtained.  The  drain  voltage  is  5.8V. 


Fig.3  UHF-band  four-stage,  high  efficiency,  high 
power  amplifier  module 
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3.  High  Power  Amplifier  Employing  HMIC 

B 

A  photograph  of  the  UHF-band  two-stage,  high  ^ 
efficiency,  linear  amplifier  module  is  shown  in  Fig,6  [7],  fe 
The  gate  peripheries  of  driver  and  power  FETs  were  % 
determined  to  8.4  and  31.5mm,  respectively,  to  achieve  g 
the  maximum  efficiency  at  an  averaged  output  power  of  & 
31dBm  for  7r/4-shifi  QPSK  modulated  signals.  The  ^ 
optimum  load  impedance  providing  the  maximum  S) 
efficiency  at  an  averaged  output  power  of  31dBm  was  fe 
obtained  first  for  the  power  FET  by  the  load-pull  < 
measurements  using  tt /4-shift  QPSK  modulated  signals. 
Next,  the  optimum  source  impedance  providing  the 
minimum  ACP  (Adjacent  Channel  leakage  Power)  was 
obtained  by  the  source-pull  measurements  using  7r/4- 
shift  QPSK  modulated  signals.  The  driver  FET  is 
biased  under  class  A  operation  for  high  linearity.  The 
power  FET  is  biased  under  class  AB  operation  for  high 
efficiency.  The  drain  voltage  is  3.3  V. 


Fig.6  Photograph  of  the  UHF-band  two-stage,  high 
efficiency,  linear  amplifier  module 

The  circuit  elements  of  the  amplifier  were  designed  by 
using  lumped  and  distributed  elements  and  fabricated  on 
PPO  (Poly  Phelylen  Oxide)  substrates  with  the  circuit 
dimensions  of  11  x  16mm^  for  achieving  low  cost. 
The  driver  and  power  stage  FETs  employ  a  mold  type 
package  for  low  cost.  The  averaged  output  power, 
power-added  efficiency,  and  ACP  (50kHz  offset)  were 
measured  for  TT /4-shift  QPSK  modulated  signals  at  UHF 
and  plotted  in  Fig.  7.  A  power-added  efficiency  of  44% 
and  an  ACP  of  -49.8dBc  have  been  achieved  for  an 
averaged  output  power  of  3  IdBm. 


-5  0  5  10  15 


Input  Power  [dBm] 

Fig.7  Measured  output  power,  efficiency,  and  ACP 

4.  High  Power  Amplifier  Employing  MMIC 

A  photograph  of  the  1.9GHz  three-stage  high  efficiency, 
linear  MMIC  amplifier  is  shown  in  Fig.  8  [8].  The  gate 
peripheries  of  the  1st,  2nd,  and  final  stage  FETs  were 
determined  to  0.3,  0.5,  and  4nun,  respectively,  to  achieve 
the  maximum  efficiency  at  an  averaged  output  power  of 
22dBm  for  7r/4-shift  QPSK  modulated  signals.  The 
optimum  load  impedance  providing  the  maximum 
efficiency  at  a  3dB  backed  off  output  power  was  obtained 
first  for  the  final  stage  FET  by  load-pull  measurements. 
Next,  the  optimum  source  impedance  providing  the 
minimum  insertion  phase  variation  with  input  power 
was  obtained  by  source-pull  measurements.  The  1st 
and  2nd  stage  amplifier  are  biased  under  class  A 
operation  to  prevent  amplitude  and  phase  distortion. 
The  final  stage  amplifier  is  biased  under  class  AB 
operation  to  improve  efficiency.  The  drain  voltage  is 
3.3V. 


Fig.  8  Photograph  of  the  1.9GHz  three-stage  high 
efficiency,  linear  MMIC  amplifier 
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The  circuits  elements  were  fabricated  by  using  lumped 
and  distributed  elements  on  GaAs  substrates  with  the 
chip  dimensions  of  2.7  x  2.7mm^,  All  matching  and 
bias  circuit  elements  are  included  in  the  MMIC  chip. 
The  averaged  output  power,  power-added  efficiency,  and 
ACP  (600kHz  offset)  were  measured  at  1.9GHz  for  7r/4- 
shift  QPSK  modulated  signals  (398kbps)  and  plotted  in 
Fig.  9.  A  power-added  efficiency  of  35%  and  an  ACP  of 
-55dBc  have  been  achieved  at  an  averaged  power  of 
22dBm. 


CI4 


Fig.9  Measured  output  power,  efficiency,  and  ACP 

5.  Conclusion 


Three  types  of  high  power  amplifier  modules  employing 
HMIC,  MMIC,  and  a  combination  of  HMIC  and  MMIC 
have  been  presented.  It  has  been  shown  that  these 
amplifier  modules  utilizing  distinguished  features  of 
HMIC  and  MMIC  technologies  would  be  a  candidate  for 
high  power  amplifier  modules  with  low  cost,  small  size, 
low  voltage  operation,  and  high  performance  for  use  in 
mobile  communication  systems. 
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Abstract  -  This  paper  gives  an  overview  on  alternative 
approaches  for  the  implementation  of  recent  hybrid 
coding  schemes.  An  efficiency  measure  is  introduced 
and  applied  for  the  comparison  of  video  signal  proces¬ 
sors,  proposed  in  the  literature.  Concerning  recent 
hybrid  coding  schemes,  the  comparison  shows  that 
adapted  architectures  achieve  an  efficiency  gain  by  a 
factor  of  6  to  7  compared  to  flexible  architectures, 
applicable  for  a  wider  range  of  application  fields.  It  is 
expounded  that  this  efficiency  might  decrease  for  future 
video  compression  schemes,  due  to  data  dependent 
computational  requirements  of  the  applied  algorithms. 


!•  INTRODUCTION 

Video  processing  has  been  a  rapidly  evolving  field 
for  telecommunications,  computer  and  media  indus¬ 
tries.  In  particular,  for  real  time  video  compression  ap¬ 
plications  a  growing  economical  significance  is  ex¬ 
pected  for  the  next  years.  Besides  digital  TV 
broadcasting  and  videophone,  services  like  multime¬ 
dia  education,  teleshopping,  or  video  mail  will  be¬ 
come  audiovisual  mass  applications. 

To  facilitate  world  wide  interchange  of  digitally 
encoded  audiovisual  data  there  is  a  demand  for  in¬ 
ternational  standards,  defining  coding  methods  and 
transmission  formats.  International  standardization 
committees  have  been  working  on  the  specification  of 
several  compression  algorithms.  The  Joint 
Photographic  Experts  Group  (JPEG)  of  the  Interna¬ 
tional  Standards  Organization  (ISO)  has  specified  an 
algorithm  for  compression  of  still  images  [4].  The  ITU 
proposed  the  H.261  standard  for  video  telephony  and 
video  conference  [1].  The  Motion  Pictures  Experts 
Group  (MPEG)  of  ISO  has  completed  its  first  standard 
MPEG-1,  which  will  be  used  for  interactive  video  and 
provides  a  picture  quality  comparable  to  VCR  quality 
[2].  MPEG  made  substantial  progress  for  the  second 


phase  of  standards  MPEG-2,  which  will  provide  au¬ 
diovisual  quality  of  both  broadcast  TV  and  HDTV  [3]. 
Besides  the  availability  of  international  standards,  the 
successful  introduction  of  the  named  services  depends 
on  the  availability  of  VLSI  components,  supporting  a 
cost  efficient  implementation  of  video  compression 
applications. 

In  this  paper  we  give  a  short  overview  of  recent 
coding  schemes  and  discuss  basic  implementation  al¬ 
ternatives  in  section  3.  Section  4  provides  a  figure  of 
merit  for  architectural  efficiency  of  the  alternative  im¬ 
plementation  strategies.  Section  5  discusses  future 
trends  of  video  compression  applications  and  their 
VLSI  implementation. 

2.  RECENT  CODING  SCHEMES 

Recent  video  coding  standards  are  based  on  a  hy¬ 
brid  coding  scheme  that  combines  transform  coding 
and  predictive  coding.  An  overview  of  these  hybrid 
encoding  schemes  is  depicted  in  figure  1 . 


Fig.  1:  Hybrid  encoding  scheme 


The  coding  scheme  consists  of  the  tasks  motion 
estimation  (ME)  typically  based  on  blockmatching  al- 
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gorithms,  computation  of  the  prediction  error  (PRE), 
discrete  cosine  transform  (DCT),  quantization  (Q), 
variable  length  coding  (VLC),  inverse  quantization 
(IQ),  inverse  discrete  cosine  transform  (IDCT)  and  re¬ 
construction  (REC).  The  reconstructed  image  data  are 
stored  in  an  image  memory  for  further  predictions. 
The  decoder  performs  the  tasks  variable  length  decod¬ 
ing,  inverse  quantization,  and  motion  compensated  re¬ 
construction. 

Generally,  video  processing  algorithms  can  be 
classified  in  terms  of  regularity  of  computation  and 
data  access.  This  classification  leads  to  three  classes 
of  algorithms: 

Low-Level  Algorithms 

These  algorithms  are  based  on  a  predefined  se¬ 
quence  of  operations  and  a  predefined  amount  of 
data  at  the  input  and  output.  Typical  examples  are 
block  matching  or  transforms  like  the  DCT. 

Medium-Level  Algorithms 

The  sequence  of  operations  of  medium-level  algo¬ 
rithms  depends  on  the  data.  Typically,  the  amount 
of  input  data  is  predefined,  whereas  the  amount  of 
output  data  varies  according  to  the  input  data  val¬ 
ues.  With  respect  to  hybrid  coding  schemes,  exam¬ 
ples  for  these  algorithms  are  quantization,  inverse 
quantization  or  variable  length  coding. 

High-Level  Algorithms 

High-level  algorithms  are  associated  with  a  vari¬ 
able  amount  of  input  and  output  data  and  a  data 
dependent  sequence  of  operations.  Control  tasks  of 
the  hybrid  coding  scheme  are  generally  members 
of  this  class. 

Since  hybrid  coding  schemes  are  applied  for  vary¬ 
ing  video  data  rates,  the  required  absolute  processing 
power  varies  in  the  range  from  a  few  hundreds  of 
MOPS  (Mega  Operations  Per  Second)  for  video  sig¬ 
nals  in  QCIF  format  to  several  GOPS  (Giga  Opera¬ 
tions  Per  Second)  for  processing  of  TV  or  HDTV  sig¬ 
nals.  Nevertheless,  the  relative  computational  power 
of  each  algorithmic  class  is  nearly  independent  of  the 
processed  video  format.  In  case  of  hybrid  coding  ap¬ 
plications,  approximately  90%  of  the  overall  proces¬ 
sing  power  is  required  for  low-level  algorithms.  The 
share  of  medium-level  tasks  is  about  7%  and  nearly 
3%  is  required  for  high-level  algorithms. 


3.  ARCHITECTURAL  ALTERNATIVES 

In  terms  of  a  VLSI  implementation  of  hybrid  cod¬ 
ing  applications  two  major  requirements  can  be  identi¬ 
fied.  First,  the  high  computational  power  requirements 


have  to  be  met  by  the  hardware.  Second,  low 
manufacturing  cost  of  video  processing  components  is 
essential  for  the  economic  success  of  an  architecture. 
Depending  on  the  envisaged  application  field,  imple¬ 
mentation  size  and  architectural  flexibility  might  have 
to  be  taken  into  account. 

Basically,  implementations  of  video  processing 
applications  can  either  be  based  on  standard  proces¬ 
sors  from  workstations  or  PCs  or  on  specialized  video 
signal  processors.  The  major  advantage  of  standard 
processors  is  their  availability.  Applying  these  archi¬ 
tectures  for  implementation  of  video  processing  hard¬ 
ware,  does  not  require  the  time  consuming  design  of 
new  VLSI  components.  The  disadvantage  of  this  im¬ 
plementation  strategy  is  the  insufficient  processing 
power  of  recent  standard  processors.  Video  processing 
applications  would  still  require  the  implementation  of 
multiprocessor  systems  to  meet  the  computational  re¬ 
quirements. 

To  achieve  compact  implementations  video  pro¬ 
cessing  hardware  has  to  be  based  on  specialized  video 
signal  processors,  adapted  to  the  requirements  of  the 
envisaged  application  field. 

Although  adaptation  of  the  architecture  leads  to 
small  implementation  size  and  decreased  manufactur¬ 
ing  cost,  the  computational  requirements  of  video  pro¬ 
cessing  applications  demand  for  the  exploitation  of 
the  algorithm— inherent  independence  of  basic  arith¬ 
metic  operations  to  be  performed.  Independent  opera¬ 
tions  can  be  processed  concurrently,  which  enables  the 
decrease  of  processing  time  and  thus  an  increased 
through-put  rate.  For  the  architectural  implementation 
of  concurrency  two  basic  strategies  can  be  distin¬ 
guished:  pipelining  and  parallel  processing. 

In  case  of  pipelining  several  tasks,  operations,  or 
parts  of  opierations  are  processed  in  subsequent  steps 
in  different  hardware  modules.  Depending  on  the  se¬ 
lected  granularity  level  for  the  implementation  of  pi¬ 
pelining,  intermediate  data  of  each  step  are  stored  in 
registers,  register  chains,  FIFOs,  or  dual— port  memo¬ 
ries.  Assuming  a  processing  time  of  Tp  for  an  non— pi¬ 
pelined  processor  module  and  Td  im  for  the  delay  of 
intermediate  memories  we  get  in  the  ideal  case  the 
following  estimation  for  the  processing  time  Tppipe 
for  a  pipelined  architecture  with  Npipe  pipeline  stages: 


Pipe 

From  this  follows,  that  the  major  limiting  factor  for 
the  maximum  applicable  degree  of  pipelining  is  the 
access  delay  of  these  intermediate  memories. 

The  alternative  to  pipelining  is  the  implementa¬ 
tion  of  parallel  units,  processing  independent  data 
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concurrently.  Parallel  processing  can  be  applied  on 
operation  level  as  well  as  on  task  level.  Assuming  the 
ideal  case,  this  strategy  leads  to  a  linear  increase  of 
processing  power  and  we  get: 

T  =  n\ 

where  Npar  =  Number  of  parallel  units 

According  to  the  three  ways  for  architectural  opti¬ 
mization,  adaptation,  pipelining,  and  parallel  proces¬ 
sing,  three  architectural  classes  for  the  implementa¬ 
tion  of  video  signal  processors  can  be  distinguished: 

Intensive  Pipelined  Architectures 
These  architectures  are  typically  scalar  architec¬ 
tures  that  achieve  high  clock  frequencies  of  several 
hundreds  of  MHz  due  to  the  exploitation  of  pipelin¬ 
ing. 

Parallel  Data  Paths 

These  architectures  exploit  data  distribution  for  the 
increase  of  computational  power.  Several  parallel 
data  paths  are  implemented  on  one  processor  die. 


which  leads  in  the  ideal  case  to  a  linear  increase  of 
supported  computational  power.  The  number  of 
parallel  data  paths  is  limited  by  the  semiconductor 
process,  since  an  increase  of  silicon  area  leads  to  an 
decrease  of  hardware  yield. 

Coprocessor  Architectures 
Coprocessors  are  known  from  general  processor 
designs  and  are  often  used  for  specific  tasks,  e.g. 
floating  point  operations.  The  idea  of  the  adapta¬ 
tion  to  specific  tasks  and  increase  computational 
power  without  an  increase  of  the  required  semicon¬ 
ductor  area  has  been  applied  by  several  designs. 
Due  to  their  high  regularity  and  the  high  processing 
power  requirements  low-level  tasks  are  the  most 
promising  candidates  for  an  adapted  implementa¬ 
tion.  The  main  disadvantage  of  this  architectural 
approach  is  the  decrease  of  flexibility  by  an  in¬ 
crease  of  adaptation. 

A  selection  of  proposed  VLSI  architectures  for 
hybrid  coding  schemes  and  their  implementation  data 
are  shown  in  table  1 . 


Ref. 

Architectural 

Class 

Technology 

[micron] 

Chip  Size 
[mm^J 

[7] 

Coprocessor 

0.9 

132  (Coder) 

112  (Decod.) 

[8] 

Coprocessor 

0.5 

312 

[9] 

Coprocessor 

0.8 

160 

[10] 

Coprocessor 

1.0 

77  (VP), 

80  ?  (VC) 

[11] 

Coprocessor 

0.8 

144 

[12] 

Coprocessor 

0.6 

80 

[13] 

Intens.  Pipelining 

0.8 

202 

[14] 

Intens.  Pipelining 

0.5 

281 

[15] 

Parallel  Data 
Paths 

1.0 

225 

[16] 

Parallel  Data 
Paths 

0.8 

231 

[17] 

Parallel  Data 
Paths 

0.5 

324 

[18] 

Parallel  Data 
Paths 

0.6 

180 

#  Chips  required 
for  CIF-30HZ 
H.261  codec 


Comments 


w/o  motion  estimation.  This  chip 
has  been 

designed  for  MPEG-2  applications 


frame  rate  15  Hz 


2  VC  +  3  VP  I  VC’s  chip  size  roughly  estimated 


Table  1  Overview  of  selected  architectures  for  video  coding  applications 
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4.  ASSESSMENT  OF  VLSI 
ARCHITECTURES 

For  a  comparison  of  architectural  alternatives  an 
efficiency  measure  has  to  be  defined.  As  a  first  ap¬ 
proach  efficiency  E  can  be  expressed  by  the  ratio  of 
through-put  rate  Rj  and  the  required  silicon  area  Asi  : 


Since  the  required  silicon  area  and  the  through-put 
rate  for  an  implementation  of  a  specific  application 
depend  on  the  applied  semiconductor  technology,  a 
realistic  architectural  assessment  has  to  consider  the 
gains  provided  by  the  progress  in  semiconductor 
technology.  A  sensible  way  to  achieve  a  realikic  as¬ 
sessment,  is  the  normalization  of  the  architecture  pa¬ 
rameters  according  to  a  reference  technology.  In  the 
following  we  assume  a  reference  process  with  a  grid 
length  Xo  =  1.0  micron.  Assuming  the  ’’constant  field” 
model,  the  processing  time  of  a  given  process  with 
grid  length  X  scales  as  X/Xq.  This  model  supposes  a  lin¬ 
ear  scaling  of  voltages  and  doping  concentration.  In 
the  ’’constant  voltage”  model  of  scaling  of  MOS  tran¬ 
sistors  the  gate  delay  scales  as  (X/Xq)^.  These  two 
models  do  not  take  into  account  the  long  distance  in¬ 
terconnection  delay  (which  basically  does  not  scale 
[19]),  the  short  channel  effects  and  the  limitations  due 
to  the  power  dissipation  issue.  Reality  is  not  well  de¬ 
scribed  by  the  traditional  models  of  scaling  (the  volt¬ 
age,  at  least  for  function  specific  architectures  re¬ 
ported  up  to  now,  has  not  been  reduced  linearly  with 
X).  Thus  it  is  reasonable  to  use  an  empiric  measure  of 
processor  speed  vs.  technology.  Bakoglu  produces  a 
comprehensive  list  of  microprocessors,  with  their 
clock  frequency  and  their  gate  length  X  [19].  From 
these  data,  it  can  be  estimated  that  the  cycle  time  of 
these  processors,  which  results  from  a  combination  of 
all  the  factors  mentioned  above,  scales  approximately 
as  (XAo)^-^.  Thus,  for  comparison  of  architecture  al¬ 
ternatives  we  use  in  this  paper  a  normalization  by 
(X/Xo)^  for  the  silicon  area  and  (Xo/X)^*^  for  through¬ 
put. 


where  the  index  0  is  used  for  the  system  with  refer¬ 
ence  gate  length  Xq. 

Equations  (4)  and  (5)  can  be  applied  for  the  com¬ 
parison  of  the  architectural  alternatives  shown  in 


table  1.  The  result  of  this  comparison  is  shown  in  fig¬ 
ure  2,  using  the  coding  scheme  according  to  ITU  rec¬ 
ommendation  H.261  as  a  benchmark. 

From  figure  2,  we  can  identify  two  groups  of  ar¬ 
chitectural  classes.  The  first  group  consists  of  adapted 
architectures,  optimized  for  hybrid  coding  applica¬ 
tions,  like  ITU-H.261  or  ISO  MPEG-1.  The  architec¬ 
tures  contain  one  or  more  adapted  modules  for  com¬ 
putation  intensive  tasks,  like  DCT  or  block  matching. 
It  is  obvious  that  the  application  field  of  these  archi¬ 
tectures  is  limited  to  a  small  range  of  applications. 
This  limitation  is  avoided  by  the  members  of  the  se¬ 
cond  group  of  architectures.  Most  of  these  architec¬ 
tures  do  not  contain  function  specific  circuitry  for  spe¬ 
cific  tasks  of  the  hybrid  coding  scheme.  Thus,  they 
can  be  applied  for  wider  variety  of  applications  with¬ 
out  a  significant  loss  of  sustained  computational  pow¬ 
er.  On  the  other  hand,  these  architectures  are 
associated  with  a  decreased  architectural  efficiency 
compared  to  the  first  group  of  proposed  architectures: 
Adapted  architectures  achieve  an  efficiency  gain  of 
about  6  to  7. 

Normalized 

Area 


imm^l 


Normalized 
Frame  Rate 
[Hz] 


Fig.  2:  Normalized  silicon  area  and  throughput  (frame 
rate)  for  adapted  and  flexible  programmable  ar¬ 
chitectures  for  a  H.261  codec 
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5.  FUTURE  TRENDS 

The  hybrid  coding  scheme  is  based  on  the  seg¬ 
mentation  of  the  input  image  into  square  blocks  of 
equal  size  (typ.  16x16  pels).  These  blocks  are  encoded 
according  to  the  scheme  presented  in  section  2.  It  is 
obvious,  that  the  predefined  segmentation  of  the 
image  will  generally  not  correspond  to  the  image  con¬ 
tent.  As  an  alternative  to  hybrid  coding  approach,  con¬ 
tent  based  coding  schemes  are  proposed.  These  ap¬ 
proaches  are  based  on  a  segmentation  of  the  image 
data  at  runtime  according  to  the  image  content.  It  is 
expected  that  content  based  video  compression 
schemes  will  be  the  basis  for  the  next  generation  of 
video  coding  standards.  For  example,  content  based 
video  coding  is  proposed  for  the  ISO  MPEG4  stan¬ 
dardization  that  is  currently  under  progress  and  will  be 
finished  within  this  century  [5]. 

Concerning  the  VLSI  implementation  of  content 
based  video  coding  schemes  the  major  difference  in 
comparison  to  hybrid  coding  scheme  is  the  increased 
share  of  medium-level  and  high-level  algorithms.  It 
can  be  expected  that  the  increased  share  of  data  de¬ 
pendent  algorithms  will  lead  to  a  decreased  efficiency 
gain  of  adapted  architectures  in  comparison  to  flexible 
programmable  architectures.  To  clarify  this  statement, 
scheduling  effects  for  the  processing  of  hybrid  coding 
schemes  and  coding  schemes  associated  with  data  de¬ 
pendent  computational  requirements  will  be  shortly 
discussed. 

For  simplification  of  the  discussion  we  assume 
three  different  tasks.  In  case  these  tasks  are  associated 
with  data  independent  computational  requirements, 
each  task  can  be  processed  by  one  adapted  module. 
Generally,  the  processing  power  of  each  module  can 
be  adapted  to  the  requirements  of  the  applied  algo¬ 
rithms.  The  utilization  of  the  modules  is  maximized,  if 
all  the  processing  times  Tpj  of  modules  I  are  equal.  In 
this  case,  a  100%  utilization  can  be  achieved,  if  the 
modules  operate  in  a  macropipeline  mode.  This  is  cla¬ 
rified  by  figure  3. 


Module  1 


Module  2 

Module  3 


Fig.  3:  Processing  of  algorithms  associated  with  data 
independent  computational  requirements 


Utilization  of  adapted  modules  might  decrease  if 
the  tasks  are  associated  with  data  dependent  computa¬ 
tional  requirements.  In  this  case  the  processing  time  of 
each  task  cannot  not  be  determined  at  design  time  of 
the  circuits.  The  result  is  varying  idle  times  of  the 
modules,  according  to  the  image  content  (Fig.  4  a). 

In  case  of  programmable  architectures,  the  avail¬ 
able  hardware  resources  can  be  assigned  according  to 
the  varying  computational  requirements  at  runtime 
and  the  impact  of  varying  computational  requirements 
is  decreased.  In  consequence,  the  supported  coniputa- 
tional  power  of  programmable  architectures  has  to 
meet  the  overall  computational  power  required  to  pro¬ 
cess  the  envisaged  application  in  realtime  (Fig.  4  b). 

a)  Adapted  Architecture 

Module  1 


Module  2 


Module  3 


b)  Programmable  Architecture 


t 

Fig.  4:  Processing  of  algorithms  associated  with  data 
dependent  computational  requirements  pro¬ 
cessed  with  adapted  architectures  and  program¬ 
mable  architectures 

Due  to  the  increased  share  of  idle  times,  the  ad¬ 
vantages  in  terms  of  architectural  efficiency  of 
adapted  architectures  will  become  smaller  for  future 
sophisticated  video  compression  schemes.  Thus,  it  can 
be  expected  that  flexible  programmable  architectures 
will  become  more  attractive,  even  for  mass  applica¬ 
tions. 


6.  CONCLUSION 

The  properties  of  recent  hybrid  coding  schemes  in 
terms  of  VLSI  implementation  have  been  presented. 
Architectural  alternatives  for  the  realization  have  been 
discussed.  Based  on  a  simplified  efficiency  modelling, 
it  has  been  shown  that  adapted  circuits  achieve  a  6-7 
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times  higher  efficiency  than  flexible  programmable 
circuits.  This  efficiency  gap  might  decrease  for  future 
coding  schemes  associated  with  a  higher  amount  of 
medium-level  and  high-level  algorithms.  Due  to  their 
flexibility,  programmable  architectures  will  become 
more  and  more  attractive  for  future  VLSI  implementa¬ 
tions  of  video  compression  schemes. 
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Abstract 

We  have  proposed  and  analyzed  a  combined  source-channel 
coding  scheme  using  multicarrier  modulation.  By  changing 
the  power  and  modulation  of  subchannels  carrying  different 
bits  of  the  quantized  signal,  the  channel-induced  distortion 
can  be  minimized.  An  algorithm  for  the  subchannel  power 
allocation  is  derived.  For  discrete  cosine  transform- 
(DCT-)  coded  images,  multicarrier  systems  achieve  more 
than  a  9-dB  improvement  over  single-carrier  systems  on 
very  noisy  channels. 

1.  Introduction 

One  of  the  most  important  results  of  Shannon’s  work 
was  that  source  coding  and  channel  coding  can  be  treated 
separately  without  sacrificing  fidelity  [l]-[2].  Traditionally, 
the  source  and  channel  codes  are  designed  separately  and 
then  cascaded  together.  However,  Shannon’s  argument  is 
true  only  if  both  transmitter  and  receiver  are  permitted  to 
have  an  infinite  degree  of  complexity.  In  practice,  because  of 
channel-induced  additive  noise,  the  probability  of  bit  error 
(bit-error  rate,  BER)  is  not  infinitesimally  small.  In  typical 
communication  systems,  in  order  to  transmit  an  analog 
source,  the  source  is  quantized  into  binary  bits  and  all  bits 
are  transmitted  via  the  same  channel,  and  are  thus  afforded 
the  same  protection  against  error.  However,  some  bits  in  the 
quantized  signal  (e.g.,  the  most  significant  bit)  are  far  more 
important  than  others.  In  such  a  situation,  we  can  combine 
the  source  and  channel  codes  into  a  single  code  in  which  the 
objective  is  to  minimize  the  overall  distortion  between  the 
original  source  at  the  transmitter  and  its  reconstruction  at  the 
receiver. 

The  schematic  diagram  of  a  multicarrier  modulation 
system  is  shown  in  Fig.  1.  In  transmission  of  a  quantized 
analog  source,  each  subchannel  [3]-[4]  can  utilize  different 
power,  modulation  constellation,  channel  encoding,  and  car¬ 
rier  ,  so  that  different  error  protection  can  be  provided 
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for  different  bits.  The  distortion  between  the  received  signal 
and  the  original  signal  can  thereby  be  minimized.  In  this 
paper,  we  will  consider  the  using  of  multicarrier  modulation 
to  transmit  disaete  cosine  transform-  (DCT-)  coded  images. 

2.  Channel-Induced  Distortion 

Our  combined  source-channel  scheme  uses  a  fixed 
source  encoder  and  changes  the  channel  modulation  to 
reduce  the  channel-induced  distortion.  The  source  encoder 
consists  of  a  DCIT  followed  by  scalar  quantizers.  As  the  DCT 
coefficients  are  known  to  be  accurately  modeled  as  Lapla- 
cian-distributed  random  variables,  the  quantizer  is  optimized 
for  a  Laplacian  source  using  the  method  of  Lloyd  [5]  and 
Max  [6].  The  quantizer  partitions  the  real  axis  into  Q  regions 
of  (-0O,  jtJ,  {Xi,  Xi+i],  /  =  1,  2,...,  Q-2,  (xq  _  1,  +oo).  Each 
region  is  quantized  to  the  centroid  of  that  region,  y,.  For  sim¬ 
plicity,  we  use  the  natural  binary  coding,  i.e.,  the  codeword 
y,-  is  mapped  to  binary  codeword  i. 

Because  a  codeword  represents  the  centroid  of  the 
respective  partition,  the  overall  distortion  from  the  source  at 
the  transmitter  to  the  reconstructed  signal  at  the  receiver  is 
the  sum  of  the  quantization  distortion  and  the  channel- 
induced  distortion  [7].  Because  the  quantizer  is  given  and 
fixed,  in  this  paper,  we  seek  to  minimize  the  channel-induced 
distortion: 

Q-lQ-l 

Dc=  'Z^PyPU\i)(yj-yi)\  (1) 

i=0j=0 


Fig.  1  Schematic  diagram  of  a  multicarrier  system.  is 
the  power  transmitted  in  the  mth  subchannel,  (p„,(0  is  the 
carrier  of  the  mth  subchannel,  and  n„  is  the  numter  of  bits 
per  symbol  of  the  mth  subchannel. 
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where  is  a  priori  probability  of  codeword  y,-,  and  P(j\i)  is 
the  transition  probability  for  transmission  of  binary  code¬ 
word  I  and  reception  of  binary  codeword  j. 

The  binary  codewords  are  binary  numbers  of  n  =  \0g2Q 
bits.  Each  bit  is  transmitted  over  a  different  subchannel  using 
multicarrier  modulation.  Assuming  that  the  BER  of  each 
channel  is  m  =  1, 2,...,  n,  the  transition  probabilities  are 

m=  1 

where  Z„  ii,j)  =  1  if  the  binary  codewords  represented  by  i 
and  j  differ  in  the  mth  position  and  otherwise  /„  {i,j)  =  0. 
Usually,  the  BERs  Pj,  are  small,  so  that  the  probability  of 
multiple  bit  errors  within  the  same  binary  codeword  is  small, 
and  any  terms  having  factors  of  p*  pu  , 
ml,  m2  =  1, 2,  ...n,  can  be  ignored.  The  transition  proba¬ 
bilities  are  thus  simplified  to 

f  1  «■=;■ 

P(j\i)  =  \Pb„  /„(«,;■)  =1.  (3) 

^  0  otherwise 

In  this  simplification,  P  (j\  i)  is  non-zero  if  i  and  j  are  equal, 
or  are  separated  by  a  unit  Hamming  distance.  For  any  n-bit 
binary  number  i,  there  are  n  different  binary  numbers  sepa¬ 
rated  from  i  by  a  Hamming  distance  of  unity.  If  i^  is  the  mth 
bit  in  the  binary  representation  of  i,  i.e., 
i  =  those  «  binary  numbers  are 

ie.m  =  '■+  (l-2iJ2"-'",m=l,2,...,«.  (4) 

After  some  algebra,  the  channel-induced  distortion  is  found 
to  be: 

«  C- 1  n 

=  X  Z p^py, Z 

/M  =  li  =  0  m  =  i 


3.  Subchannel  Power  Allocation 

The  2-D  DCT  has  been  shown  to  provide  relatively  effi¬ 
cient  and  robust  performance  in  a  variety  of  image  coding 
applications.  The  image  is  divided  into  small  blocks  of  size 
pxq,  and  a  2-D  DCT  is  employed  for  each  small  block.  The 
variances  of  transform  coefficient  0<k<p,  0^l<q, 
can  be  estimated  from  the  image  or  derived  from  the  image 
model. 

Assume  that  /  is  the  bit  allocation  table  determined  by 
a  bit  allocation  algorithm  [9,  pp.  501-504].  Assuming  that 
the  DCT  coefficients  are  Laplacian-distributed,  if  w„^  „  are 
the  weighting  factors  of  the  mth  bit  of  an  optimal  quantizer 
for  a  unit-variance  Laplacian  source,  obviously,  the  weight¬ 
ing  factor  of  the  corresponding  source  bit  is  ^  j.  Let 

us  assume  that  pxq  2-D  DCT  coefficients  are  quantized  to 
N  bits  and  transmitted  by  a  AT-subchannel  multicarrier  modu¬ 
lation  system.  Assuming  that  each  subchannel  has  a  power 
of  E„^  J,  the  BER  of  the  each  subchannel  is  a  function  of 
\,.pOrp  =  P„^^  where  P,f(E)  is  the  BER 

function,  which  can  be  different  for  the  individual  subchan¬ 
nels  if  different  modulation  schemes  are  employed.  There¬ 
fore,  we  would  like  to  minimize  the  channel-induced 
distortion  per  pixel: 


(7) 


it=  i;=  lj=  1 


with  the  constraint  that  the  total  power  is  equal  to  Ef.  Using 
a  Lagrange  multiplier,  the  solution  of  (7)  is  [8] 


=>•. 


(8) 


where  G^E)  =  dP^E)ldE  is  the  derivative  of  P^E)  with 
respect  to  the  power  E,  and  X  is  the  Lagrange  multiplier.  For 
a  fixed  overall  power,  the  multiplier  X  can  be  found  by  solv¬ 
ing 


where 


=  ^f:oPySyi,.„-y>^^ 

are  the  weighting  factors  of  each  bit. 

The  weighting  factors  w„  „  and  other  parameters  of 
optimal  scalar  quantizers  of  1  to  9  bits,  designed  by  Method  I 
of  Lloyd  [5]  for  a  zero  mean,  unit- variance  Laplacian  source, 
can  be  found  in  [8]. 


*=  l/=  1;=  1 

where  (E)  are  the  inverse  functions  of  G,^E).  After  the 
multiplier  X  is  evaluated,  the  power  allocation  can  be  calcu¬ 
lated  by 


(10) 


Although  complicated,  the  expression  (9)  is  a  one-variable 
equation  that  can  be  solved  easily  by  numerical  methods  [8]. 
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4.  Numerical  Example 

As  a  numerical  example,  we  consider  images  transmit¬ 
ted  over  an  additive  white  Gaussian-noise  (AWGN)  channel 
Without  loss  of  generality,  we  assume  that  all  subchannels 
are  identically  modulated  using  BPSK.  For  Gaussian  chan¬ 
nel,  the  function  G^E)  is  a  monotonic  function  so  that  (9) 
can  be  solved  by  the  bisection  method. 

The  original  256  gray-scale  Lena  image  in  Fig.  2  is 
employed  for  a  demonstration.  First,  128  is  subtracted  from 
the  image,  and  then  the  image  is  divided  into  small  square 
blocks  of  size  16  x  16.  The  2-D  E>CT  is  computed  for  each 
block  and  the  variances  of  the  DCT  coefficients  are  esti¬ 
mated.  The  transform  coefficients  are  normalized  by  their 
corresponding  standard  deviations  /  and  quantized  by  the 
optimal  quantizer  for  a  unit-variance  Laplacian  distribution. 
The  bit  allocation  table  is  determined  by  the  integer  bit-allo¬ 
cation  algorithm  [9,  pp.  503-504]. 


Fig.  2  The  original  gray-scale  image  of  Lena. 

Single-Carrier  BER, 


Average  Channei  SNR,  (dB) 

Fig.  3  PSNR  as  a  function  of  average  SNR  of  the 
channel  and  single-channel  BER.  Both  theoretical  and 
simulation  results  are  shown  for  comparison. 


Fig.  3  shows  the  peak  signal-to-noise  ratio  (PSNR)  as  a 
function  of  average  signal-to-noise  ratio  (SNR)  of  the  chan¬ 
nel,  and  the  corresponding  BER  of  the  single-carrier  system. 
The  image  is  compressed  to  both  0.25  bpp  (bit/pixel)  and 
1  bpp.  Both  theoretical  results  and  simulation  results  are 
shown  for  comparison.  In  the  calculation  of  the  theoretical 
result,  the  channel  distortion  is  computed  using  expression 
(5).  The  difference  between  the  theoretical  and  simulation 
PSNR  is  smaller  than  0.8  dB,  allowing  us  to  infer  the  validity 
of  both  the  Laplacian  distribution  and  the  approximation  of 
transition  probabilities  (3).  For  high  channel  SNRs,  the 
PSNR  of  single-carrier  and  multicarrier  systems  are  identical 
because  the  BER  is  small  and  the  quantization  distortion  is 
much  larger  than  the  channel-induced  distortion. 

For  low  channel  SNRs,  the  PSNR  of  the  multicarrier 
system  is  much  larger  than  that  of  the  single-carrier  system. 
At  SNRs  less  than  3  dB,  the  improvement  in  the  PSNR  of  the 
multicarrier  over  the  single-carrier  system  is  more  than 
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Fig.  4  The  compressed  image  of  0.25  bpp  and  the  bit 
allocation  table.  PSNR  =  28.05  dB. 


(a)  Single-carrier  (b)  Multicarrier 


Fig.  5  Reconstructed  image  at  the  receiver  of  single- 
carrier  and  multicarrier  systems.  The  average  SNR  of  the 
channel  is  4.32  dB,  corresponding  to  a  single-carrier  BER  of 
Pjj  =  10'^.  (a)  Transmitted  through  single-carrier  system, 
PSNR  =  20.52  dB.  (b)  Transmitted  through  multicarrier 
system,  PSNR  =  27.01  dB. 
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9.5  dB  for  the  0.25  bpp  image  and  more  than  13.5  dB  for  the 
1  bpp  image.  Fig.  4  shows  the  compressed  image  of  0.25  bpp 
and  the  corresponding  bit-allocation  table.  Fig.  5  compares 
the  images  transmitted  over  a  single-carrier  and  a  multicar¬ 
rier  system  for  SNR  =  4.32  dB,  which  corresponds  to  a  sin¬ 
gle-carrier  BER  of  10'^.  A  6.5-dB  improvement  in  PSNR  is 
achieved  by  use  of  multicarrier  modulation.  While  channel 
errors  cause  the  PSNR  of  the  image  transmitted  over  the  sin¬ 
gle-carrier  system  to  degrade  by  7.5  dB,  that  of  the  image 
transmitted  over  the  multicarrier  system  degrades  by  only  1 
dB. 


5.  Conclusions 

A  combined  source-channel  coding  scheme  using  multi¬ 
carrier  modulation  is  proposed  and  analyzed  for  DCT-coded 
images.  Allocation  of  subchannel  powers  according  to  the 
importance  of  each  bit  achieves  a  large  improvement  over 
single-carrier  modulation.  A  numerical  example  shows  that 
on  very  noisy  channels,  multicarrier  systems  can  achieve  an 
improvement  over  single-carrier  systems  of  more  than  9  dB 
in  terms  of  signal-to-distortion  ratio. 

In  general,  subband-coded  images  [10]  have  better  qual¬ 
ity  than  DCT-coded  images.  We  are  currently  studying  the 
application  of  the  combined  source-channel  coding  scheme 
to  subband-coded  images. 
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Abstract 

The  MPEG-2  source  coding  edgorithmis  very  sensitive  to  chan¬ 
nel  disturbances.  A  single  bit  error  in  the  bitstream  will  cause 
a  high  degradation  of  picture  quality  due  to  the  use  of  variable 
length  coding.  Therefore,  for  picture  replenishment  error  con¬ 
cealment  techniques  (ECT’s)  may  be  applied  at  the  receiver. 

The  aim  of  this  article  is  to  study  the  error  concealment 
(EC)  enhancements  due  to  the  use  of  the  soft  output  values  of 
the  SOVA^  (Soft  Output  Viterbi  Algorithm)  as  an  error  detec¬ 
tion  tedmique  (EDT)  for  MPEG-2  inter  coded  pictures  applied 
for  terrestrial  broadcasting. 

1  Introduction 

The  future  terrestrial  digital  TV/HDTV  signal  will  be  a 
highly  compressed  signal  because  of  the  restricted  band¬ 
width  of  the  channels  (UHF  and  VHF).  The  TV/HDTV 
source  coding  algorithms  used  for  this  applications  will  be 
based  on  MPEG-2  [1], 

The  TV  layer  should  be  highly  protected.  An  example 
of  a  channel  coding  and  modulation  scheme  for  terres¬ 
trial  hierarchical  TV/HDTV- broadcasting  is  described  in 
[2,  3].  At  very  long  distances  or  at  worst  reception  con¬ 
ditions,  i.e.  deep  fades  or  impulsive  noise,  the  portable 
receiver,  which  suffers  from  the  non  existence  of  a  grace¬ 
ful  degradation,  may  risk  to  receive  nothing  (the  threshold 
effect).  To  avoid  this  or  to  enhance  the  picture  quality, 
one  may  apply  the  ECT’s  at  the  receiver. 

The  aim  of  this  article  is  to  study  the  EC  enhancements 
in  MPEG-2  Video  decoder  due  to  the  use  of  the  reliability 
outputs  of  the  SOVA,  taking  into  account  a  real  transmis¬ 
sion  medium  [2].  For  the  simulations  a  Channel  Model  is 
used,  which  reproduces  the  errors  and  the  appropriate  re¬ 
liability  values  for  each  bit  at  the*  receiver  side. 

The  paper  is  organized  as  follows:  In  section  2  the 
Channel  Model  for  the  simulations  is  examined.  In  sec¬ 
tion  3  the  different  ED  and  temporal  ECT’s  are  described 
briefly.  Simulation  results  are  given  in  section  4  and  sec¬ 
tion  5  is  devoted  to  some  conclusions. 


2  The  Channel  Model 


In  this  section  a  Channel  Model,  i.e.  an  error-pattern 
model  for  the  whole  terrestrial  transmission  scheme,  is 
described.  This  model  reproduces  the  error-pattern  in 
the  channel  decoder.  It  is  an  easy  way  to  simulate  the 
error-pattern' without  the  simulation  of  the  whole  system. 

*The  author  is  within  the  European  dTTb-Race  and  the 
German  ^  DTVx  projects 
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This  model  can  also  be  used  for  a  hierarchical  scheme  with 
different  layers  [2]. 

The  channel  encoder  is  based  on  a  concatenated  cod¬ 
ing  scheme.  The  inner  code  is  a  convolutional  code  with 
memory  m  =  6  and  different  rates  for  the  different  lay¬ 
ers,  and  the  outer  code  is  a  RS  (204, 188, 17)  code  over 
GE(2®),  which  can  handle  up  to  tmax  =  8  errors.  The 
outer  RS-codewords  are  adapted  to  the  MPEG-2  Trans¬ 
port  Packets  of  length  188  bytes. 

The  super  channel  comprises  a  channel  encoder,  a  mod¬ 
ulator,  an  OFDM  multiplexer,  the  transmission  channel, 
an  OFDM  demultiplexer,  a  demodulator  and  a  channel 
decoder.  The  model  can  be  described  as  follows: 

One  example  for  an  interleaver  between  the  inner  and 
the  outer  code  is  given  in  Figure  1.  The  length  of  the  outer 
code  is  A  =  204  and  the  depth  of  the  interleaver  is  /  =  8. 
The  bits  of  the  inner  decoder  are  mapped  into  symbols  of 
8  bits  (one  RS-symbol)  and  are  then  written  out  in  rows 
into  the  interleaver-matrix.  The  RS-codewords  from  the 
outer  code  will  be  read  out  in  columns. 

The  inner  decoder  is  a  Viterbi-decoder.  This  decoder 
generates  burst-errors.  Therefore,  it  is  necessary  to  de¬ 
fine  two  probabilities  for  the  symbol-error-probability: 
First  the  probability  P{Sw\S'r)  that  the  current  symbol 
S  is  wrong  under  the  condition  that  the  previous  sym¬ 
bol  5'  was  right,  and  on  the  other  hand  the  probability 
P(5ty|5'u;)  that  the  current  symbol  is  wrong  under  the 
condition  that  the  previous  symbol  was  also  wrong. 

The  symbols  in  the  interleaver-matrix  consist  of  8  bits. 
For  these  bits  the  probability  P{Bw\Syj),  that  one  bit  is 
wrong  under  the  condition  that  the  symbol  is  wrong,  is 
defined.  If  the  symbol  is  right,  the  Viterbi-decoder  has 
corrected  the  wrong  bits. 

Now  these  three  probabilities  are  used  to  produce  a 
Channel  Model.  First  one  goes  through  the  rows  of  the 
interleaver-matrix.  The  first  symbol  will  be  disturbed 
with  the  probability  P(S'u;|iS'V).  If  the  symbol  is  wrong, 
errors  will  be  produced  in  the  bits  of  this  symbol  with 
the  probability  P{Byj\Sy,).  The  next  symbol  will  be  dis¬ 
turbed  with  either  P(5'tyjS'V)  or  P{Sw\S'w)  depending 
on  whether  the  previous  symbol  was  disturbed.  Then  the 
bits  of  this  symbol  will  be  disturbed  with  P{B^\Sw)  and 
so  on.  If  all  symbols  are  handled,  the  errors  of  the  inner- 
decoder  are  reproduced.  The  three  probabilities  will  be 
obtained  from  computer  simulations  of  the  whole  trans¬ 
mission  scheme.  They  depend  on  the  SNR,  on  the  channel 
and  on  the  code-rates. 

To  reproduce  the  error-patterns  at  the  output  of  the 
outer-decoder,  the  RS-words  are  read  out  in  columns  and 
the  wrong  symbols  will  be  counted.  If  the  number  of  errors 
are  equal  or  less  than  the  number  of  correctable  errors,  cdl 
errors  will  be  corrected  by  the  outer  RS-code,  so  they  can 
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be  deleted  in  the  total  column.  If  this  number  exceeds  the 
number  of  correctable  errors,  the  errors  in  that  column 
remain.  Only  the  first  fc  =  188  information  symbols  from 
the  RS-codeword  are  taken,  to  obtain  the  output  error 
pattern. 

For  a  hierarchical  system  with  different  inner  and  outer 
decoders,  one  can  obtain  a  Channel  Model  by  combining 
the  corresponding  models.  This  model  is  used  for  the 
simulations. 

For  ED  this  Channel  Model  can  be  extended  for  provid¬ 
ing  some  reliability  information  for  each  bit.  Assuming  an 
inner  SOVA  [4]  the  inner  decoder  can  provide  reliability 
information  for  each  bit.  Because  the  outer  RS- Decoder  is 
a  systematic  code,  the  reliability  information  is  also  use¬ 
ful  after  the  outer  decoder.  If  the  RS-decoder  detects  one 
wrong  packet,  this  packet  will  be  send  to  the  source  de¬ 
coder  without  change,  so  that  the  reliability  information 
after  the  inner  decoder  is  also  legal  for  the  bits  after  the 
outer  decoder.  This  reh ability  information  can  also  be 
used  for  ED. 

The  soft  output  values  of  the  SOVA  are  the  log- 
likelihood  ratios  for  each  bit.  The  definition  of  the  log- 
likehhood  ratio  is: 


L  =  /n 


/  P{^  =  i\y)  \  _  .  (  P(^  =  m  \ 
VP(a;=  Vl-P(^=  l|y)/  ’ 


(1) 


where  P[x  =  l\y)  denotes  the  probability  that  a:  =  1  wbs 
sent  under  the  condition  that  y  was  received.  Assuming 
an  AWGN-channel  with  variance  and  the  mean  //c  =  1, 
the  probability  for  the  received  symbol  is: 


P(y\x  =  1)  =  >  (2) 

For  the  probability  that  x  =  1  was  sent  under  the  condi¬ 
tion  that  y  is  received  one  obtains  with  Bayes: 


P{x  =  l\y)  = 

_ P(x  =  l)P{y\x  =  1) _ 

P(x  —  l)P{y\x  =  1)  +  P(x  —  -l)F(y|a7  =  -1) 

With  P{x  =  1)  =  P{x  =  -1)  =  1/2  this  reduces  to 

P(3/|r  =  l)  +  P(j,|x  =  -l) 

By  inserting  equation  2  one  obtains: 


(3) 

(4) 


P{x  =  lly)  = 


-at 


(5) 


1  +  e 

With  this  equation  the  log-likelihood  value  (L-value)  is: 

1 

\ 

2 


nr 

2 


\P(a7  = -1|2/)/  1 - ^ 


•  y  «  (6) 


_  22i. 


1+e 


This  is  the  L-value  after  the  channel  before  the  inner 
decoder.  Because  the  received  values  y  are  gaussian  dis¬ 
tributed  the  L- values  are  also  gaussian  distributed  with 
(XLc  ~  a,nd  fiLc  “  ^2  *  y>C' 

The  soft-output  values  of  the  SOVA  are  also  L-values. 
From  simulations  one  can  see  that  the  power  density  func¬ 
tion  (pdf)  of  the  L-values  after  the  inner  decoder  can 
also  be  approximated  by  a  gaussian  distribution  with 
(TLa  =  •  0-5  and  fiLs  —  ^ if  assuming  an  AWGN- 

channel  (Figure  2  where  x  =  1  was  sent).  Therefore,  for 


the  soft  outputs  one  needs  only  the  variance  <tI  and  the 
mean  /is  of  the  outputs  of  the  inner  decoder,  where  for  the 
mean  it  seems  certain  that  /is  =  1  with  the  assumption 
that  X  =  1  was  sent. 

The  variance  is  determined  by  the  BER  after  the  inner 
decoder  and  by  the  corresponding  E3IN0.  The  variance 
<Ts  to  the  corresponding  BER  can  be  obtained  with  the 
following  equations: 

1  2 

BER  =  “  *  erfc[—]  with  erfc[x]  =  1  -  /  e“^  dt 

2  No  Jq 

I  No 

With  a  given  BER  after  the  inner  decoder  the  correspond¬ 
ing  (Ts  can  be  computed.  Note,  that  the  E^/No  after  the 
inner  decoder  is  not  the  Es/No  of  the  channel.  With  the 
knowledge  of  the  variance  and  the  mean,  one  can  obtain 
the  pdf  for  the  reliability  informations. 

The  Channel  Model  is  a  model  with  two  states:  one 
state  describing  the  right  path  and  one  describing  the 
wrong  path  of  the  Viterbi-decoder.  The  reliability  infor¬ 
mation  for  the  bits  in  the  wrong  path  is  usually  small.  If 
X  =  1  was  sent,  the  pdf  of  the  L-values  is  the  following: 


pdf(L)  =  { 


1 


e 


:  L<0/\Sw 
(wrong  path,  wrong  bit) 

(wrong  path,  right  bit) 
0  :  L  <0  A  Sr 

(right  path) 


1 

(right  path) 


If  one  bit  in  the  wrong  path  is  correct,  then  the  L-value 
is  distributed  in  the  positive  side  of  the  pdf,  otherwise  the 
L-value  lies  in  the  negative  side  of  the  pdf.  It  should  be 
noted  that  the  L-values  for  a  sent  x  =  —1  are  gaussian 
distributed  with  ctls  and  /ils  =  ^*(— !)•  Therefore,  the 

mirrored  pdf(L)  has  to  be  used. 

If  one  simulates  the  L-values  for  each  bit  additionally 
in  the  Channel  Model  for  the  wrong  and  the  right  path 
respectively,  one  obtains  a  good  approximation  for  the  L- 
values  after  the  SOVA.  These  reliabilities  can  also  be  used 
for  ED. 

For  an  Rayleigh  fading  channel  the  L-values  after  the 
SOVA  are  no  more  gaussian  distributed.  The  pdf  is  then 
approximately  the  convolution  of  an  Rayleigh  distribution 
with  an  gaussian  distribution.  The  computation  of  this 
function  is  more  complex.  The  Channel  Model  should  be 
so  simple  as  possible  and  with  the  assumption  that  the 
L-values  are  gaussian  it  seems  to  be  a  good  compromise 
between  complexity  and  precision. 


3  Error  Detection  and  Error 
Concealment 


3.1  Error  Detection 

For  EC  measures,  proper  ED  is  needed.  Three  ways  of 
ED  are  considered:  first  the  source  decoder  can  detect 
errors  [5l,  but  with  this  method,  many  errors  will  still  re¬ 
main  undetected  and  there  is  the  risk  of  error  propagation. 
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Therefore,  another  way  zdready  mentioned  is:  the  channel 
decoder  can  also  detect  errors.  The  outer  RS~decoder  can 
detect  wrong  packets  with  high  reliability  [6].  A  wrong 
packet  will  be  indicated  in  the  transport  header.  The 
header  is  about  4  bytes  and  the  payload  consists  of  184 
bytes.  If  the  source  decoder  knows  the  errors  in  packets, 
the  slices  [l]  where  the  packet  errors  are,  will  be  concealed 
until  the  next  synchronization  point  (the  next  slice).  The 
worst  case  is  that,  if  this  packet  would  contain  the  slice 
start  code.  Then  the  rest  of  the  current  slice  and  the 
whole  next  slice  must  be  conceziled.  This  EDT  finds  the 
region  where  errors  are  in  the  bitstream.  The  region  is  at 
least  184  bytes  long  and  these  184  bytes  will  cause  the  loss 
of  synchronization  until  the  next  slice.  This  EDT  gives  no 
information  about  the  exact  location  of  the  errors  in  the 
block.  In  many  cases  where  the  decoder  failed,  the  RS- 
word  has  tmax  +  1  errors.  If  these  errors  are  at  the  end 
of  the  RS-word,  decoding  until  another  EDT  finds  errors 
could  be  better  because  the  information,  which  could  be 
correct  in  the  bitstream,  will  be  used.  But  this  EDT  must 
be  a  method  with  high  reliability.  One  solution  for  using 
the  correct  part  of  a  wrong  packet  is  the  use  of  the  soft 
information  at  the  output  of  the  inner  channel  decoder  [4] 
as  a  reliability  for  each  symbol.  The  SOVA  gives  the  log- 
likelihood  values  for  each  bit.  From  this  information  one 
can  obtain  the  reliability  for  each  symbol  of  the  RS-word. 
With  equation  1  one  obtains  for  the  probabilities  for  each 
bit: 


The  probability  P(5),  that  one  symbol  of  one  RS-word  is 
correct  (with  the  assumption  that  the  bits  are  statistically 
independent),  is: 

A  e'^O)'  el^'l 

=  |I<l  =  nunL(,) 

i=i 

With  this  value  one  obtains  a  measure  for  the  proba¬ 
bility  of  each  symbol  in  one  Transport  Packet.  With  a 
certain  threshold  one  can  say  with  high  reliability,  if  the 
next  symbol  is  wrong  or  not.  The  correct  part  of  the  erro¬ 
neous  packet  can  therefore  be  used  until  the  first  symbol 
is  wrong  in  one  packet. 

3.2  Error  Concealment 

The  ECT’s  used  in  this  article  for  comparing  the  EDT^s 
are  temporal  EC  with  and  without  motion  compensation 
[5].  The  simple  temporal  EC  is  a  simple  copy  of  the  wrong 
parts  of  the  previous  anchor  picture  into  the  current  pic¬ 
ture.  With  this  EC  method  shifts  will  be  visible,  if  there 
is  motion.  The  temporal  ECT  with  motion  compensation 
uses  in  addition  the  motion- vectors  from  one  nearest  mac¬ 
roblock  in  the  current  picture.  The  second  method  in  the 
most  cases  is  the  best  one,  because  high  motion  becomes 
not  visible.  But  if  the  motion  from  the  current  macroblock 
is  a  motion  with  another  direction  than  the  surrounding 
macroblocks,  this  error  becomes  visible.  This  occurs  also 
if  there  are  burst  errors  which  cause  more  than  one  dis¬ 
turbed  slice,  and  the  motion  vectors  which  are  considered 
are  too  far  from  the  current  macroblock.  All  ECT’s  will 
cause  some  remaining  errors  in  the  picture  domain,  be¬ 
cause  they  could  not  be  as  good  as  the  real  picture  data. 
Therefore,  it  seems  worthwhile  to  use  the  soft  informa¬ 
tion  of  the  SOVA  as  an  EDT,  because  there  as  much  as 
errorfree  data  is  used. 

Other  EC  algorithms  like  spatial  EC  [7,  8]  could  have 
been  used  for  comparing  the  EDT’s,  but  they  will  lead  to 
the  similar  results. 


4  Simulation  Results 


Simulation  results  of  different  temporal  ECT’s  with  differ¬ 
ent  EDT’s  in  P-  and  B-pictures  of  MPEG-2  coded  pictures 
are  given  in  Table  1  and  in  Table  2.  The  different  EDT’s 
are  ED  in  the  channel  decoder  without  soft  information 
(’channel’)  and  ED  in  the  channel  decoder  with  soft  infor¬ 
mation  (’sova’).  Two  different  temporal  ECT’s  are  con¬ 
sidered.  One  is  simple  temporal  EC  (simple  copying  from 
one  previous  picture,  ’simple’)  and  the  other  is  temporal 
EC  with  motion  compensation  (with  motion  vectors  from 
one  nearest  macroblock  of  the  current  picture,  ’motion’). 
I  Error  Concealment  Comparison  (PSNR) 
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channel 

simple- 

sova 

motion- 

channel 

motion- 

sova 

without 
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10“^ 
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21.46 

21.60 

26.51 

26.71 

29.63 

U 

31.70 

31.83 

33.50 

33.61 

34.42 

V 

32.71 

32.84 

33.92 

34.11 

35.51 

Y 

27.10 

27.13 

29.03 

29.06 

29.63 

U 

33.94 

33.95 

34.29 

34.30 

34.43 

V 

34.93 

34.94 

35.17 

35.18 

35.51 

Table  1:  PSNR  values  for  calendar  (P-pictures) 


Table  2:  PSNR  values  for  calendar  (B-pictures) 


The  simulation  are  carried  out  under  the  following  con¬ 
ditions:  the  size  of  the  TV  picture  of  4:2:0  format  was 
720x576  pixel.  One  slice  consists  of  44  macroblocks.  The 
bit  rate  of  5  Mbit/sec  is  chosen,  because  it  seems  to  be 
acceptable  for  a  good  TV  quality  (higher  than  PAL  or  SE- 
CAM).  20  P/B- pictures  were  disturbed  with  the  packet 
error  rate  PER  =  10”^  and  PER  =  10“^.  The  mean 
of  the  PSNR  for  the  three  components  is  shown  in  Ta¬ 
ble  1  and  2.  The  errors  were  simulated  with  the  Channel 
Model,  which  reproduces  the  errors  after  the  channel  de¬ 
coder  (Rayleigh  fading  channel).  The  burst  errors  of  the 
inner  Viter  bi-decoder  were  simulated. 

The  simulation  results  show  that  the  main  gain  for  ED 
with  the  soft  output  values  is  only  0.2  dB  for  the  PSNR  at 
PER  =  10“^  for  the  luminance  component.  One  cause 
for  these  small  improvements  is  that  the  slices  are  very 
long  and  that  the  amount  of  bits  in  the  first  part  of  the 
RS-word  can  be  very  small  if  one  symbol  error  is  at  the 
beginning  of  one  word.  Because  the  synchronization  point 
is  at  the  next  start  of  one  slice,  only  a  few  macroblocks  can 
be  further  decoded  when  using  the  remaining  lossless  part 
at  the  beginning  of  the  RS-word.  One  way  for  obtaining 
better  results  is  to  make  the  slices  shorter.  Table  3  and  4 
give  the  PSNR- values  for  the  disturbed  P-  and  B-pictures 
respectively,  where  the  slice  length  is  shorter  (22  and  11 
macroblocks  instead  of  44  macroblocks  per  slice. 
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Error  Concealment  Comparison  (PSNR)  | 
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34.11 
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34.33 

34.39 
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35.22 

35.31 
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Table  3:  PSNR  values  for  calendar  (P-pictures) 
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35.31 

35.39 

35.46 

Y 

28.64 

29.00 

29.08 

28.75 
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Table  4:  PSNR  values  for  calendar  (B-pictures) 


The  results  show  that  for  shorter  slices  the  gain  of  us¬ 
ing  additional  soft  information  is  up  to  2  dB  for  the  lumi¬ 
nance  component  for  simple  temporal  EC  at  PER  =  10“^ 
whereas  the  gain  for  temporal  EC  with  motion  compen¬ 
sation  is  up  to  1  dB.  The  temporal  ECT  with  motion 
compensation  is  a  very  good  ECT  therefore,  the  gain  is 
not  so  high.  The  reduced  picture  quality  due  to  the  use  of 
more  synchronization  points  in  a  video  sequence  with  the 
bitrate  of  5  Mbit/sec  is  negligible.  In  I- pictures,  where 
no  motion  information  exists,  the  gain  wiQ  be  higher  be¬ 
cause  the  ECT’s  like  spatial  EC  and  simple  temporal  EC 
will  cause  errors  in  the  picture  domain,  therefore  it  seems 
worthwhile  to  use  the  correct  part  of  one  disturbed  RS- 
word. 

5  Conclusions 

In  this  paper  two  different  ECT’s  with  two  different  ED 
methods  for  MPEG-2  compressed  video  sequences  are 
studied.  The  first  ECT  is  based  on  a  simple  temporal 
EC,  where  the  previous  anchor  picture  is  copied  into  the 
current  picture,  where  errors  are  detected.  The  second 
technique  is  the  temporal  EC  with  motion  compensation. 
In  this  technique  the  motion  vectors  of  a  neighbouring 
macroblock  are  used  for  the  EC  of  the  current  macroblock. 
The  second  technique  is  the  best  technique  in  the  case  of 
uniformly  motion.  With  different  high  motion  this  tech¬ 
nique  can  cause  a  few  wrong  blocks  in  the  picture.  The 
two  EDT’s  are  ED  in  the  outer  decoder  of  the  channel 
decoder  and  the  detection  of  wrong  symbols  by  addition¬ 
ally  using  the  soft  information  of  the  inner  SOVA.  It  is 
shown  that  it  is  worthwhile  to  use  the  correct  part  of  the 
RS-word  until  a  symbol  is  wrong  even  if  the  ECT  is  not  so 
good.  It  should  be  noted  that  the  gain  raises  with  shorter 
slices.  For  simple  temporal  EC  the  gain  for  the  EDT  with 
soft  information  is  up  to  2  dB  for  the  luminance  compo¬ 
nent  at  PER  =  10“^.  For  I- pictures  the  gain  will  be 
higher  because  no  ECT  is  better  than  decoding  on  until 
the  next  symbol  error. 
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1.  INTRODUCTION 

Transform  coders,  in  particular  the  Discrete  Cosine  Trans¬ 
form  (DCT),  have  been  widely  used  in  the  implementation 
of  low-rate  codecs  for  video  compression.  The  DCT  has  be¬ 
come  an  integral  part  of  several  standards  such  as  MPEG 
[1],  and  CCITT  Recommendations  H.  261  and  H.263  [2]. 
For  real  time  implementation  of  the  DCT  at  MPEG  rates, 
general  purpose  DSP’s  [3]  have  been  proposed.  However, 
with  the  imminent  arrival  of  High  Definition  Television  and 
the  considerably  higher  sample  rates  (75  MHz  sampling  fre¬ 
quencies)  and  higher  throughput  requirements,  processors 
specific  to  the  DCT  (ASIC’s)  with  highly  parallel  and/or 
pipelined  architectures  have  been  recently  reported  [4].  The 
use  of  such  processors  can  be  justified  only  if  they  result 
in  a  cost  effective  implementation.  In  this  paper  we  will 
describe  a  simple  but  cost  effective  design  of  a  combined 
DCT/IDCT  processor.  The  implementation  uses  a  paral¬ 
lel  architecture  that  exploits  symmetry  properties  in  the 
DCT/IDCT  algorithm  and  requires  only  simple  hardwired 
multipliers.  Clocking  at  100  MHz,  the  processor  is  capa¬ 
ble  of  handling  real-time  HDTV  signals.  The  core  area  of 
the  processor  hcis  been  minimized  while  keeping  the  I/O  re¬ 
quirements  simple.  Finite  wordlength  and  accuracy  studies 
have  been  carried  out  to  ensure  that  the  resulting  imple¬ 
mentation  conforms  to  existing  standards  (H.261)[2]. 

2.  COMPUTATION  OF  THE  DCT/IDCT 

The  forward  and  inverse  transforms  can  be  written  as  a 
triple  matrix  product  Z  =  AXA^  and  X  =  A^ZA  where 
A  A  =  by  virtue  of  the  orthogonality  of  A,  which  is 
aji  N  X  N  matrix.  The  decomposition  to  a  triple  matrix 
product  requires  2N^  multiplications  to  be  performed  over 
clock  cycles  (or  input  sample  periods) which  requires  2N 
multiplies  to  be  computed  per  input  sample. 

A  standard  block  diagram  of  a  DCT(IDCT)  processor 
is  shown  in  Fig.  1  where  the  computation  of  the  2D  DCT 
has  been  broken  down  into  two  ID  DCT’s  (or  IDCT’s).  The 
first  computes  Y  =  AX  (or  A^ X)  and  the  second  computes 
^  =  YA^  (or  Y A).  The  N  x  N  matrix-matrix  multiply 
has  been  separated  into  N  matrix-vector  products.  Thus, 
the  basic  computation  performed  by  the  DCT  (or  I  DCT) 
is  the  evaluation  of  the  {N  x  N)  matrix  by  {N  x  1)  vector 
product.  Each  ID  DCT  (or  IDCT)  unit  must  be  capable 
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of  computing  N  multiplies  per  input  sample  to  perform  a 
matrix-vector  product. 

If  the  input  block  X  is  scanned  column  by  column.  The 
intermediate  product  Y  =  AX  (or  A ^ AT)  is  also  computed 
column  by  column.  However,  since  an  entire  row  of  Y 
has  to  be  computed  prior  to  the  evaluation  of  the  next  ID 
DCT,  the  intermediate  result  Y  must  be  stored  in  an  on- 
chip  buffer.  Since  columns  are  written  into  the  buffer  and 
rows  are  read  from  it,  it  is  commonly  called  the  transpose 
memory. 

The  first  ID  DCT/IDCT  unit  operates  on  rows  of  A 
(or  A^)  and  columns  of  A,  while  the  second  ID  DCT  unit 
operates  on  a  rows  of  Y  and  columns  of  (or  A).  Since  a 
column  of  A^  (A)  is  equivalent  to  a  row  of  A  (A^),  the  sec¬ 
ond  ID  DCT/IDCT  unit  is  unnecessary  if  we  can  multiplex 
the  first  ID  DCT/IDCT  unit  between  a  column  of  X  and  a 
row  of  T  as  shown  in  Fig.  2.  However,  the  ID  DCT/IDCT 
unit  must  now  process  samples  at  twice  the  input  sample 
rate,  i.e,  the  ID  DCT/IDCT  unit  must  be  capable  of  com¬ 
puting  2N  multiplies  per  input  sample.  We  can  exploit 
certain  features  of  the  ID  DCT/IDCT  to  reduce  the  com¬ 
putational  overhead  of  the  basic  building  block.  The  8x8 
DCT  matrix  can  be  written  as 
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where  a  =  |cosf,  6  =  |cos  c  =  ^cos  f ,  d  =  |cos  ff, 
e  =  I  cos  y-,  /  =  I  cos  ^  and  g  ^  ^  cos  Even  rows  of  A 
are  symmetric  and  odd  rows  are  anti-symmetric.  Thus,  by 
exploiting  this  symmetry  in  the  rows  of  A,  and  separating 
the  even  and  odd  rows,  we  get 
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Similarly,  the  ID  IDCT  can  be  rewritten  as: 
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The  number  of  multiplies  is  halved  since  the  (N  x  N)  x 
(N  X  1)  matrix- vector  multiply  has  been  replaced  by  two 
(^Xy)x(yXl)  matrix- vector  multiplies  which  can  be 
computed  in  parallel.  Since  the  computation  for  both  the 
DCT  and  the  IDCT  has  been  halved,  the  ID  DCT/IDCT 
unit  in  Fig.  2  now  needs  to  compute  only  N  multiplies  per 
input  sample.  The  savings  in  the  multiplications  comes  at 
the  expense  of  some  data  ordering  and  reordering  that  is 
required  before  and  after  the  ID  DCT/IDCT  unit. 

3.  ARCHITECTURE 

Both  the  DCT  and  the  IDCT  require  the  computation  of 
two  (4  X  4)  by  (4  X  1)  matrix- vector  products.  Our  matrix- 
vector  multipliers  are  designed  to  compute  these  matrix- 
vector  products  in  parallel  during  four  clock  cycles.  For 
four  clock  cycles,  the  matrix- vector  multipliers  operate  on 
a  column  of  X.  For  the  next  four  clock  cycles,  the  matrix- 
vector  multipliers  operate  on  a  row  of  Y,  This  is  the  multi¬ 
plex  operation  illustrated  in  Fig.  2.  The  DCT  requires  sev¬ 
eral  additions/subtractions  prior  to  the  matrix- vector  mul¬ 
tiply  whereas  the  IDCT  requires  the  grouping  of  even  and 
odd  samples.  This  is  performed  by  our  Data  Reorder  Unit 
(DRU)  which  receives  its  its  inputs  from  the  columns  of  X 
(off-chip)  and  the  rows  of  Y  (on-chip).  Similarly,  the  DCT 
requires  the  regrouping  of  even  and  odd  coefficients  after 
the  matrix-vector  multiply  while  the  IDCT  requires  some 
additions/subtractions.  These  are  performed  in  the  Inverse 
Data  Reorder  Unit  (IDRU).  The  completed  forward  (re¬ 
verse)  transformed  coefficients  Z  are  written  off-chip  while , 
Y  is  written  into  the  transpose  memory  which  is  imple¬ 
mented  as  a  64-word  DRAM.  Each  memory  cell  is  imple¬ 
mented  as  a  three- transistor  RAM  (3T-RAM).  The  pixels 
of  Y  are  delayed  by  four  clock  cycles  such  that  Y (0)  arrives 
four  clock  cycles  after  A(0).  Input  pixels  X{n)  and  output 
pixels  Z{n)  enter  and  leave  the  chip  a  single  pixel  at  a  time. 
The  overall  architecture  (and  floorplan)  of  the  chip  is  shown 
in  Fig.  3  It  consists  of  the  following  blocks:  (1)  Data  Re¬ 
order  Unit  (DRU)  (2)  Two  Matrix- Vector  Multiplier  Units 
(3)  Inverse  Data  Reorder  Unit  (IDRU)  (4)  Transpose  Mem¬ 
ory 


3.1.  Matrix- vector  Multiplication 

The  computation  of  the  ID  DCT/IDCT  requires  two  4x4 
matrices.  The  first  4x4  matrix  has  elements  that  can 
assume  one  of  three  values:  a,  c,/  (possibly  negated).  Any 
input  to  this  matrix- vector  multiplier  can  multiply  only  one 
of  three  coefficients.  The  second  4x4  matrix  has  elements 
that  assume  one  of  four  values:  6,  d,  e^g  (possibly  negated). 
Similarly,  any  input  to  this  matrix-vector  multiplier  can 
multiply  only  one  of  four  coefficients. 

The  architecture  of  the  ACF  matrix-vector  multiplier  is 
shown  in  Fig.  4.  It  consists  of  three  hardwired  multipliers 
implementing  multiplications  by  fixed  coefficients  a,c  and 
/.  An  input  Xe  to  the  ACF  multiplier  bank  is  broadcast  to 
all  three  multipliers.  The  products  axe^cxe,  fx^  are  avail¬ 
able  in  parallel.  A  bank  of  four  accumulators  then  selects 
one  of  these  products  to  perform  the  various  linear  com¬ 
binations  required  for  the  matrix-vector  multiply  over  four 
clock  cycles.  Thus,  at  the  end  of  four  clock  cycles,  ACCO 
has  computed  the  sum  of  products  corresponding  to  the  first 
row  of  the  matrix,  ACCl  hzis  computed  the  sum  of  prod¬ 
ucts  corresponding  to  the  second  row  of  the  matrix,  and  so 
on.  The  sequence  of  inputs  to  the  ACF  matrix- vector  mul¬ 
tiplier  bank  for  the  DCT  is:  X(3)  -f- A(4),  X{2)  ^(5), 
X{1)  +  A(6),  X{0)  +  X(7),  y(3)  +  y(4),  Y{2)  +  7(5), 
7(1)  +  7(6),  7(0)  -{-  7(7),  For  the  IDCT,  the  sequence 
of  inputs  is:  ^(4),  X(2),  ^(6),  X{Q),  7(4),  7(2),  7(6), 
7(0). 

The  architecture  for  the  BDEG  matrix- vector  multiplier 
is  shown  in  Fig.  5.  It  consists  of  four  hardwired  multipli¬ 
ers  implementing  multiplications  by  6,  d,  e  and  g.  An  in¬ 
put  Xo  to  the  BDEG  multiplier  bank  is  broadcast  to  all 
four  multipliers.  The  products  hxo^dxo^oxo  and  gxo  are 
available  in  parallel  to  aU  four  accumulators.  The  bank 
of  four  accumulators  then  selects  one  of  these  products  to 
perform  the  various  linear  combinations  over  four  clock  cy¬ 
cles.  As  before,  ACCO  computes  the  first  sum  of  products, 
ACCl  computes  the  second  sum  of  products,  and  so  on. 
The  sequence  of  inputs  to  the  BDEG  matrix-vector  mul¬ 
tiplier  bank  for  the  DCT  is:  A (3)  —  A (4),  A (2)  —  A(5), 
A(l)  -  A(6),  A(0)  -  A(7),  7(3)  -  7(4),  7(2)  -  7(5), 
7(1)  -  7(6),  7(0)  -  7(7).  For  the  IDCT,  the  sequence 
of  inputs  is:  A(3),  A(5),  A(l),  A(7),  7(3),  7(5),  7(1), 
7(7).  The  order  in  which  the  accumulators  select  their 
products  is  stored  as  control  information  in  a  small  ROM. 
The  outputs  of  the  accumulators  are  multiplexed  into  the 
IDRU  for  further  post-processing. 

3.1.1.  Hardwired  Multipliers 

Since  the  multiplier  coefficients  are  fixed,  we  adopted  a 
hardwired  -multiplier  approach  with  the  coefficients  repre¬ 
sented  by  a  Signed  Digit  (SD)  code.  The  primary  advan¬ 
tage  of  the  signed  digit  representation  is  that  it  allows  most 
numbers  to  be  represented  with  fewer  non-zero  digits.  Ta¬ 
ble  1  lists  the  hardwired  coefficients  and  their  signed-digit 
representations.  The  number  of  non-zero  bits  is  dictated  by 
accuracy  and  finite  wordlength  specifications,  and  has  been 
determined  by  simulation.  The  multiplier  coefficients  are 
accurate  to  four  decimal  places.  From  Table  1  we  can  see 
that  four  to  six  non-zero  bits  are  adequate  to  represent  the 
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coefficients,  which  significantly  reduces  the  hardware  com¬ 
plexity  of  the  multiplier  bank. 

3.1.2.  Accumulator 

The  accumulator  shown  in  Fig.  6  consists  of  a  4-to-l  mul¬ 
tiplexer  that  selects  between  one  of  the  (three)  four  prod¬ 
ucts  at  any  given  time  instance,  A  negative  coefficient  re¬ 
quires  that  the  selected  product  be  subtracted  from  the 
current  accumulated  value.  This  is  accomplished  by  the 
conditional  invert  (xor)  unit.  The  contents  of  the  accumu¬ 
lator  are  latched  into  the  output  register  (OUTREG)  once 
every  four  clock  cycles  and  then  reset  to  zero.  This  is  ac¬ 
complished  by  the  signal  reset.  The  controls  for  the  4-to-l 
multiplexer  se/[0  :  1],  conditional  invert,  and  reset  for  both 
the  DCT/IDCT  modes  of  operation,  are  stored  in  a  ROM. 

4.  FINITE  WORDLENGTH  SIMULATIONS 

We  have  carried  out  finite  wordlength  simulations  to  ensure 
that  our  implementation  meets  the  specifications  dictated 
by  the  standards  (H.261).  Simulations  were  carried  out  on 
a  set  of  10,000  8x8  blocks.  These  blocks  were  generated  by 
the  random  number  generator  specified  in  [10].  Simulations 
were  carried  out  for  the  three  cases  (a)  L  =  —256,  255, 

(b)  L  =  =  5,  and  {c)  L  =  H  =  300,  where  L  and  H  are 

the  lower  and  upper  bounds  on  the  random  numbers  (i.e., 
—  L<X<H).  The  peak  mean  square  error  and  overall 
mean  square  error  are  illustrated  in  Figs. 7-8  for  the  Ccise  L 
=  —256,  H  =  255.  An  internal  wordlength  of  22  bits  has 
been  found  to  be  adequate.  The  error  characteristics  of  the 
core  processor  are  summarized  in  Table  2. 

5.  IC  CORE  CHARACTERISTICS 

Our  DCT/IDCT  processor  has  been  designed  using  a  0.8- 
fim.  CMOS  cell  library.  The  library  includes  high-speed  pa- 
rameterizable  carry-select  adders,  registers  and  multiplex¬ 
ers,  among  other  cells.  The  22-bit  carry-select  adder  deter¬ 
mines  the  critical  path,  which  is  well  under  10  ns.  This  per¬ 
mits  the  realization  of  the  100  MHz  clock  and  sample  rate. 
The  layout  has  been  generated  using  the  Mentor  Graphics 
GDT  design  tools.  The  core  measures  under  10  mm^  with 
a  transistor  count  of  67,000  transistors.  The  core  charac¬ 
teristics  have  been  summarized  in  Table  3.  A  single  pin 
determines  the  forward  or  inverse  modes  of  operation.  A 
single  start  pulse  is  required  to  initialize  the  chip.  All  other 
control  signals  are  generated  internally  on-chip. 

6.  CONCLUSION 

The  10  mm^  core  area  of  our  2D  DCT/IDCT  processor 
compares  very  favorably  with  core  areas  reported  in  the 
recent  literature.  This  results  from  (a)  the  use  of  a  fast 
algorithm  that  permits  the  use  of  only  one  ID  DCT/IDCT 
unit  for  the  2D  transform,  and  (b)  an  architecture  that  re¬ 
sults  in  a  common  datapath  that  is  shared  by  both  the  DCT 
and  the  IDCT.  The  architecture  requires  seven  hardwired 
multipliers  and  eight  accumulators.  A  significant  savings 
in  area  ha.s  been  obtained  by  allocating  just  enough  preci¬ 
sion  to  the  hardwired  multipliers  that  the  implementation 


Figure  1:  Basic  DCT/IDCT  Unit 


Figure  2:  Multiplexed  DCT/IDCT  Unit 


is  compliant  with  the  existing  standards.  The  processor  has 
been  designed  with  a  parameterizable  input  wordlength  for 
applications  that  might  need  more  precision.  Clocking  at 
100  MHz,  the  processor  can  meet  the  real  time  processing 
requirements  of  currently  proposed  HDTV  systems. 
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Figure  3:  Floorplan  and  Architecture 
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Figure  4:  acf  matrix  vector  multiplier 


Figure  5:  bdeg  matrix  vector  multiplier 
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Figure  6:  Accumulator 
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Figure  7:  Peak  Mean  Square  Error 


Overall  Mean  Square  Error 

L  -  -256,  H  «  255 


Figure  8:  Overall  Mean  Square  Error 
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value 

12  bit 

SD  representation 

bits 

a 

0.35355 

0.35351 

2"^  +2-*  +2"'’  -1-2“' 

-1-2"®  =  0.35352 

5 

0.49039 

0.49023 

2-1  _  2"'  -  2"^'  +  2""'^  =  0.49036 

4 

c 

0.46194 

0.46191 

2_i  _  2-s  _  2-'  ^  2"‘®  =  0.46191 

4 

IT 

0.41573 

0.41552 

2-‘‘  4-  2"''  -1-2"'’  -1-2"' 

4-2~®  -  2"‘®  =  0.41577 

6 

e 

0.27779 

0.27734 

2"^  +  2-“  -  2"“  -1-  2"'"  =  0.27783 

4 

~T~ 

0.19134 

0.19091 

2-^  +  2-*  -1-  2"“  -  2"'’  =  0.19135 

4 

g 

0.09755 

0.09716 

2"^  +  2"=’  +  2"®  -  2"“  =  0.09753 

4 

Table  1:  Signed  Digit  Representation  of  the  DOT  Coeffi¬ 
cients 


Spec. 

L,H  = 
-256,255 

L,H 
=  300 

L,H 

5 

Peak  Pixel  Error 

<  1 

1 

1 

1 

Peak  Pixel  M.S.E. 

<  0.06 

0.0134 

0.0153 

0.0139 

Overall  M.S.E. 

<  0.02 

0.0104 

0.0101 

0.0028 

Peak  Pixel  Mean  Error 

<  0.015 

0.0133 

0.0125 

0.0139 

Overall  Mean  Error 

<  0.0015 

0.00096 

0.0011 

0.0011 

Table  2:  Accuracy  of  Processor 


Inputs 

9  bits  (DOT),  12  bits  (IDCT) 

Outputs 

12  bits  (DCT),  9  bits  (IDCT) 

Internal  Wordlength 

22  bits 

Technology 

O.S-fim  CMOS,  triple  metal 

No.  of  Transistors 

67,000 

Core  Size 

10  mm"^ 

Clock  Rate 

100  MHz 

Mode  Selection 

DCT  or  IDCT 

Block  Size 

8x8 

Accuracy 

CCITT  compliant 

Table  3:  Core  Characteristics 
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ABSTRACT 

Tropospheric  scintillation  can  seriously  affect  satellite  links 
operating  at  A^a-band.  The  average  bit  error  probabil¬ 
ity  for  a  multiuser  binary  direct-sequence  spread-spectrum 
satellite  communication  network  operating  under  amplitude 
scintillation  is  evaluated.  Numerical  results  are  presented 
to  illustrate  performance  comparisons  of  systems  operating 
under  typical  scintillation  statistics.  As  in  the  single  user 
case,  it  is  shown  that  the  average  system  performance  grad¬ 
ually  degrades  as  the  scintillation  intensity  increases  and  is 
worse  for  a  high  variability  of  these  events.  Results  also 
indicate  that  the  impact  of  amplitude  tropospheric  scin¬ 
tillations  becomes  less  important  as  the  number  of  asyn¬ 
chronous  active  users  increases. 

1.  INTRODUCTION 

New  commercial  applications  are  being  promoted  for  the 
use  of  spread-spectrum  (SS)  techniques  for  satellite  com¬ 
munication  systems  including:  (i)  Low-  to  medium-density 
traffic  for  interactive  business  transactions  (credit  card  ver¬ 
ifications,  financial,  and  reservations  services),  (ii)  Single¬ 
hop,  very  low  response  time  links  for  voice,  data,  and  fac¬ 
simile  direct  communications,  and  (iii)  Direct  broadcasting 
from  satellites  (DBS).  In  the  first  application,  asynchronous 
code-division  multiple- access  (CDMA)  allows  low-cost  very 
small  aperture  terminals  (VSATs)  to  communicate  with  a 
hub-station  through  a  star  network.  The  second  applica¬ 
tion  consists  of  a  CDMA  fully  meshed  network  of  small 
personal  communications  terminals  (PCTs)  conveying  in 
demand-assignment  multiple  access  (DAMA)  fashion.  The 
DBS  application  uses  synchronous  code  domain  multiplexed 
(CDM)  signals  to  transmit  different  channels  from  a  broad¬ 
casting  ground  station  (cable  company,  satellite  company, 
‘‘sky-music”,  etc.)  to  fixed  and/or  mobile  VSATs.  Al¬ 
though  there  are  two  basic  forms  of  SS,  frequency  hopping 
(FH)  and  direct  sequence  (DS),  the  latter  is  more  frequently 
used  for  satellite  communication  networks. 

Additionally,  the  next  generation  of  geostationary  com¬ 
munications  satellites  will  take  advantage  of  operation  at 
AVband  (20/30  GHz)  with  its  large  available  bandwidths, 
and  using  high  gain  narrow  spot  beam  antennas  that  will 
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be  able  to  support  smaller  user  terminals.  Unfortunately, 
atmospheric  effects  including  rain  attenuation  and  tropo¬ 
spheric  amplitude  scintillation  can  seriously  affect  the  qual¬ 
ity  of  transmission  at  these  high  microwave  frequencies. 
The  scintillation  degradation  effect  can  even  become  pre¬ 
dominant  for  low-margin  VS  AT  and  PCT  systems  operat¬ 
ing  at  dry  low-elevation  sites. 

Accurate  evaluation  of  the  performance  of  these  net¬ 
works  is  of  interest  in  planning  and  designing  efficient  fading 
compensation  methods  such  as  (i)  forward  error  correction 
(FEC),  and  automatic  repeat  request  (ARQ)  schemes,  and 
(ii)  adaptive  modulations  and  power  control  techniques.  As 
part  of  the  Advanced  Communications  Technology  Satellite 
(ACTS)  propagation  experiments  program,  the  Georgia  In¬ 
stitute  of  Technology  has  been  involved  in  the  A"a-band 
channel  characterization  and  in  the  estimation  of  propa¬ 
gation  impairments  on  the  performance  of  VSATs  [l].  Of 
key  interest  are  experimental  and  theoretical  studies  of  the 
impact  of  scintillation  on  SS  signals. 

This  paper  examines  the  performance  of  a  direct  se¬ 
quence  spread-spectrum  (DS/SS)  satellite  communication 
system  subject  to  tropospheric  amplitude  scintillation.  The 
following  section  gives  the  statistical  characterization  of  scin¬ 
tillation  then  describes  the  scintillating  channel  model.  Sec¬ 
tion  3  presents  the  DS/SS  system  rnodel  and,  the  derivation 
of  the  average  bit  error  probability  (BER).  The  performance 
results  calculated  for  specific  scintillation  statistics  are  re¬ 
ported  in  section  4.  The  paper  concludes  with  a  discussion 
of  the  results  and  directions  for  future  research. 

2.  SCINTILLATING  CHANNEL  MODEL 

Tropospheric  scintillation  results  from  random  temporal  vari¬ 
ations  and  spatial  inhomogeneities  in  the  refractive  index 
along  the  radio  propagation  path.  It  appears  as  an  insta¬ 
bility  in  both  the  amplitude  and  the  phase  of  the  signal.  In 
this  paper,  we  assume  that  the  scintiUation-induced  phase 
jitter  is  small  compared  to  the  jitter  generated  by  satellite 
and  ground  station  frequency  converters.  The  discussion  is 
therefore  limited  to  the  impact  of  amplitude  scintillation. 

During  a  typical  scintillation  event,  the  received  satel¬ 
lite  signal  amplitude  fluctuates  at  a  rate  of  approximately 
1/3  Hz  for  a  time  period  up  to  about  one  and  a  half  hour. 
These  events  tend  to  occur  in  the  early  afternoon  on  hot 
humid  summer  days,  and  their  effect  is  polarization  inde- 
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pendent  but  is  naore  pronounced  at  low-elevation  angles, 
higher  frequencies,  and  for  small- diameter  receiving  anten¬ 
nas  [2]. 

Based  on  the  Rytov  approximation  and  Tatarskii’s  the¬ 
oretical  formulation,  one  can  characterize  amplitude  scintil¬ 
lations  by  the  logarithmic  stochastic  process  x(^)  [3]*  Due 
to  variations  in  meteorological  conditions  on  the  propaga¬ 
tion  path,  x(^)  is  a  stationary  zero  mean  Gaussian  process 
only  during  short  periods  of  time  during  which  its  intensity 
Ox  remains  constant.  However  as  the  weather  conditions 
change,  the  refractive  index  structure  parameter  varies,  in¬ 
ducing  an  inherent  variability  in  This  causes  the  scin- 
tiUation  long-term  probability  density  function  (pdf)  to  di¬ 
verge  from  the  normal  distribution.  This  divergence  is  espe¬ 
cially  clear  in  the  tails  of  the  pdf.  Since  short-term  statistics 
are  insufficient  for  satellite  communications  system  perfor¬ 
mance  evaluations  (in  term  of  average  BER  and  link  avail¬ 
ability),  Mousley  and  Vilar  [3]  proposed  a  long  term  sta¬ 
tistical  model  for  x(^)  that  was  experimentally  validated  at 
the  X-band  and  at  the  higher  A^a-band  [3].  According  to 
this  model,  is  itself  considered  to  be  a  random  variable 
following  a  log-normal  distribution  with  ln(o’^)  mean  and 
(Tfj  standard  deviation.  The  parameters  a-m  and  <Ta  have 
to  be  characterized  for  a  particular  site  and  depend  mainly 
on  its  climatological  conditions,  the  operating  frequency, 
the  receiver  antenna  diameter  and  the  link  elevation  angle. 
This  model  prescribes  the  normal  pdf  for  short-term  log- 
amplitude  fluctuations  as  a  conditional  pdf,  conditioned  on 
the  random  variable  ax\  the  resulting  long-term  pdf  is  de¬ 
termined  by  averaging  over  the  range  of  This  pdf  is  the 
Mousley- Vilar  pdf  p;(;(x),  and  is  given  by  [3] 
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Taking  the  mean  variance  of  the  Mousley-Vilar  pdf  the  long¬ 
term  (or  average)  variance  of  x  is  related  to  (Tm  and  Ca 
by  <t|  =  (T^exp(<T^/2). 

The  scintillating  communication  channel  is  modeled  as 
an  additive  white  Gaussian  noise  (AWGN)  channel  affected 
by  tropospheric  amplitude  scintillation.  At  the  output  of 
the  channel,  the  scintfllating  received  signal  r{t)  can  there¬ 
fore  be  written  as 


r{t)  =  ro{t)  exp(x(i))  +  n{t)  (2) 

where  n(i)  is  the  channel  AWGN  stochastic  process  with 
zero  mean  and  two  sided  spectral  density  No/2^  and  ro(t)  is 
the  unperturbed  “clear  sky”  signal  in  which  the  information 
is  carried.  We  assume  in  our  analysis  that  the  tropospheric 
scintillating  channel  is  not  frequency  selective  since  its  co¬ 
herence  bandwidth  is  about  a  few  GHz.  The  channel  is  also 
considered  to  be  slowly  time  varying  since  the  scintillation 
spectrum  has  the  appearance  of  a  band-limited  white  noise 
with  a  roll-off  frequency  of  about  1/3  Hz.  The  stochastic 
process  x(^)  is  thus  assumed  to  have  the  following  charac¬ 
teristics:  (i)  statistically  modeled  by  the  Mousley-Vilar  dis¬ 
tribution,  (ii)  uncorrelated  with  the  AWGN  random  process 
n{t),  (iii)  constant  over  the  total  signal  bandwidth,  and  (iv) 
remains  constant  during  a  bit  time  T. 


3.  PERFORMANCE  ANALYSIS 


The  effect  of  amplitude  scintillation  on  the  performance  of 
the  satellite-earth  channel  had  been  initially  accommodated 
by  introducing  an  extra  fixed  degradation  in  the  nominal 
budget  link  [4].  A  more  accurate  method  of  BER  evalua¬ 
tion  in  the  presence  of  scintillation  has  been  suggested  by 
Banjo  and  Vilar  [5].  This  method  was  developed  for  single 
receivers  using  binary  PSK  modulation.  In  this  section,  we 
extend  this  method  to  a  multi-user  SS  satellite  communi¬ 
cation  system  operating  in  a  scintillating  environment. 

Assuming  a  binary  DS/SS  system  such  zis  described  by 
Pursley  [6],  K  users  simultaneously  share  a  channel,  each 
transmitting  with  power  P  at  a  common  carrier  frequency 
fo  =  2TruJo,  using  a  data  rate  R  =  1/T  and  a  chip  rate 
Rc  =  1/Tc.  Each  transmitter  k  {1  <  k  <  K)  is  assigned  a 
specific  code  sequence  (Uj^^)  of  chips  elements  (-fl,  —  I),  so 
that  its  code  waveform  is  given  by 

-foo 

“fc(*)=  XI  PTcit-jTc)  (3) 

j=:  —  OO 

The  function  Pt  denotes  the  unit  rectangular  pulse  of  dura¬ 
tion  T.  The  code  sequence  (a^^^)  is  assumed  to  be  periodic 
with  period  or  processing  gain  N  —  TjTc.  The  data  signal 
hk{i)  is  binary  PSK  modulated  onto  the  carrier  at  /<>  then 
spread  by  the  k^^  users’s  code  sequence.  The  resulting  k^^ 
users’s  transmitted  signal  Sk{t)  is  thus  given  by 

Skit)  =  akit)  bkit)  cos(wo<  +  Ok)  (4) 

where  the  phase  angle  Ok  is  a  random  variable  uniformly  dis- 
tributed  over  [0,27r]  to  take  into  account  that  the  K  users 
are  not  phase  synchronous.  Moreover,  for  asynchronous 
DS/CDMA  systems,  the  user  k  is  randomly  delayed  rela¬ 
tive  to  a  reference  signal  by  Tk  modeled  as  a  uniformly  dis¬ 
tributed  independent  random  variable  in  the  interval  [0,T]. 
The  composite  signal  stot(t)  can  then  be  expressed  as 


K 

stot{i)  =  y^^V^(ik{t-Tk)bk{t  -  Tk)co3{u;oit~rk)-\-0k)  (5) 
;c=o 

However,  in  case  of  synchronous  transmission,  the  time  de¬ 
lays  Tk  are  not  random  and  can  be  set  to  r*  =  0  for  all 
users. 

Our  analysis  assumes  that  the  channel  responds  in  the 
same  way  for  all  users.  This  is  the  case  for  a  VS  AT  and  PCT 
networks  subject  to  the  following  scintillation  scenarios: 

•  Only  the  down-link  is  affected  by  scintillations  with 
an  average  intensity 

•  Only  the  up-link  is  affected  by  scintillations  with  an 

average  intensity  yj and  for  a  network  supporting 
transmitters  from  the  same  geographical  region  (i.e., 
having  the  same  scintillation  statistics).  This  is  typ¬ 
ically  the  case  for  a  high  gain  Aa-band  narrow  spot 
beam. 

•  Both  the  up-link  and  down-link  are  affected  by  scin¬ 
tillations  with  an  estimated  overall  average  intensity 
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=  Y  +  ^Xd  ^  network  connecting  trans¬ 
mitters  in  region  A  to  a  receiver  in  region  B.  A  looped- 
back  experimental  link  (A  is  the  same  as  B),  such  as 
that  being  used  in  the  Georgia  Tech  ACTS  experi¬ 
ment,  can  be  considered  as  a  particular  case  of  this 
scenario. 

For  all  these  scenarios,  the  received  signal  r{t)  at  the  output 
of  the  channel  may  be  written  as 

r(t)  =  stot{i)  exp  (x(t))  +  n{t)  (6) 

Let  us  consider  the  user’s  receiver  and  assume  that  it 
is  correctly  synchronized,  such  as  =  7"^  =  0.  The  decision 
variable  Zi  at  the  output  of  this  matched  receiver  is 


r{t)  ai{t)  cos{uJo  t)  dt 


By  assuming  slow  scintillation,  and  by  writing  the  data  sig¬ 
nal  waveform  as  hi{t)  =  found 

to  be  equal  to 


n{i)ai{t)cos{u}ot)dt  (8) 


Zi  =  J —TexY>{x)[hi,o-\-I]-\- 


where  I  is  the  multiple  access  interference  term  (MAI)  de¬ 
fined  in  [6]  as  a  function  of  the  discrete  aperiodic  cross¬ 
correlation  function.  Under  the  standard  Gaussian  approx¬ 
imation  (large  number  of  users  and  high  processing  gain, 
i.e.,  A"  >  10  and  N  >  200)  the  MAI  is  modeled  by  a  zero- 
mean  normal  distribution.  Thus,  Zi  may  be  considered  to 
be  a  conditionally  Gaussian  random  variable  (conditioned 
on  x)  so  that 


E[Zi/x] 


k=l 


T  exp(x) 

(9) 

A  NoT 

2^  +  — 

(10) 

k=l 

k^i 

where  Ef  =  interference  param¬ 

eter  caused  by  the  other  K  —  1  codes  on  the  i  one  and 
Eb  =  PT  is  the  energy  per  bit.  The  i*'*  user’s  conditional 
bit  error  probability  (BERi)  for  a  binary  coherent  PSK 
may  then  be  written  under  the  Gaussian  approximation  as 

mmix)  =  Q  ( 5Fi7(x) )  (13) 

where  Q{.)  is  the  Gaussian  integral  function  defined  as 

Q(x)  =  /  exp(-j/V2)  dy  (14) 

V25r 

Integrating  (13)  over  the  Mousley-Vilar  pdf  of  the  scintilla¬ 
tion  (1)  yields  the  user  i’s  average  BER 

<^EWi>=  I  Q{SNRi{x) )  Px{x)dx  (15) 

We  may  note  that  when  there  is  no  scintillation  (x  =  0), 
then  <  'BERl  >=  Q{  SNRi{0)  ),  and  the  probability  of 
error  reduces  to  the  the  expression  derived  by  Pursley  in  a 
scintillation  free  channel  [6].  In  the  same  way,  neglecting 
the  multi- user’s  interference  term  in  (15)  yields  the  same 
expression  for  the  average  BER  as  the  one  computed  in  [5] 
for  a  single-user  coherent  binary  PSK  satellite  digital  link 
subject  to  scintillations  and  given  by 

<  BER  >=  I  Q  (j2{Eb/No)exp{x))  Pxix)  dx  (16) 

This  expression  (16)  holds  for  the  transmission  of  K  syn¬ 
chronous  CDM  signals  spread  by  perfectly  orthogonal  codes 
(since  the  total  interference  parameter  is  equal  to  zero  in 
this  case). 

The  selected  criterion  for  overall  system  performance  is 
the  <  BERav  >,  defined  as  the  average  bit  error  proba¬ 
bility  for  an  average  user  for  which  the  total  interference 
parameter  is  set  to  be  equal  to  Rav  = 
addition,  the  user’s  average  energy  per  bit  to  noise  ratio 
(without  multiple-access  interference)  <  Eb/No  >  is  given 

by 


<  BER  >-- 


where  rk,i  is  the  average  interference  parameter  between  the 
and  users  and  is  expressed  by  Pursley  [6]  in  terms 
of  the  aperiodic  autocorrelation  functions.  In  the  above 
expressions,  the  conditional  expectations  have  been  taken 
with  respect  to  the  independent  random  variables  $kj  Tki 
-1,  and  6/^,0,  assuming  that  there  is  an  equal  probability 
that  the  data  symbols  hk,i  are  equal  to  -|-1  or  -1  for  k  ^  i. 
Let  us  now  define  the  user’s  average  conditional  signal- 
to-noise  ratio  SNRi{x)  as 

^(x)= 


^Var[Zi/x] 

After  the  substitution  of  expressions  (9)  and  (10)  in  (11) 
and  a  simple  rearrangement,  we  obtain  the  conditional  SNRi{x) 
conditioned  on  x  2ls 


<  Eb/No  >=  (Eb/No 


sp(2x)  Pxix)  dx 


SNR,(x)  = 


exp(-2x) 
2  (Eb/No) 


4.  NUMERICAL  RESULTS 

A  set  of  50  Gold  codes  of  length  N  =  511  bits  were  gener¬ 
ated  and  assigned  to  50  users.  Initially,  we  study  the  effect 
of  scintillation  on  a  system  with  a  fixed  number  (A  =20) 
of  active  users.  The  solid  curve  (a)  in  Figure  1  represents 
the  scintillation-free  channel  reference  case.  In  Figure  1,  we 
have  also  plotted  performance  curves  (b,  c,  d)  for  increasing 
scintillation  intensities  at  a  fixed  index  of  variation,  <7^=  0. 
As  can  be  expected,  performance  degrades  as  (T^  increases. 
The  curves  (a,  b,  c,  d)  in  Figure  2  illustrate  the  impact 
of  the  variability  of  the  scintillation  by  keeping  its  long¬ 
term  standard  deviation  constant  at  0.8  dB  and  increasing 
(To.  These  curves  extend  the  conclusion  found  in  [5],  since 
they  also  show  the  worst  degradation  (at  high  SNR)  for  the 
highest  scintillation  variability  statistics. 


Figure  1:  Average  BER  for  increasing  scintillation  intensi¬ 
ties  ((a)  <Tm=  0  dB,  (b)  (Tm=  0-4  dB,  (c)  (Tm=  0.8  dB,  and 
(d)  am=  1*2  dB). 


Figure  2:  Average  BER  for  increasing  scintillation  variabil¬ 
ity  indicies  ((a)  0-^=  0,  (b)  (Tcr~  0.5,  (c)  aa=  0.75,  and  (d) 

<Ta=  1). 


Effect  of  the  Number  of  Active  Users 


Figure  3:  Average  BER  for  different  numbers  of  active  users 
((a)  K=  1,  (b)  K=  15,  (c)  30,  and  (d)  50). 


Secondly,  we  study  the  impact  of  scintillation  on  a  DS/SS 
channel  with  a  variable  number  of  active  users.  The  aver¬ 
age  total  interference  parameter  R^v  was  thus  evaluated 
for  different  values  of  K.  Figure  3  shows  the  performance 
achieved  for  an  average  user  operating  when,  respectively, 

1  (curve  a),  15  (b),  30  (c),  and  50  users  (d),  are  simulta¬ 
neously  accessing  a  free  fading  channel  (solid  curves)  or  a 
scintillating  channel  (dashed  curves).  The  parameter  set 
used  to  obtain  the  curves  of  Figure  3  is  fTm =0.752  dB  and 
<T(^=0.b.  It  can  be  seen  that  as  the  number  of  active  users 
increases,  the  scintillation  induced  <  Eh! No  >  degradation 
(for  a  fixed  <  BERav  >)  decreases.  As  an  example,  the 
<  EhjNo  >  degradation  at  <  BERav  >=  10“®  is  less  then 
0.1  dB  for  50  active  users,  whereas  it  is  0.5  dB  for  30  active 
users,  0.7  dB  for  15  active  users,  and  0.85  dB  for  the  single 
user  case. 

5.  CONCLUSION 

An  analysis  of  the  average  BER  performance  of  a  binary 
DS/SS  satellite  communication  system  operating  in  a  scin¬ 
tillating  channel  has  been  presented.  Numerical  results 
show  that  the  performance  is  expected  to  degrade  as  the 
scintillation  intensity  increases  and  to  be  worse  for  sites 
with  the  highest  scintillation  variability  statistics.  It  may 
also  be  concluded  that  the  degradation  is  predominated  by 
co-channel  interference,  rather  than  by  signal  fluctuations, 
when  a  high  number  of  asynchronous  users  are  simultane¬ 
ously  accessing  a  satellite.  In  the  future,  we  plan  to  verify 
these  theoretical  predictions  by  experimental  measurements 
conducted  as  part  of  the  Georgia  Tech  ACTS  propagation 
effects  experiment. 

6.  REFERENCES 

[1]  D.  H.  Howard  and  P.  G.  Steffes,  ‘‘Georgia  Tech  ACTS 
class  II  experiment,”  in  Proceedings  of  the  VI  ACTS 
Propagation  Studies  Workshop  (APSW  VI),  pp.  159- 
177,  November  1994. 

[2]  D.  C.  Cox,  H.  W.  Arnold,  and  H.  H.  Hoffman,  “Observa¬ 
tions  of  cloud-produced  amplitude  scintillations  on  19- 
and  28-GHz  earth-space  paths,”  Radio  Science,  vol.  16, 
no.  5,  pp.  885-907,  1981. 

[3]  T.  J.  Mousley  and  E.  Vilar,  “Experimental  and  theo¬ 
retical  statistics  of  microwave  amplitude  scintillations 
on  satellite  down-links,”  IEEE  Transactions  on  Anten¬ 
nas  and  Propagation,  vol.  AP-30,  no.  6,  pp.  1099-1106, 
1982. 

[4]  R.  K.  Crane  and  D.  W.  Blood,  “Handbook  for  the  esti¬ 
mation  of  microwave  propagation  effects  -  Links  calcula¬ 
tions  for  earth-space  paths  (path  loss  and  noise  estima¬ 
tion),”  Tech.  Rep.  P-7376-TR1,  NASA  Goddard  Space 
Flight  Center,  1979. 

[5]  O.  P.  Banjo  and  E.  Vilar,  “Binary  error  probabilities  on 
earth-space  links  subject  to  scintillation  fading,”  Elec¬ 
tronics  Letters,  vol.  21,  no.  7,  pp.  296-297,  1985. 

[6]  M.  B.  Pursley,  “Performance  evaluation  for  phase- 
coded  spread-spectrum  multiple- access  communication- 
Parts  I  &:  II,”  IEEE  Transactions  on  Communications, 
vol.  COM-25,  no,  8,  pp.  795-803,  1977. 


70 


Fade  Prediction  and  Control  Systems 

Michele  Luglio 

Universita  di  Roma  “Tor  Vergata”  —  Dipartimento  di  Ingegneria  Elettronica 
Via  della  Ricerca  Scientifica  00133  -  Roma  -  Italy 
lugl  io@  tov  vx  1  xcd.utovrm  .it 


ABSTRACT 

The  growing  utilisation  of  high  frequencies  (Ka  band) 
has  made  very  important  the  use  of  one  or  more  fade 
countermeasures  (UPPC,  Site  Diversity,  Frequency 
Diversity,  Adaptive  Coding,  etc.)  to  overcome  rain 
fading  avoiding  to  overdimension  the  systems. 

Fade  countermeasures  could  provide  better 
performances  if  supported  by  a  control  system  able  to 
obtain  the  prediction  of  dynamic  characteristics  of  rain 
fading  in  the  instant  in  which  the  attenuation  crosses  a 
certain  threshold.  Such  a  system  would  allow  to  avoid 
overcompensation  or  false  alarms  as  well  as  to 
optimise  the  available  resources  in  case  of  multilevel 
compensation,  with  an  efficiency  as  great  as  the 
prediction  would  be  accurate. 

In  this  paper,  after  the  description  of  the  main 
techniques  for  real  time  measurement  of  the 
attenuation,  a  hypothesis  of  a  new  control  system  with 
the  relative  algorithm  is  presented.  This  system  bases 
the  prediction  of  the  dynamic  characteristics  of  fading 
on  the  observation  of  some  initial  parameters. 

L  INTRODUCTION 

Usually  to  determine  attenuation  level  on  a  given  link 
pre-elaborated  models  are  used.  The  most  used  models 
are  those  of  the  CCIR  [1]  and  those  of  the  NASA  [2]. 
These  models  are  easy  to  implement  and  accurate  in  an 
average,  global  sense.  On  the  other  hand  they  suffer 
from  limitations  when  applied  to  certain  regions  (e.g. 
tropics),  or  to  the  prediction  of  phenomena  for  which 
no  systematic  data  exist  (ice-crystal  depolarisation),  and 
to  model  boundary  regions  (low  elevation  angles)  [3]. 
Most  of  the  attenuation  prediction  models  convert 
rainfall  rate  statistics  into  fade  probability  using  the 
analytically  derived  specific  attenuation  per  unit 
distance  (dB/km)  together  with  an  effective  path  length 
and  are  developed  using  meteorological  knowledge  and 
limited  sets  of  data. 

In  the  case  that  one  of  the  fade  compensation  methods 
is  used,  it’s  not  very  important  the  prediction  of  the 
probability  with  which  fading  overcomes  a  given 
threshold  during  the  year,  but  the  prediction  of  the 
value  that  some  parameters  can  assume  within  a  short 
interval.  Thus  it’s  important  to  get  prediction  models 
of  dynamic  characteristics  of  rain  attenuation. 

It  would  be  necessary  to  have  a  control  system  that, 
according  with  a  real  time  evaluation  of  channel  quality 
and  with  a  prediction  of  the  rain  parameters,  taking 
eventually  in  account  its  past  evolution,  could 
determine  the  timing  of  the  whole  process. 

The  problems  are  two:  instantaneous  knowledge  of  fade 


level,  at  the  frequency  of  the  link,  and  prediction  of 
future  characteristics  of  the  attenuation. 

The  knowledge  of  channel  conditions  can  be  obtained 
observing  primary  parameters  (attenuation,  statistical 
distribution  of  rain)  or  secondary  parameters  (signal  to 
noise  ratio,  BER,  received  signal  characteristics)  [4]. 

It  is  possible  to  know  attenuation  level  by 
continuously  measuring  and  scaling  a  known  signal, 
transmitted  by  the  satellite,  with  an  additional  receiver. 
If  this  signal  is  not  available  it’s  possible  to  use  a 
radiometer  or  to  note  any  reduction  in  link  SNR. 
Monitoring  only  one  carrier  it  is  not  possible  to 
separate  the  contributions  on  the  two  links  (in  case  of 
transparent  payload).  If  the  loopback  signal  is  available 
an  estimate  of  the  attenuation  at  each  frequency  can  be 
made.  If  many  carriers  from  different  stations  are 
monitored,  a  fall  in  level  of  all  carriers  implies 
presence  of  fading  on  down  link  otherwise  the  fading 
affects  a  certain  up  link.  Interference  and  variations  due 
to  equipment  and  to  satellite  instability  can  cause  low 
precision. 

To  predict  future  characteristics  of  attenuation  it  is 
necessary  to  verify  the  correlation  between  one  or  more 
parameters  linked  to  initial  characteristics  of  rain  fading 
and  the  dynamic  characteristics  of  the  phenomenon 
itself.  Once  this  would  be  verified,  the  problem  is  to 
elaborate  an  algorithm  able  to  provide  a  real  time 
prediction. 

In  section  V,  taking  hint  from  a  system  proposed  at 
Coventry  University  (U.K.),  a  control  system  is 
introduced.  This  system  would  offer  the  possibility  to 
obtain  dynamic  control  and  management  of  the 
resources  utilising  an  algorithm  to  predict  some 
parameters  by  real  time  observation  and  of  the 
characteristics  of  previous  events. 

II.  RAIN  FADE  OBSERVATION 

To  obtain  and  develop  reliable  statistical  models  for 
rain  fade  prediction  very  long  observation  time  is 
necessary  [7]. 

The  most  commonly  observed  parameters  are  the 
duration  (time  interval  during  which  the  attenuation  is 
greater  than  a  given  threshold),  the  depth  (peak  level 
within  the  duration),  the  rain  rate  (absolute  value  of  the 
derivative  or  as  the  intensity,  mm/h),  the  interfade  time 
(time  interval  during  which  the  attenuation  is  lower 
than  the  threshold). 

Slope 

The  attenuation  is  a  continuous  function  of  time  and 
its  slope,  when  a  threshold  is  crossed,  can  be  defined  as 
the  derivative  (in  the  crossing  point).  Since  we  can 
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have  only  a  sampled  sequence,  an  optimum  criterion  to 
estimate  the  slope  has  to  be  found  [8]. 

So  far  no  correlation  between  slope  and  dynamic 
characteristics  of  fading  has  been  demonstrated. 

The  research  of  the  parameter  linked  to  the  initial 
characteristics  of  fading  that  more  than  others  would  be 
correlated  with  its  dynamic  characteristics  is  an  open 
problem,  in  the  sense  that  no  accurate  study  has  been 
conduct^  on  the  correlation  between  the  initial  instant 
and  the  time  evolution  of  the  phenomenon. 

Empirical  and  theoretical  distributions 
A  hist  step  in  analysing  collected  data  is  to  verify  if 
the  empirical  distributions  (of  rain  fade  or  of  one  of  the 
parameters)  match  one  of  the  statistical  distributions 
already  known  and  characterised  by  means  of  one 
algorithm  such  as  x^,  Kolmogorov  Smirnov,  etc. 

The  results  referred  to  4  years  of  measurements  in 
Texas  showed  that  the  rain  rate  distribution  well  fits 
the  normal  distribution,  while  the  year  to  year 
variability  of  the  cumulative  distribution,  at  a  given 
percentage  of  time,  fits  a  log-normal  distribution.  The 
distribution  of  the  attenuation  level  in  dB  follows 
different  statistics  for  different  percentage  of  time  [3]. 
Time  variations  of  rain  attenuation  show  that  the  rain 
rate  grows  as  a  function  of  attenuation,  providing  a 
further  proof  of  the  log-normality  of  the  process  [6]. 
Both  fade  and  interfade  duration  follow  a  log-normal 
statistic  both  in  K  and  Ka  band  [6]  and  in  X  band  [3]. 
These  results  are  confirmed  in  X  band,  for  durations 
greater  than  32  s  [9] [10],  while  for  lower  values  they 
correspond  to  a  power  law  model,  not  depending  in 
both  cases  on  the  site  or  on  the  threshold.  In  [11]  a 
software  to  analyse  measured  data  at  30  GHz  with 
Olympus  is  presented.  The  results  referred  to  data  at  30 
GHz  during  4  months  (Sept.  ‘91  -  Jan.  ‘92)  with 
Olympus  [12]  agree  with  what  is  shown  in  [10]. 

III.  PREDICTION  OF  THE  DYNAMIC 
CHARACTERISTICS 

The  prediction  of  the  evolution  of  rain  fade  can  be 
obtained  using  one  of  the  linear  prediction  techniques 
using  auto-regressive  series  or  utilising  polynomial 
methods,  with  the  observed  samples.  In  both  cases  the 
precision  of  the  prediction  depends  on  the  number  of 
samples,  on  sampling  frequency  and  on  the  margin 
introduced  to  take  in  account  scintillation,  variable 
during  the  24  hours  (greater  during  the  afternoon  [3]). 
The  prediction  method  presented  in  [5],  based  on  the 
maximum  entropy  principle,  is  a  type  of  linear 
prediction  that  implies  the  approximation  of  the  signal 
level  at  any  instant  with  of  a  linear  combination  of  the 
previous  n  samples.  The  accuracy  of  this  algorithm 
depends  on  the  predict-ahead  time  intervals.  For  a  20 
second  prediction  interval,  the  maximum  error  is 
within  a  range  of  about  ±1.5  dB. 

In  the  framework  of  Sirio  experiments,  utilising  the 
data  collected  at  Spino  d’Adda  (within  four  years  at 


11.6  GHz  with  a  sampling  frequency  of  2  s),  it  has 
been  shown  that  two  successive  fade  durations  within  a 
rain  event  or  between  different  rain  events  are 
statistically  independent  [13]. 

IV  .  CONTROL  SYSTEMS 
Control  system  simulation 

Utilising  the  measured  data  collected  with  Sirio  in 
three  sites  (Spino  d’Adda,  Lario  and  Fucino)  at  11.6 
GHz  a  control  system  for  fading  compensation  has 
been  simulated  at  Politecnico  of  Milano  making  a  real 
time  prediction  elaborating  the  previous  samples  with 
an  algorithm  based  on  linear  regression  [14].  The 
attenuation  level  is  predicted  with  an  anticipation  equal 
to  2  s  (expected  response  time  of  the  system)  and  thus 
compared  with  a  threshold  (the  minimum  acceptable 
quality);  if  the  predicted  value  is  above  the  threshold 
the  system  starts  with  the  procedure. 

The  performances  of  this  algorithm  depend  on  the 
sampling  frequency,  on  the  number  of  previous 
samples  and  on  the  value  of  the  margin  introduced  to 
counteract  scintillation.  They  are  better  (the  percentage 
of  outages  is  lower)  if  the  number  of  samples  is  lower, 
because  fast  variations  of  attenuation  are  easier  to 
predict  with  a  small  number  of  samples  (even  2).  On 
the  other  hand,  with  a  too  small  number  of  samples  or 
with  a  too  high  threshold  for  the  scintillation  a  great 
number  of  false  alarms  occurs. 

The  control  system  efficiency  (ratio  between  the 
effective  utilisation  time  of  the  compensation  and  the 
utilisation  time  in  an  ideal  system  with  no  switching 
delay  and  no  response  time),  is  between  0.8  and  1.3. 
For  values  close  to  2  the  number  of  false  alarms  tends 
to  zero. 

FCMIS  (Fade  CounterMeasure  Intelligent  System) 

An  “intelligent”  detection  and  control  system  is 
proposed  in  [7]  (figure  2).  The  dynamic  characteristics 
of  fading  and  some  prediction  techniques  can  provide 
different  informations  for  the  management  of  the 
compensation  procedure.  These  different  informations 
concern  some  parameters  that  can  have  different 
characteristics  and  statistics,  not  excluding  correlation 
among  them. 

A  first  parameter  is  the  duration:  the  duration 
distribution  can  also  be  utilised  in  the  system  design 
phase  to  dimension  the  backup  resource.  A  second 
parameter  is  the  interfade  duration:  the  distribution  of 
interfade  can  be  utilised  to  estimate  the  backup 
resources  availability.  A  third  parameter  is  the  fade 
rate:  the  knowledge  of  this  parameter  can  be  useful  for 
a  dynamic  management  of  the  power  margin.  A  fourth 
parameter  is  the  fade  depth:  the  distribution  of  fade 
depth  allows  the  prediction  of  the  occurrence  of  an 
attenuation  amplitude  peak;  this  information  could  be 
very  important,  both  in  design  and  in  compensation 
management  phase  to  estimate  the  compensation  level 
in  case  of  multilevei  technique. 
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Figure  2  -  FCMIS  [7] 


V.  PROPOSAL  OF  A  CONTROL  SYSTEM 

The  intelligent  control  system  (FCMIS)  proposed  in 
[7]  suggests  a  hypothesis  on  how  various  informations 
regarding  the  estimate  of  several  parameters,  together 
with  r^l  time  observation,  can  co-operate  in  deciding 
the  activation  of  fade  compensation  techniques. 

Starting  from  the  FCMIS  idea  it’s  necessary,  first  of 
all,  to  define  in  greater  details  the  functionality  of  the 
different  blocks  of  figure  2  and  how  some  of  the 
informations  can  be  utilised  for  dynamic  control  and 
management  of  the  resources. 

Control  System  with  Dynamic  Prediction 
The  FCMIS  idea  can  be  expanded  and  detailed  in  the 
Control  System  with  Dynamic  Prediction  shown  in 
figure  3,  consisting  of  5  stages. 

The  first  stage  consists  in  real  time  observation  in 
which  the  parameters  that  characterise  the  rain  event 
such  as  duration,  depth,  amplitude  and  interfade 
duration  are  detected  and  measured.  In  the  prediction  and 


evaluation  of  statistical  models  stage,  these  parameters 
are  stored,  elaborated  and  compared  with  the  statistics 
obtained  with  previously  collected  data.  The  statistical 
models  are  thus  updated.  Data  are  elaborated 
considering  real  time  observation,  previous  events  and 
attenuation  curve  characteristics. 

The  results  of  this  stage  are  then  utilised  in  the 
decision  stage.  Here  the  probability  that  the  parameters 
could  assume  values  greater  than  a  threshold  is 
compared  with  a  prefixed  value  of  probability.  The 
results  of  this  comparison  are  then  utilised  for  the 
decision,  with  a  logical  AND  type  combination  among 
the  different  inputs. 

Thus  the  start  of  the  compensation  procedure  depends 
on  resource  availability,  verified  in  the  activation  stage 
and  can  be  dynamically  controlled  utilising  the 
predicted  values  of  fade  and  interfade  duration. 

In  the  management  stage  the  procedure  is  carried  out. 
The  amplitude  and  depth  estimates  can  be  utilised  to 
control  the  amplitude  of  the  utilised  resources  in  case 
of  multilevel  compensation  (quantized  UPPC, 
multilevel  gain  coding,  etc.),  while  fade  and  interfade 
duration  estimates  can  be  utilised  to  determine  resource 
availability. 

Dynamic  Prediction  Parameter 
Statistical  independence  between  two  fade  or  interfade 
durations,  as  verified  in  [13],  doesn’t  imply  absence  of 
correlation  with  the  characteristics  of  past  events.  It 
might  be  possible  to  elaborate  an  algorithm  to 
estimate  these  parameters  on  the  basis  of  past  events. 


An  auto-regressive  model  able  to  fit  rain  evolution 
needs  a  great  number  of  coefficients  and  very  large 
matrices,  as  well  as  the  polynomial  interpolation  of 
the  sequence  needs  a  polynomial  with  high  degree  and 

as  many  coefficients.  r  j  .u 

The  prediction  of  fade  or  interfade  duration  or  ot  depm 
is  more  accurate  if  a  great  number  of  samples  is 
considered,  taking  in  account  long  period  statistics 
with  a  high  enough  sampling  frequency.  On  the  other 
hand,  a  too  long  sequence  would  cause  such  a 
complexity  to  require  a  not  negligible  computation 

power  and  elaboration  delay . 

Thus  to  evaluate  the  next  value  it  is  possible  to 
consider  only  the  last  two  or  three  samples  (according 
with  [14]),  while  an  efficient  method  to  utilise  data 
collected  in  many  years,  avoiding  high  and 
unacceptable  complexity,  could  be  the  following. 
Assuming  the  slope  of  the  attenuation  curve  as  a 
meaningful  parameter  for  the  prediction  we  could 
suppose  that  different  parameters  would  correspond  to 
different  slopes. 

The  average  and  the  variance  of  the  collected  data  are 
dynamically  calculated,  updated  and  stored  according  to 
the  slope  at  the  beginning  of  the  rain  event. 

In  this  way  two  tridimensional  matrices  are  filled,  with 
the  columns  corresponding  to  the  parameters,  the  rows 
corresponding  to  different  values  of  the  slope  and  the 

third  dimension  to  season  or  month  (figure  4). 

This  sort  of  information  can  be  meaningful  considering 
that  the  statistical  independence  between  two  events 
verified  in  [13]  and  the  randomness  of  the  ram  don  t 
imply  necessarily  the  staUstical  independence  between 
two  events  that,  occurring  after  a  relatively  long  time, 
present  common  characteristics  (season,  slope,  etc.). 
With  this  procedure  it  is  possible  to  obtain  the  data 
site  by  site  because  each  station  can  detect  and 
elaborate  the  statistics  by  itself. 

The  algorithm  shown  in  figure  5,  that  represents  the 
prediction  and  evaluation  of  statistical  rnodels  stage  ot 
figure  3,  implements  the  procedure  described  above. 

In  this  diagram  Xj  is  the  i-th  sample  (last  sample),  S 
represents  the  threshold,  or  the  fixed  margin  for  the 
link,  d,  IN,  Xmax^  are  used  to  measure  duration, 
interfade  and  peak  values,  while  D,  INT  and  P  are  the 
parameters  and  D,  INI,E,  are  the  averages;  are  the 
variances;  A  is  the  number  of  estimated  samples. 

After  the  detection,  the  i-th  sample  is  compared  with 
the  threshold  and  the  different  counters  are  increased 
according  to  the  result  of  the  comparison.  The  value  of 
the  parameter  for  the  current  event,  once  calculat^,  is 
utilised  to  update  the  average  and  the  variance  of  the 
distribution  of  the  parameter  itself. 

The  evaluation  of  the  slope  is  made  active  only  when 
the  threshold  is  crossed.  The  estimated  value  is  thus 
compared  with  the  Up  values  of  slope  to  put,  m  the 
respective  position  of  the  matrices,  the  different  values 
thus  obtained.  Dynamically  updated  averages  and 
variances  are  obtained.  The  estimate  of  the  A  next 
values  is  made  active  only  during  the  fade  duration  and 


is  performed  using  the  last  K  samples  with  an  auto¬ 
regressive  algorithm. 


Figure  4  -  Parameters  matrix 


Limits  of  the  model  .  .  , 

In  this  moment  the  assumption  that  the  slope  is 
correlated  with  the  dynamic  evolution  of  rain  fading  is 
not  proved  by  any  kind  of  empirical  test,  but  this 
system  could  represent  a  significant  step  to 
demonstrate  or  not  this  statement.  From  this 
implementation  we  can  expect  one  of  the  following 

results;  .  ,  ,  , 

a)  data  classified  as  a  function  of  the  slope  are 

correlated  with  it;  •  u  u 

b)  it’s  not  possible  to  correlate  the  slope  with  the 

future  evolution  of  rain  fade; 

c)  the  slope  shows  correlation  only  with  some  ot  the 

parameters  (for  example  peak,  amplitude); 

d)  the  correlation  with  the  future  evolution  exists  only 
for  some  values  of  the  slope  (the  relationship  is  not 

linear).  ,  „  .  j  , 

The  system  has  been  tested  with  the  data  collected  zX 
Politecnico  of  Milano  with  a  29  GHz  station  during  7 
months  of  observation,  but  the  very  low  number  ot 
events  has  not  allowed  to  elaborate  any  result. 


Prediction  accuracy  _  . 

The  main  factors  of  inaccuracy  are:  inaccuracy  in  the 
measurement,  caused  by  equipment  or  by  frequency 
scaling,  correlation  between  the  initial  charactenstics 
of  rain  fading  (the  slope  in  our  case)  and  soitie 
parameters,  the  estimate  procedure  of  the  synthetic 
parameter  of  the  initial  characteristics  (different 
methods  to  evaluate  the  slope  of  a  sequence  provide 
different  results),  the  classification  of  the  events 
according  to  the  slope,  the  matching  procedure 
performed  between  empirical  probability  distributions 
and  theoretical  ones. 


CONCLUSIONS 


The  dynamic  management  and  the  optimisation  of  the 
resource,  in  case  any  rain  fade  compensation  technique 
is  used,  are  quite  complex  problems.  Besides,  it  is  not 


so  far  thoroughly  studied.  „  ,  , 

A  system  based  on  the  prediction  of  the  dynamic 
■  •  ''  •  hoc  Kppn  nronosed  and 


described. 


74 


Figure  5  -  Dynamic  prediction  algorithm 


Before  going  forward  with  one  of  the  dynamic 
prediction  methods  it  is  necessary  to  establish  which 
parameter,  referred  to  the  initial  conditions  or  to  the 
past  events,  is  more  correlated  with  the  future 
evolution  of  rain  fade.  The  slope  of  the  attenuation 
curve  has  been  proposed,  but  this  doesn’t  exclude  that 
another  parameter  can  better  cooperate  to  the 
prediction.  This  aspect  represents  in  this  moment  an 
open  problem  that  shall  be  object  of  further  studies. 
The  idea  of  a  control  system  with  dynamic  prediction 
could  anyway  be  valid  even  if  instead  of  the  slope 
another  parameter  would  be  considered. 
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Abstract 

This  paper  describes  the  rain  attenuation 
characteristics  of  radio  links  and  the 
possibility  of  outage  free  links.  Since  1988, 
NTT  has  been  conducting  continuous  rain 
attenuation  measurements  on  experimental 
radio  links.  The  experiments  have  been 
carried  out  in  Tokyo.  The  measured 
frequencies  are  15,  18,  20  and  26  GHz  with 
transmission  distances  of  0.75,  2.8,  6  and  9 
km.  Rain  attenuation  characteristics  are 
measured  with  0. 1  sec  time  accuracy.  Field 
trial  results  show  that  the  measured  rain 
attenuation  distribution  has  a  maximum 
value,  and  radio  links  with  fading  margins 
that  exceed  the  maximum  rain  attenuation 
become  outage-free. 

1.  Introduction 

Since  1984,  NTT  has  provided  high-speed 
digital  leased  line  services  using  26  GHz 
microwave  links.  This  system  possess  a 
broad  bandwidth  and  provides  a  good 
solution  to  the  "last  mile  problem"[l].  On 
the  other  hand,  microwaves'especially  those 
above  10  GHz,  are  susceptible  to  rainfall 
fading,  which  degrades  transmission 
performance  during  rain. 

Since  1988,  NTT  has  been  conducting 
continuous  rain  attenuation  measurements  on 
experimental  links  to  study  outage 
characteristics.  In  order  to  reduce  estimation 
error,  rain  attenuation  characteristics  are 
being  measured  with  0. 1  sec  time  accuracy. 
Field  trial  results  show  that 

(1)  the  rain  attenuation  distribution  can  not 
be  described  by  the  Gamma  distribution 
at  the  upper  end. 


(2)  all  measured  rain  attenuation  distributions 
have  maximum  values, 

(3)  the  maximum  values  vary  annually  and 
depend  on  frequency  and  transmission 
distance, 

(4)  radio  links  having  fading  margins  that 
exceed  the  maximum  rain  attenuation  are 
outage-free. 

2.  Rain  attenuation  characteristics 
2.1  Prediction  of  attenuation  due  to 
rain 

Estimation  of  rain  attenuation  is  essential  for 
the  link  design  of  radio  systems.  Rain 
attenuation  Z  between  two  points  (point  A 
and  B)  is  expressed  as 

Z  =  f^RixYdx  (1) 

where  R(x)  is  the  rain  rate  at  point  x  between 
A  and  B,  n  is  a  constant  that  depends  on 
frequency  [2]. 

From  eq.(l),  rain  attenuation  is 
calculated  from  the  rain  rate  and  the  rain 
attenuation  distribution  is  estimated  from  the 
rain  rate  distribution. 

Several  ways  have  been  proposed  to 
estimate  the  rain  rate  and  rain  attenuation 
distributions.  The  Gamma  distribution  is 
widely  used.  Using  the  Gamma 
distribution,  the  rain  rate  distribution  is 
expressed  as  follows[2]; 

=  (2) 

where  v  and  are  Gamma  distribution 
parameters. 

For  rain  attenuation,  the  following 
equation  is  derived  from  eq.  (2). 
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(3) 


Nakano 


r(v,) 

where  v  «  and  p  e  are  Gamma  distribution 
parameters. 

Mathematically  speaking,  eqs.  (2)  and 
(3)  are  easy  to  calculate  and  suitable  for 
computation.  However,  probabilities /(/?) 
and/(Z)  are  never  zero  even  if  variables  R 
and  Z  approach  infinity.  This  means  that 
eq.(3)  requires  the  system  to  offer  infinite 
fading  margin  against  rainfall  to  make  the 
link  outage  free.  Is  this  realistic  ? 

Let  us  consider  rain  and  rain 
attenuation  physically.  Rain  rate  is  defined 
as  the  amount  of  rain  observed  over  an 
integration  period.  As  the  integration  period, 
one  minute  is  commonly  used  in  Japan.  It  is 
obvious  that  the  rainfall  measured  during  this 
limited  period  cannot  be  infinite.  The 
reasons  are  as  follows; 

(1)  Rain  drops  are  generated  from  water 
vapor  in  the  air.  The  amount  of  water 
vapor  is  limited. 

(2)  Because  the  speed  at  which  the  rain 
drops  fall  is  limited  by  air  resistance,  the 
number  of  rain  drops  observed  during  a 
fixed  period  is  also  limited. 

If  the  rain  rate  distribution  is  limited, 
the  rain  attenuation  distribution  is  also 
limited.  This  is  easily  explained  from 
eq.(l). 

From  the  above  discussions,  it  is 
forecast  that  eqs.(2)  and  (3)  cause  excessive 
estimation  error.  Furthermore,  the 
estimation  of  rain  attenuation  is  based  on  the 
estimated  one-minute  rain  rate,  if  the 
probability  is  less  than  one-minute,  the 
estimation  error  will  be  increased. 

2.2  Field  trial  system  configuration 
To  achieve  highly  reliable  radio  systems,  a 
field  trial  was  begun  in  Tokyo  in  August 
1988.  Routes  of  the  trial  system  are  shown 
in  Figure  1. 


Karagasaki 


Fig.  1  Field  (rial  routes 

Four  routes  with  transmission 
distances  of  0.75,  2.8,  6  and  9  km  were  set 
around  the  Karagasaki  central  station.  The 
frequencies  were  15,  18,  20  and  26  GHz. 
Four  26  GHz  links  with  different  fading 
margins  were  set  to  Shibuya.  The  rain 
attenuation  of  each  link  was  measured  with 
0. 1  s  accuracy  and  continuously  recorded  by 
a  computer. 

2.3  Measured  rain  attenuation 
Figure  2  and  3  show  measured  rain 
attenuation  at  26  GHz  over  the  transmission 
distances  of  0.75  km  and  2.8  km, 
respectively.  The  dashed  line  is  the 
estimated  probability  from  the  Gamma 
distribution.  As  the  rain  attenuation  becomes 
large,  the  probability  decreases  between  lO'l 
and  10*3.  On  the  other  hand,  when  the 
probability  becomes  small,  it  drops  quite 
rapidly,  indicating  that  the  rain  attenuation 
has  reached  a  maximum  limit;  i.e.,  it  is 
truncated.  The  maximum  limits  vary  each 
year,  but  in  each  year  the  rain  attenuation 
shows  the  same  tendency  to  truncate.  This 
result  shows  that  there  are  some  regions  in 
which  the  probability  of  rain  attenuation 
decreases  rapidly,  and  if  we  can  increase  the 
fading  margin  so  that  it  exceeds  the  highest 
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rain  attenuation  recorded,  the  links  will  be 
"outage-free". 

Figures  4  and  5  show  measured  rain 
attenuation  over  transmission  distances  of  6 
km.  Frequencies  are  20  and  26  GHz, 
respectively.  The  truncations  shown  in  the 
0.75  km  and  2.8,  km  links  at  26  GHz  are 
also  observed  in  the  links  of  6  km. 

Figure  6  shows  the  measured  rain 
attenuation  distribution  for  the  transmission 
distance  of  9  km  at  1 8  GHz.  The  truncations 
are  also  observed  here. 


Fig.  2  Rain  attenuation  distribution  over 
transmission  distance  of  0.75km 


Fig.  3  Rain  attenuation  distribution  over 
transmission  distance  of  2.8km 


Rain  attenuation  (dB) 

Fig.  4  Rain  attenuation  distribution  over 
transmission  distance  of  6km 


Rain  attenuation  (dB) 

Fig.  5  Rain  attenuation  distribution  over 
transmission  distance  of  6km 


In  Figures  2  to  6,  the  maximum  values 
of  the  measured  rain  attenuation  vary  with 
frequency  and  transmission  distance.  They 
also  vary  from  year  to  year. 

3.  Towards  outage  free  radio  systems 
We  set  four  links  with  different  fading 
margins  between  Karagasaki  and  Shibuya  to 
examine  fading  margin  dependence  on  the 
BER  characterisrics.  The  margins  are  24, 
43,  55  and  60  dB. 
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Fig.  6  Rain  attenuation  distribution  over 
transmission  distance  of  9km 


Figure  7  shows  the  cumulative  time 
over  which  the  measured  BER  of  lO"^  was 
exceeded.  The  dashed  line  denotes  the 
probability  estimated  using  the  Gamma 
distribution.  In  1989,  there  was  a  major 
downpour  on  August  1.  The  downpour  was 
so  heavy  that  the  meteorological  observatory 
in  Tokyo  recorded  the  highest  daily  rainfall 
in  Japanese  history  for  that  month.  It  is 
notable  that  many  bit  errors  occurred  in  links 
with  fading  margin  below  54  dB,  but  the 
link  with  the  fading  margin  of  60  dB  had  no 
bit  errors.  In  1992,  we  also  experienced 
heavy  rainfall  on  December  8.  In  that  year, 
total  rainfall  was  small  compared  to  another 
year.  On  December  8,  however,  extremely 
heavy  rain  was  observed  for  several  tens  of 
minutes,  and  the  link  with  fading  margin  of 
60  dB  suffered  bit  errors.  Note  that  the 
outage  was  only  0.5  minutes  over  7  years. 
Furthermore,  if  the  link  had  10  dB  higher 
margin,  the  link  would  have  remained  outage 
free. 

From  the  above  discussion,  it  is  clear 
that  maintaining  a  high  but  practical  margin 
makes  radio  links  sufficiently  robust  against 
rainfall,  they  can  be  made  outage-free. 


Fig.  7  Fading  margin  dependence  of  outage  time 


4.  Conclusion 

This  paper  described  the  rain  attenuation 
characteristics  of  radio  links  and  showed 
how  to  make  outage-free-links.  The  results 
are  summarized  as  follows; 

(1)  the  rain  attenuation  distribution  can  not 
be  described  by  the  Gamma  distribution 
at  the  upper  end, 

(2)  all  measured  rain  attenuation  distributions 
have  maximum  values, 

(3)  the  maximum  values  vary  annually  and 
depend  on  frequency  and  transmission 
distance, 

(4)  radio  links  having  fading  margins  that 
exceed  the  maximum  rain  attenuation  are 
outage-free. 
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There  is  a  continuing  and  unmet  need  for  high  power,  solid-state  sources  at  microwave 
and  millimeter  wavelengths.  At  microwave  frequencies,  transmission  line  or  cavity 
combiners  have  been  successful,  but  at  millimeter  wavelengths  this  approach  has  problems 
in  terms  of  losses,  power  output  and  dimensional  tolerances.  In  the  past  few  years, 
significant  efforts  have  been  carried  out  to  develop  planar  arrays  of  solid-state  oscillators 
in  order  to  produce  spatial  power  combining.  These  structures  have  used  open  Fabry- 
Perot  as  well  as  conventional  transmission  lines  for  combining  the  active  devices,  to 
proved  feedback  for  coherent  locking  of  the  individual  oscillating  devices  and  to  couple 
energy  to  an  electromagnetic  wavebeam.  In  addition  a  new  class  of  hybrid  planar  quasi- 
optical  waveguides  and  resonators  are  under  investigation  for  power  combining.  Active 
devices  utilized  have  included  two  terminal  GUNN  or  IMP  ATT  diodes  as  oscillators,  and 
three  terminal  devices  such  as  FETs  or  HEMTs  as  both  oscillators  and  amplifiers.  With 
three  terminal  devices  currently  being  more  popular.  This  presentation  discusses 
architectures  under  active  investigation. 

Introduction 

For  many  years,  there  has  been  a  need  to  obtain  more  power  from  solid-state  devices,  both 
sources  and  amplifiers.  There  exists  many  system  applications  in  common  use  today  and 
their  range  of  applications  is  expanding.  It  is  safe  to  predict  that  these  systems  will  be 
utilized  in  the  future  across  the  spectrum  of  endeavors  from  communications,  radar,  to 
transportation,  industrial  and  scientific  applications.  Satisfying  this  expanding  demand 
mandates  the  utilization  of  previously  unused,  or  little  used,  mm-wave  and  sub-mm-wave 
bands.  In  accordance  with  a  long  term  trend  toward  systems  operating  at  higher  and 
higher  frequencies,  the  necessary  technology,  however,  is  not  very  well  developed  at  the 
present  time.  A  fundamental  limitation  has  been  and  continues  to  be  the  lack  of  convenient 
power  sources  and  amplifiers.  Component  costs  have  been  driven  by  the  small  size  and 
tight  tolerance  associated  for  mm  sources  and  in  the  case  of  waveguide  components,  by 
the  need  for  hand  assembly.  This  is  more  urgent  at  millimeter  wave  frequencies  than  at 
microwaves  because  microwave  sources  generally  give  better  performance  in  terms  of 
such  parameters  as  power  output,  efficiency  and  /  or  spectral  purity.  Solid  state  devices 
are  highly  reliable,  however,  their  output  power  tends  to  be  very  low  due  to  their  small 
physical  size  of  the  active  region,  resulting  in  the  well  known  1/f  ^  fall-off  of  available 
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power.  Hence,  a  need  exists  to  combine  the  outputs  of  many  individual  elements  to  satisfy 
the  system  power  requirement. 


Many  of  the  problems  stated  above  may  be  resolved  through  the  use  of  quasi-optical 
techniques.  Quasi-optical  devices  typically  have  cross  sectional  dimensions  in  the  order  of 
10  to  100  wavelengths  and  are  relatively  easy  to  fabricate.  Tolerance  requirements  are 
greatly  relaxed  since  boundary  surfaces  along  the  propagating  directions  of  the  guiding 
structure  are  not  critical  for  mode  selection  and  maintenance  of  mode  purity.  Rather, 
easily  manufactured  lenses  or  reflectors,  and  their  spacing  between  them,  establish  the 
mode  parameters.  In  addition,  the  rather  large  transverse  dimensions  of  quasi-optical 
structures  allow  one  the  freedom  to  include  numerous  solid  state  sources  to  achieve  the 
desired  output  power. 

Architectures 

All  quasi-optical  systems  have  several  features  in  common:  at  least  one  transverse 
dimension  is  large  compared  to  the  wavelength;  the  longitudinal  dimension  is  also  large 
and  may  be  large  compared  to  the  transverse  dimension;  hence,  many  individual  solid- 
state  sources  may  be  integrated  into  the  structure.  While  the  dimensions  are  large 
compared  to  the  wavelength,  the  mode  structure  is  such  that  only  a  single  mode,  or  at 
most  a  small  number  of  modes  may  exist  in  the  structure.  ‘ 

In  recent  years,  research  emphasis  has  been  placed  upon  three  principle  classes  of  quasi- 
optical  structures  for  oscillator  design.  The  first  structure  to  be  investigated  was  that  of  an 
open  resonator,  commonly  referred  to  as  the  Fabry-Perot  resonator,  with  solid  state 
sources  located  in  a  plane  transverse  to  the  direction  of  propagation.  The  electromagnetic 
mode  structure  of  open  resonators  is  well  known  and  characterized.  *  Since,  the  physical 
size  of  each  solid-state  source  is  much  smaller  than  the  wavelength,  a  coupling  device 
must  be  employed  to  extract  energy  from  the  active  element  and  transfer  it  to  the  wave 
beam.  Many  coupling  devices  have  been  investigated  including  small  dipoles  ^ ,  loop 
elements  ^ ,  microstrip  elements  '*  ,  and  one  dimensional  waveguide  structures  ® .  In  each 
case,  the  goal  of  such  coupling  elements  is  to  impedance  match  between  the  active  device 
and  the  electromagnetic  wavebeam.  An  example  of  an  open  resonator  structure  is  shown 
in  figure  1 .  Feedback  required  for  oscillation  is  obtained  primarily  through  the 
electromagnetic  wavebeam,  with  a  small  but  measurable  contribution  due  to  direct  mutual 
coupling  between  the  coupling  devices  ’ . 

The  second  structure  to  receive  significant  investigation  is  that  of  the  grid  oscillator  and 
amplifier.  In  this  case  an  array  of  active  elements  is  placed  in  a  uniform  grid,  which 
intercepts  the  electromagnetic  wave  beam,  with  each  element  connected  to  its  nearest 
neighbors  via  a  printed  transmission  line  /  antenna  structure.  *  When  one  considers 
the  grid  structures,  feed-back  to  obtain  oscillation  is  principally  via  the  transmission  line 
located  between  the  active  elements,  with  secondary  coupling  occurring  via  the 
electromagnetic  beam.  While  the  electromagnetic  beam  coupling  is  small,  it  is  essential  to 
insure  all  active  elements  oscillate  coherently  and  in  phase  as  opposed  to  anti-phase 
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oscillation  “ .  An  example  of  the  typical  grid  oscillator  is  shown  in  figure  2a  *.  Also  shown 
in  figure  2b  a  detailed  depiction  of  one  active  element  cell  .  To  date,  grid  oscillators 
have  produced  the  highest  power  level  of  about  10  watts  in  the  X-band.  Grid  amplifiers 
are  very  similar  to  grid  oscillators,  only  the  feedback  element,  in  this  case  a  reflector  is 
removed  from  the  system.  Grid  amplifiers  have  shown  relative  gain,  from  the  off  state  to 
the  on  state  of  about  15  dB  operating  at  about  40  Ghz*'‘ . 

The  third  class  of  quasi-optical  systems  are  based  upon  the  a  hybrid  dielectric  slab-beam 
waveguide.  **  The  hybrid  slab-beam  waveguide  consists  of  a  thin  dielectric  slab,  usually 
grounded  on  one  side,  into  which  phase  correcting  elements  are  inserted.  The  phase 
correcting  element  for  an  oscillator  is  usually  a  partially  transparent  curved  reflector. 
Energy  will  be  extracted  through  that  reflector.  The  hybrid  dielectric  slab-beam  waveguide 
confines  the  beam  in  dimension  through  wavebeam  techniques  as  discussed  above  and  in 
the  second  transverse  direction  (normal  to  the  dielectric  slab  surface)  through  the 
technique  total  internal  reflection  as  normally  employed  by  slab  dielectric  waveguides.  For 
this  structure,  one  obtains  a  large  area  suitable  for  the  integration  of  active  elements  along 
the  direction  of  propagation  and  across  the  wavebeam.  The  advantage  of  this  technique  is 
that  the  active  elements  may  be  integrated  as  a  component  of  the  ground  plane  and 
coupled  to  the  wavebeam  through  apertures  in  the  ground  plane.  This  technique  provides 
adequate  coupling  to  the  wavebeam,  while  providing  a  mechanism  for  efficient  cooling  of 
the  active  elements.  The  hybrid  dielectric  slab-beam  waveguide  techniques  continue  to 
show  promise,  however,  while  they  are  the  least  developed  of  the  three  techniques 
discussed  here  an  amplifier  based  upon  these  techniques  have  shown  input  to  output  gain 
of  about  10  dB  in  the  X-band  An  example  of  a  hybrid  dielectric  slab  /  wave-beam 
oscillator  is  shown  in  figure  3  a.  Also  shown  in  figure  3  b  is  a  detail  of  the  coupling  element. 

Conclusion 

Table  I  lists  the  specifications  of  reported  quasi-optical  spatial  combiners.  While  the 
intention  of  quasi-optical  techniques  is  to  develop  high  power  sources  at  millimeter 
wavelengths,  most  research  to  date  has  been  conducted  in  the  microwave  region  of  the 
spectrum.  It  should  be  pointed  out  that  research  results  shown  in  Table  I  has  been 
primarily  proof-of-concept  demonstrations  and  additional  development  effort  is  required. 
However,  several  academic  and  industrial  institutions  currently  have  programs  focused 
upon  the  millimeter  spectrum.  If  one  is  interested  in  pursuing  this  technology,  two 
extensive  overview  papers  are  available.  These  paper  also  extensively  reference 
research  that  has  been  conducted  for  the  interested  investigator. 
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i|aKU|tji 

KsiSl 

ARRAY 

SIZE 

DEVICE 

TYPE 

REFERENCE 

5.0 

10X10 

FET 

550 

7.3 

3X3 

FET 

282 

Mortazawi,  et.al 

8.2 

4X4 

FET 

184 

York,  et.al. 

9.8 

10X10 

FET 

10300 

Hacker,  et.al. 

34.7 

6X6 

HBT 

- 

37 

4X4 

HEMT 

1 

Wiltse,  et.al. 

60 

2X4 

IMPATT 

2200 

Compton,  et.al. 

TABLE  I  -  REPORTED  QUASI-OPTICAL  SOURCES 


FIGURE  1  -  OPEN  RESONATOR  CONFIGURATION 
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Collector  bias  line 
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FIGURE  2  -  GRID  OSCILLATOR  CONFIGURATION 
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Abstract.  This  paper  presents  a  quasi-op tical 
MESFET-based  dielectric  slab  waveguide  with  an  am¬ 
plifier  array.  Up  to  19  dB  amplifier  gain  and  9.5  dB 
system  gain  is  obtained  at  7.38  GHz.  Measurements 
of  amplifier  and  system  gain,  |52i|,  output  power  vs, 
input  power  and  transverse  power  distribution  are  pre¬ 
sented. 

1.  Introduction 

Several  quasi-optical  structures  have  been  reported, 
including  wave  beam  type  [1],  grid  type  [2],  mi¬ 
crostrip  coupling  type  [3],  and  dielectric  slab  beam 
waveguide  type  (DSBW)  [4]  for  power  combining. 
The  dielectric  slab  system  has  the  advantage  of  be¬ 
ing  two-dimensional  and  is  thus  more  amenable  to 
photolithographic  reproduction  than  the  conventional 
open  quasi-optical  power  combining  structures.  Pre¬ 
vious  investigations  of  the  quasi-optical  dielectric  slab 
cavity  and  waveguide  [5] ,  [6]  demonstrated  the  suitabil¬ 
ity  of  this  structure  for  the  integration  of  quasi-optical 
power  combining  MMIC  technology.  In  this  work,  the 
amplifier  arrays  are  located  on  'the  ground  plane  to 
reduce  perturbation  and  scattering  loss  of  the  beam- 
modes.  Measurements  of  amplifier  gain  and  system 
gain,  output  power  vs,  input  power  and  transverse 
electric  field  distribution  are  presented.  At  7.38  GHz, 
amplifier  gain  and  system  gain  reach  19  dB  and  9.5 
dB,  respectively. 

2.  Dielectric  Slab  Power  Combining  System 

The  dielectric  slab  waveguide  amplifier  system  in¬ 
corporates  four.  MESFET  amplifiers  and  two  thin  con¬ 
vex  lenses  as  shown  in  Fig,  1.  Detail  of  the  active 
region  is  shown  in  Fig.  2.  The  waveguide  system  was 
adjusted  with  the  transistors  turned  off  so  that  the 
guided  waves  are  focused  near  the  aperture  of  the  re¬ 
ceiving  horn(dl  =  12  cm,  d2  —  44  cm,  ci3  =  16  cm). 
The  dielectric  slab  is  Rexolite  (e  =  2.57,  tan(5  =  (1.0006 


d1  ,  d2  ,  d3 


Figure  1:  Dielectric  slab  waveguide  system  with  MES¬ 
FET  amplifiers  inside. 
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at  X-band  ),  and  it  is  27.94  cm  wide,  62  cm  long,  and 
1.27  cm  thick.  The  lenses  are  fabricated  from  Ma- 
cor  (e  =  5.9,  tan^  =  0.0025  at  100  kHz)  with  a  ra¬ 
dius  of  30.48  cm,  and  the  focal  length,  /,  is  28.54  cm. 
The  aperture  width  of  both  horn  antennas  is  9  cm, 
designed  to  be  wide  enough  to  catch  most  of  the  am¬ 
plified  power.  Energy  emitted  from  the  input  radiator 
propagates  in  a  quasi-optical  TE  Gaussian  mode  in  the 
dielectric  slab  waveguide, and  is  focused  by  the  first  lens 
in  the  middle  area  of  the  slab.  This  system  is  designed 
so  that  the  amplifier  unit  cells  are  within  the  beam 
waist  (the  1/e  field  points). 

The  amplifier  unit  cells  are  7  cm  x  1.5  cm  and  em¬ 
ploy  HP  ATF-10235  MESFETs.  This  design  was  de¬ 
rived  from  the  active  slot  line  notch  antenna  by  Lev- 
erich,  et  al.  [7].  An  amplifier  unit  includes  two  end- 
fire  Vivaldi  antenna  tapers  which  are  gate-receiver  and 
drain-radiator,  and  is  specifically  designed  to  eliminate 
surface-of-slab  to  ground-plane  resonance.  The  advan¬ 
tage  of  locating  the  amplifiers  on  the  ground  plane 
is  that  it  reduces  beammode  perturbation,  scattering 
losses,  and  reflection  of  the  input  energy  due  to  the 
amplifier  structure.  These  are  problems  with  ampli¬ 
fiers  mounted  on  the  surface  of  the  slab  [6]  and  in  the 
more  conventional  grid  system. 

3.  Experimental  Results  and  Discussions 

The  amplifier  arrays  were  located  at  two  loca¬ 
tions,  thus  receiving  different  input  power  distribu¬ 
tions.  Figs.  3  and  4  show  the  amplifier  gains  with 
system  input  powers  of  -16  dBm,  -13  dBm  and  -10 
dBm.  The  amplifier  gains  are  obtained  by  taking  the 
ratio  of  the  measured  |S'2i  |  when  the  amplifiers  are  with 
and  without  bi2ts.  The  highest  amplifier  gain  is  19  dB 
in  location  1  with  a  system  input  power  -16  dBm.  This 
amplifier  gain  can  compensate  the  insertion  loss  due  to 
the  beammode  energy  scattered  by  the  amplifiers. 

In  Fig.  5,  the  system  loss  and  insertion  loss  are 
shown.  The  system  loss  includes  beammode  scatter¬ 
ing  by  the  lenses,  and  the  horn  radiation  into  the  air. 
The  insertion  loss  is  due  to  beammode  perturbation 
by  the  amplifier  structure.  In  the  frequency  range  of 
operation,  the  system  loss  is  9-10  dB  when  the  system 
is  without  amplifiers,  and  the  insertion  loss  is  2.5  dB 
due  to  the  amplifiers  inserted  into  the  system. 

After  amplification,  the  guided  waves  spread  out 
wider  than  horn  aperture  (see  Fig.  9),  and  a  small 
proportion  of  the  amplified  energy  is  captured  by  the 
receiving  horn.  As  indicated  in  Fig.  6,  the  lenses  do 
serve  to  focus  energy  to  the  receiving  horn  in  spite  of 
the  scattering  losses  they  induce. 

Fig.  7  shows  the  system  output  power  P^ut  vs. 
input  power  Pin  when  the  amplifiers  were  with  and 
without  bias.  The  amplifier  gain  is  the  the  difference 


FREQUENCY  (GHz) 


Figure  3;  Amplifier  gain  for  location  1.  This  location  is 
22cm  away  from  the  second  lens. 


FREQUENCY  (GHz) 


Figure  4;  Amplifier  gain  for  location  2,  This  location  is 
18cm  away  from  the  second  lens. 
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FREQUENCY  (GHz) 


Figure  5:  \S2i  \  of  the  dielectric  slab  with/without  ampli¬ 
fiers.  The  insertion  loss  of  the  unit  cells  alone 
is  the  difference  between  these  two  |52i  |  curves. 
This  loss  is  believed  to  be  due  to  field  scatter¬ 
ing. 


FREQUENCY  (GHz) 


Figure  6:  Comparison  of  amplifier  gain  with /without 
lenses. 


between  Potit(Amp  ON)  and  Po«t(Amp  OFF).  The  sys¬ 
tem  power  gain  is  obtained  by  subtracting  Pin  from 
Po„t(Amp  ON).  When  Pout  =  “16  dBm,  system  power 
gain  and  amplifier  gain  are  9.5  dB  and  19  dB,  respec¬ 
tively.  Note  that  the  Pout  does  not  saturate.  This 
is  a  characteristic  of  the  quasi-optical  DSBW  ampli¬ 
fier.  When  the  the  amplifier  cells  saturate, the  excess 
input  power  continues  to  propagate  in  the  slab.  The 
power  added  efficiency  (PAE)  of  this  system  is  3.35%. 
However,  we  need  to  count  the  scattering  loss  from  the 
second  lens,  which  is  about  3.5  dB.  Therefore,  the  PAE 
is  about  7.5%. 

The  system  gain  (defined  as  Pout/Pin)^^  shown  in 
Fig.  8  with  Pin  =  10  dBm.  The  E-field  distribution 
measured  at  d  =  13  cm  is  shown  in  Fig.  9.  Compar¬ 
ing  the  field  distributions  for  amplifiers  without  biases 
and  for  no  amplifiers  inside  the  slab,  we  find  that  the 
\Ey\  decreases  about  2  to  4  dB  due  to  the  insertion  of 
the  amplifier  unit  cells  in  the  -5  cm  <  y  <  5  cm  re¬ 
gion.  Analysis  indicates  that  only  66%  of  the  amplified 
energy  is  captured  by  the  receiving  horn.  Consider¬ 
ing  37%  of  amplified  energy  is  not  received,  the  power 
added  efficiency  should  be  11.2%. 


Figure  7:  Pout  vs  Pin  as  amplifiers  with/ without  bias 
4.  Conclusions 

We  have  demonstrated  quasi-optical  power  combin¬ 
ing  in  a  dielectric  substrate  with  amplifiers  located  on 
ground  plane.  The  maximum  amplifier  gain  and  sys¬ 
tem  gain  reach  19  dB  and  9.5  dB,  respectively,  at  7.38 
GHz. 
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Figure  8:  Transmission  gain  for  the  slab  as  amplifier  with 
bias 


References 

[1]  J.  W.  Mink,  “Quasi-optical  power  combining  of 
solid-state  millimeter- wave  sources,”  IEEE  TVans, 
on  Microwave  Theory  and  Techniques,  vol.  MTT- 
34,  pp.  273-279,  Feb.  1986. 

[2]  Z.  B.  Popovicc,  R.  M.  Weilke,  M.  Kim,  and 
D.  B.  Rutledge,  “A  100-MESFET  planar  grid  os¬ 
cillator,”  IEEE  Trans,  Microwave  Theory  Tech,, 
vol.  MTT-39,  pp.  193-200,  Feb.  1991. 

[3]  N.  J.  Kolias  and  R.  C.  Compton,  "A  microstrip- 
based  quasi-optical  polarization  rotator  array”, 
IEEE  MTT-S  Ini,  Microwave  Symp.  Dig., 
pp.  773-776,  May  1995. 

[4]  J.  W.  Mink  and  F.  K.  Schwering,  ‘‘A  hybrid  dielec¬ 
tric  slab-beam  waveguide  for  the  sub-millimeter 
wave  region,”  IEEE  Trans.  Microwave  Theory 
Tech.,  vol.  MTT-41,  pp.  1720-1729,  Oct.  1993. 

[5]  F.  Poegel,  S.  Irrgang,  S.  Zeisberg,  A.  Schuene- 
mann,  G.  P.  Monahan,  H.  Hwang,  M.  B.  Steer, 
J.  W.  Mink,  F.  K.  Schwering,  A.  Paollela,  and  J. 
Harvey,  “Demonstration  of  an  oscillating  quasi- 
optical  slab  power  combiner,”  IEEE  MTT~S  Ini. 
Microwave  Symp.  Dig.,  pp.  917-920,  May  1995. 

[6]  H.  Hwang,  G.  P.  Monahan,  M.  B.  Steer,  J.  W. 
Mink,  J.  Harvey,  A.  Paollela,  and  F.  K.  Schwer¬ 
ing,  “A  dielectric  slab  waveguide  with  four  planar 
power  amplifiers,”  IEEE  MTT~S  Ini.  Microwave 
Symp.  Dig.,  pp.  921-924,  May  1995. 

[7]  W.  K.  Leverich,  X.-D.  Wu,  and  K.  Chang,  “FET 
active  slotline  notch  antenna  for  quasi-optical 
power  combining,”  IEEE  Trans.  Microwave  The¬ 
ory  Tech.,  vol.  MTT-41,  pp.  1515-1517,  Sept. 
1993. 


Figure  9:  \Ey\  pattern  on  the  top  surface  of  slab.  This 
pattern  is  measured  at  7.38GHz  with  Pin  =  10 
dBm 


92 


QUASI-OPTICAL  COMPONENTS  AND  SUBSYSTEMS 
FOR  COMMUNICATIONS 


Zoya  Popovic, 

Jon  Schoenberg,  Tom  Mader,  Wayne  Shiroma,  Stein  Hollung,  Milica  Markovic,  Jonathan  Dixon 
Department  of  Electrical  and  Computer  Engineering,  University  of  Colorado,  Boulder,  CO 

Scott  Bundy 

Superconducting  Core  Tech.,  Inc.,  Golden,  CO 


1.  INTRODUCTION  AND  BACKGROUND 

Although  most  wireless  communications  systems  are 
now  digital,  they  still  require  analog  front  ends,  since 
the  transmitted  signal  is  a  modulated  sinusoid.  Opti¬ 
mization  of  the  analog  transmitter  and  receiver  can  im¬ 
prove  the  digital  system  performance.  In  this  paper  we 
describe  a  new  approach  to  what  we  believe  are  more 
reliable,  less  expensive,  and  more  compact  front  ends  for 
microwave  and  millimeter-wave  communications.  This 
new  approach  is  based  on  quasi-optical  components, 
which  are  printed  surfaces  loaded  with  active  and/or 
passive  devices. 

Many  active  quasi-optical  components  have  been 
demonstrated  to  date:  oscillators  [1-8],  amplifiers  [9- 
13],  mixers  [14],  phase  shifters  [15,16],  multipliers  [17], 
and  switches  [18].  Although  these  demonstrations  show 
the  feasibility  of  a  quasi-optical  communication  system, 
a  practical  demonstration  combining  the  functions  of 
several  components  has  not  yet  been  reported. 

The  most  important  front-end  component  of  a  com¬ 
munication  system  is  the  transmit/receive  (T/R)  mod¬ 
ule.  A  quasi-optical  transceiver,  consisting  of  a  self- 
oscillating  FET  mixer,  was  reported  in  [19],  but  this 
circuit  did  not  incorporate  power  amplification,  low- 
noise  amplification,  or  modulation  functions.  Here  we 
describe  analysis,  design  and  measurements  of  the  nec¬ 
essary  quasi-optical  T/R  components.  We  also  discuss 
a  way  to  construct  a  quasi-optical  transceiver  from  the 
presented  components  and  present  some  demonstration 
results. 

Originally  developed  to  address  the  need  for  high- 
power  millimeter  wave  sources  [20],  quasi-optical  cir¬ 
cuitry  has  several  advantages  for  wireless  communica¬ 
tion  applications: 

•  Millimeter  waves  offer  more  bandwidth,  and  the  in¬ 
creased  atmospheric  attenuation  forms  a  natural  cell 
boundary,  making  these  frequencies  ideal  for  short- 
range,  high-capacity  indoor  networks  for  high-speed 
data  or  multimedia  transmission.  The  limited  output 


power  of  millimeter-wave  solid-state  devices  is  overcome 
by  spatially  power-combining  hundreds  of  low-power  de¬ 
vices  [1,5].  This  allows  power  levels  in  the  tens  of  watts 
range,  adequate  for  terrestrial  and  satellite  communica¬ 
tions. 

•  Quasi-optical  active  components  are  compact,  light¬ 
weight,  and  amenable  to  monolithic  integration.  The 
antenna  is  an  integral  part  of  the  circuit  design,  which 
allows  for  overall  system  size  reduction. 

•  In  a  mobile  environment  where  S/N  ratios  fiuctuate 
wildly  due  to  multipath  and  shadow-fading,  dynamic 
power  control  and  adaptive  beam-steering  techniques 
are  needed  to  obtain  energy-efficient  systems.  Recently, 
quasi-optical  amplifiers  with  beam-steering  and  beam¬ 
forming  capabilities  have  been  demonstrated  [12,13]. 

•  The  total  output  power  is  a  result  of  combining  a 
large  number  of  devices.  Hence,  the  reliability  of  quasi- 
optical  components  is  good;  a  fraction  of  the  devices  can 
fail  before  the  component  fails  [21,22]. 

2.  SYSTEM  COMPONENTS 

The  results  presented  here  describe  components  which 
perform  all  of  the  functions  that  a  quasi-optical  transcei¬ 
ver  requires:  signal  generation,  amplification,  reception, 
modulation  and  beam  control. 

Reliable,  compact,  solid-state  oscillators  are  needed 
for  signal  generation  and  as  LOs.  In  the  first  planar  grid 
oscillator  to  demonstrate  large-scale  power  combining 
[1],  the  powers  of  100  MESFET  oscillators  were  spa¬ 
tially  combined  to  produce  an  effective  radiated  power 
of  21  W  at  5  GHz.  This  type  of  device  has  demon¬ 
strated  graceful  degradation,  as  shown  in  [21,22].  In  [7], 
a  full- wave  theory  for  analyzing  grid  oscillators  consist¬ 
ing  of  arbitrarily  shaped  metal  gratings  printed  on  one 
or  both  sides  of  an  arbitrary  dielectric  was  presented. 
Several  successful  grid  oscillators  have  been  designed 
with  this  analysis  tool,  including  C-band  oscillators  [7], 
X-band  oscillators  [12],  and  stacks  of  grid  oscillators  [6]. 
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Figure  1:  A  10.25-GHz  grid  oscillator  (designed  with 
our  full- wave  analysis  program)  operates  exactly  at  the 
design  frequency  and  is  used  as  a  focal-point  feed  for  a 
lens  amplifier  combiner  [12]. 


For  example,  Fig.  1  shows  a  grid  oscillator  designed  to 
operate  at  10.25  GHz  as  a  feed  for  a  narrowband  free- 
space  lens  amplifier.  This  grid  has  a  rectangular  unit  cell 
and  an  asymmetric  metal  geometry.  It  was  built  with 
28  PHEMTs  and  oscillated  exactly  at  the  design  fre¬ 
quency  [12].  Another  example  is  a  millimeter- wave  grid 
oscillator,  fabricated  and  tested  in  collaboration  with 
TLC  Precision  Wafer  Technology  Inc.  and  Honeywell 
(Minneapolis).  The  simulated  oscillation  frequency  us¬ 
ing  small-signal  device  parameters  is  31.25  GHz,  while 
the  100-PHEMT  monolithic  grid  locked  at  31.1  GHz 

[23] . 

More  recently,  a  theory  for  analyzing  cascaded  sys¬ 
tems  of  multi-function  grids,  including  fixed-frequency 
and  tunable  filters,  and  mode-selective  grid  oscillators 
was  developed  [24,25].  Filters,  especially  those  with 
sharp  rolloff  characteristics,  are  essential  for  communi¬ 
cation  systems.  Tunable  filters  offer  additional  system 
flexibility.  A  voltage-controlled  frequency-selective  sur¬ 
face  (FSS)  was  designed  using  the  theory  presented  in 

[24] .  It  consists  of  a  printed  grid  loaded  with  varac¬ 
tor  diodes.  The  resonance  of  this  filter  tunes  in  a  30% 
bandwidth  when  the  bias  across  the  diodes  is  changed, 
as  shown  in  Fig.  2.  This  FSS  was  placed  in  front  of 
a  25-PHEMT  grid  oscillator  and  allowed  the  locked  fre¬ 
quency  to  be  set  at  either  4  or  6  GHz  [25].  The  radiation 
patterns  and  cross-polarization  levels  for  both  frequen¬ 
cies  are  the  same.  The  ability  to  selectively  choose  the 
carrier  frequency  is  useful  for  frequency-diversity  sys¬ 
tems. 

Low-loss  circular  polarizers  which  can  be  inserted 
in  the  transmission  path  are  useful,  as  circular  polar¬ 
ization  has  been  shown  to  suppress  multipath-delayed 
waves  [26],  resulting  in  improved  transmission  charac¬ 
teristics.  In  [25],  a  linear-to-circular  polarization  con- 
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Figure  2:  The  measured  electrical  tuning  of  the  reso¬ 
nance  of  a  varactor-loaded  tunable  FSS  shows  a  30% 
bandwidth  with  very  little  change  in  resonance  ampli¬ 
tude  [25]. 


verter  was  presented  which  has  a  1.2  dB  axial  ratio  with 
only  a  1-dB  transmission  loss  at  X-band.n 

High  gain,  low  noise  and  power  amplifiers  are  neces¬ 
sary  parts  of  every  transceiver.  A  lens  amplifier  array 
using  patch  antennas  is  shown  in  Fig.  3.  This  amplifier 
demonstrated  8  dB  of  absolute  power  gain  in  a  3%  band¬ 
width  at  9.7  GHz  [12].  The  input  side  patches  (shown 
in  black)  have  variable  delay  lines  across  the  array,  al¬ 
lowing  for  a  focal-point  near-field  feed.  This  enables  ef¬ 
ficient  input  feeding  as  well  as  simple  transmitter  design 
with  the  added  capability  of  beam-steering  and  beam¬ 
forming. 

In  a  mobile  environment,  where  signal-to-noise  ratios 
can  fluctuate  wildly,  dynamic  power  control  and  adap¬ 
tive  beam-steering  techniques  can  be  used  to  maintain  a 
low  bit  error  rate  and  reduced  sensitivity  to  multipath 
and  shadow-fading.  Beam-steering  and  beam-forming 
have  been  demonstrated  with  quasi-optical  lens  ampli¬ 
fiers  [12,13].  The  lens  amplifier  in  Fig.  3  is  fed  by  two 
identical  grid  oscillator  sources  positioned  along  a  fo¬ 
cal  arc  of  the  amplifying  lens.  When  the  two  sources 
are  turned  on  simultaneously,  the  fields  are  linearly  su¬ 
perimposed  at  the  amplifier  output  to  produce  a  pat¬ 
tern  with  two  distinct  beams  due  to  the  two  respec¬ 
tive  sources  [13].  However,  the  two  sources  can  also  be 
switched  on  and  off  one  at  a  time  to  produce  a  fixed 
number  of  discrete  switchable  beams  for  a  steering  sys¬ 
tem.  We  have  measured  a  5-kHz  steering  speed,  limited 
by  the  settling  time  of  the  oscillator  feeds. 

High-efficiency  quasi-optical  amplifiers  increase  bat- 
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Figure  5:  Layout  of  a  slot  antenna  receiving  low  noise 
Figure  3:  A  quasi-optical  planar  lens  amplifier  fed  by  lens  amplifier.  The  amplifiers  in  the  array  are  two-stage 
two  identical  grid  oscillator  sources  positioned  along  a  PHEMT  CPW  circuits  [13], 
focal  arc  of  the  amplifying  lens  [13]. 


Figure  4:  Layout  of  a  plane-wave-fed  slot  antenna  power 
amplifier.  Unfilled  rectangles  show  the  positions  of  the 
slot  antennas  on  the  back  of  the  substrate  [27]. 


tery  lifetime,  and  reduce  overall  size  and  weight  due  to 
lower  heat-sinking  requirements.  By  increasing  the  ef¬ 
ficiency  from  50%  to  90%,  the  dissipated  heat  power 
is  reduced  by  a  factor  of  nine  for  the  same  amplifier 
output  power.  For  two-dimensional  amplifier  antenna 
arrays,  low  heat  dissipation  is  especially  crucial,  since 
the  heat  flow  is  lateral.  A  high-efflciency  power  ampli¬ 
fier  array  using  anti-resonant  slot  antennas,  shown  in 
Fig.  4,  exhibits  10  dB  gain,  2.5  W  of  power  with  65% 
power  added  efficiency  at  5  GHz  from  only  4  MESFETs 
[27],  This  concept  can  be  extended  to  millimeter- wave 
frequencies,  as  was  demonstrated  in  [28]  for  a  Ka-band 
amplifier. 

To  demonstrate  a  receiving  amplifier,  an  X-band 
low-noise  PHEMT  lens  amplifier  array  using  antireso¬ 
nant  slot  antennas  and  two-stage  low-noise  CPW  am¬ 
plifiers  has  recently  been  reported  [13].  The  amplifier  is 
shown  in  Fig.  5  and  demonstrates  13  dB  absolute  power 
gain,  a  minimum  1.7  dB  noise  figure,  and  11%  3-dB 
bandwidth  with  an  average  noise  figure  below  2.3  dB 
in  this  bandwidth,  shown  in  Fig.  6.  The  high  gain  and 
low  noise  figure  demonstrated  by  this  amplifier  are  im¬ 
portant  for  maintaining  a  high  SNR,  and  thus  a  low  bit 
error  rate.  The  demonstrated  1.1-GHz  bandwidth  at 
10  GHz  demonstrates  the  applicability  of  quasi-optical 
amplifiers  to  broadband  data  communications. 
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3.  DISCUSSION 


Figure  6:  Measured  gain  and  noise  figure  of  the  LNA 
array. 
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Figure  7:  Block  diagram  of  a  quasi-optical  transceiver. 
The  transmitted  signal  is  shown  in  solid-line  arrows,  and 
the  receiving  signal  path  in  dashed-line. 


The  measured  properties  of  basic  components  which 
make  up  a  quasi-optical  T/R  module  were  discussed 
above.  A  block  diagram  of  a  posssible  quasi-optical 
transceiver  is  shown  in  Fig.  7.  It  consists  of  a  grid  os¬ 
cillator,  such  as  the  one  shown  in  Fig.  1,  which  acts 
as  a  solid-state  power-combining  source  that  generates 
the  carrier  frequency  for  the  transmitter  and  also  serves 
as  the  local  oscillator  and  self-oscillating  mixer  for  the 
receiver.  A  planar  array  of  T/R  active  antennas,  such 
as  the  ones  from  Figures  3,  4,  or  5,  performs  the  power 
amplification  for  the  transmit  and  the  low-noise  amplifi¬ 
cation  for  the  receive.  Both  the  input  and  output  waves 
are  focused.  Locating  the  source/mixer  at  the  focal 
point  of  the  amplifier  reduces  diffraction  loss  and  thus 
increases  system  efficiency.  The  close  proximity  of  the 
oscillator  and  amplifier  also  allows  compact  transceiver 
design.  Reliability  is  improved  since  the  degradation 
is  graceful.  This  approach  is  modular,  since  individual 
grids  or  arrays  serving  different  functions  can  be  cas¬ 
caded  into  systems. 

Modulation  can  be  achieved  either  through  the  gate 
bias  of  the  oscillator  or  amplifier,  or  by  cascading  an 
external  modulator  with  the  amplifier.  In  [4],  a  quasi- 
optical  VCO  consisting  of  an  array  of  transistor  oscil¬ 
lators  stacked  with  an  array  of  varactor  diodes  demon¬ 
strated  good  frequency  modulation  characteristics  in  a 
10%  modulation  bandwidth  with  low  associated  para¬ 
sitic  AM.  However,  since  most  modern  communications 
systems  are  digital,  it  is  important  to  have  digital  mod¬ 
ulators.  A  quasi-optical  digital  phase  modulator  can 
be  designed  based  on  3  antenna  arrays  connected  with 
transistor  SPDT  switches,  allowing  2  bits  of  phase  shift 
in  a  single  array.  The  4  bits  required  for  QPSK  can  be 
obtained  with  two  cascaded  arrays.  We  have  demon¬ 
strated  a  unit  cell  of  the  BPSK  modulator  with  phase 
shifts  of  0°  and  150°  with  no  associated  transmission 
loss,  as  the  switches  have  some  gain  to  overcome  the 
losses. 

In  order  for  the  quasi-optical  amplifier  to  function 
as  a  T/R  module,  bidirectional  amplifier  elements  need 
to  be  used,  with  transistors  placed  in  opposite  direc¬ 
tions  to  amplify  the  outgoing  and  incoming  waves  in¬ 
dependently.  The  transmit  amplifier  is  impedance- 
matched  for  maximum  power,  while  the  receive  ampli¬ 
fier  is  matched  for  low  noise.  (An  alternative  to  using 
two  amplifiers  in  each  element  is  the  grid  approach  [9], 
but  in  this  ceise  the  amplifier  needs  to  be  simultaneously 
designed  for  high  power/gain  and  low  noise.) 

The  frequency-dependence  of  the  antenna  loads  is 
included  in  the  design.  The  choice  of  the  antenna  is  an 
important  consideration.  Amplifier  arrays  using  folded 
slots  have  demonstrated  8%  bandwidth  [29],  while  those 
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using  a  combination  of  loaded  folded  slots  on  the  input 
and  anti-resonant  slots  at  the  output  have  demonstrated 
11%  bandwidth  [13].  The  second-resonance  slot  itself 
has  a  convenient  input  impedance  of  25  to  100  Q  and  a 
20%  bandwidth  (VSWR=2). 

Antenna  diversity  is  an  important  design  option  for 
wireless  communications  systems  suffering  from  multi- 
path  effects.  For  example,  in  a  recent  article  [30],  it  is 
shown  theoretically  that  a  significant  increase  in  system 
capacity  can  be  achieved  by  using  spatial  diversity  and 
optimal  combining.  It  has  also  been  proven  theoreti¬ 
cally  [31]  that  the  power  contained  in  the  electric  and 
magnetic  field  components  are  statistically  independent 
after  propagating  through  a  multipath  channel.  An  an¬ 
tenna  which  responds  independently  to  the  vector  com¬ 
ponents  of  both  the  electric  and  the  magnetic  field  can 
significantly  reduce  multipath  effects  [32].  The  wten- 
nas  themselves  can  therefore  be  designed  to  improve  the 
overall  system  and  effectively  change  the  model  of  the 
multipath  fading  channel.  As  quasi-optical  components 
are  essentially  active  antennas,  they  are  extremely  suit¬ 
able  for  spatial,  frequency,  polarization  and  field  diver¬ 
sity  architectures. 

An  important  issue  in  local  oscillator  applications  is 
the  phase  noise.  The  overall  noise  of  a  grid  oscillator 
should  be  less  than  the  noise  contribution  of  an  indi¬ 
vidual  device,  since  the  grid  is  self-injection-locked.  In 
addition,  since  the  phase  noise  of  the  devices  are  uncor¬ 
related,  the  individual  noise  contribution  of  each  device 
in  the  array  should  average  out.  Experimental  verifi¬ 
cation  for  the  single-sideband  noise  reduction  in  a  100- 
element  grid  was  presented  in  [5]. 

In  the  two-level  power  combining  approach  shown  in 
Fig.  7,  advantages  of  both  oscillator  and  amplifier  com¬ 
biners  are  used:  the  grid  oscillator  source  has  improved 
noise  characteristics,  while  the  amplifier  array  provides 
large  dynamic  range.  For  reception,  the  noise  floor  of 
the  entire  array  is  equal  to  the  noise  floor  of  each  of  the 
amplifier  elements,  provided  the  amplifier  noises  are  un¬ 
correlated,  On  the  other  hand,  the  input  power  of  the 
array  can  be  N  times  larger  than  that  of  a  single  element, 
as  the  power  gets  distributed.  This  means  that  for  a  10  x 
10  array,  the  dynamic  range  is  increased  by  20  dB.  In  ad¬ 
dition,  focusing  on  the  receiver  end  with  an  LNA  array 
allows  for  beam-forming  in  reception  by  placing  mixers 
at  different  points  along  the  focal  surface.  For  example, 
with  one  mixer  positioned  at  boresight,  and  two  others 
placed  at  ±45®,  one  could  receive  sum  and  difference 
patterns.  The  reciprocal  beam-forming  in  transmission 
was  already  demonstrated  with  the  array  from  Fig.  3. 
This  allows  adaptive  processing  at  the  receiver  analog 
front  end,  increasing  the  dynamic  range  of  the  system. 
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ABSTRACT 

Single  stage  and  double  stage  quasi-optical 
amplifiers  are  presented.  The  amplifiers  are 
constructed  on  double  layer  back  to  back 
microstrip  circuits  with  a  shared  ground  plane. 
The  ground  plane  provides  an  effective  isolation 
between  the  receiving  and  the  transmitting 
antenna  arrays.  Furthermore  it  serves  as  a  heat 
sink  in  high  power  amplifier  designs.  The  single 
stage  quasi-optical  amplifier  had  a  gain  of  7.54  dB 
at  10.04  GHz  and  the  two  stage  quasi-optical 
amplifier  had  a  gain  of  18.0  dB  at  9.95  GHz. 


1.  INTRODUCTION 

The  increasing  demand  for  millimeter-wave 
communication  and  radar  systems  has  lead  to  the 
need  for  solid  state  amplifiers  working  at  these 
frequencies.  However,  in  order  to  efficiently 
amplify  electromagnetic  waves  at  these 
frequencies,  conventional  amplifier  circuits  must 
be  modified  in  order  to  reduce  the  losses  in  the 
guiding  structures.  This  can  be  achieved  by 
coupling  the  waves  into  and  out  of  the  amplifier 
through  free  space  using  radiating  elements  rather 
than  by  conventional  waveguide  coupling.  This 
approach  has  lead  to  increased  interest  in  the  area 
of  quasi  optical  power  amplification  [1-6].  .This 
paper  reports  on  microstrip  quasi-optical 
amplifiers  based  on  two  types  of  coupling.  These 
amplifiers  were  constructed  on  double  layer  back 
to  back  microstrip  circuits  with  a  shared  ground 
plane.  The  continuous  ground  plane  provides  a 
very  effective  input-output  isolation.  The  ground 
plane  also  serves  as  a  heat  sink.  Furthermore, 
these  circuits  can  be  readily  incorporated  into 
multistage  quasi-optical  amplifiers. 


2.  THEORY  AND  DESIGN 

2.1.  A  Single  Stage  Double  Layer  Quasi-Optical 
Amplifier  Incorporating  Microstrip-Via  Hole- 
Microstrip  Interconnects 

The  first  circuit  to  be  reported  is  a  single  stage 
quasi-optical  amplifier  based  on  a  via-hole- 
coupled  two  layer  microstrip  structure.  Figs,  la 
and  lb  show  the  layout  of  the  amplifier 
incorporating  nine  HEMTs  in  a  3x3  array 
configuration.  The  input  and  output  layers  are 
placed  back  to  back  (with  a  shared  ground  plane) 
and  the  signal  is  coupled  from  one  layer  to  the 
other  through  via-holes  forming  a  three 
dimensional  interconnect  system.  In  order  to 
design  the  input  and  output  matching  circuits,  the 
simultaneous  conjugate  matched  admittances  are 
first  determined.  Open  circuited  stubs  are  used  to 
provide  simultaneous  conjugate  matched 
susceptances.  These  stubs  effectively  resonate  the 
device  admittances,  hence  the  admittances  seen 
looking  into  the  input  and  output  ports  are  purely 
real.  These  conductances  are  then  matched  to  the 
input  and  output  microstrip  antennas  using  quarter 
wave  transformers.  Thus  the  simultaneous 
conjugate  match  conditions  are  fulfilled.  The 
quasi-optical  amplifier  was  fabricated  on  31  mil 

thick  RT  -  Duroid™  with  er  =  2.3.  The  active 
device  used  is  a  Fujitsu  FHX06LG  HEMT  biased 
at  Yds  =  2  V,  and  Ids  =  10  tnA.  The  dimensions 
of  the  microstrip  antennas  used  are  1.17  cm  by 
0.91  cm.  The  patch  antennas  were  cross  polarized 
to  facilitate  the  gain  measurements.  The  circuit 
was  biased  using  high  impedance  lines  connected 
to  the  center  of  the  non  radiating  edges  of 
microstrip  antennas  as  shown  in  Fig.  1.  The  inter 

element  spacing  was  chosen  to  be  0.67  x  0.75  X,  in 
order  to  obtain  the  optimum  directivity  from  the 
antenna  arrays.  The  far  field  measurement 
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procedure  reported  in  [6]  was  used  to  determine 
the  quasi-optical  amplifier  gain.  The  amplifier's 
gain  was  measured  to  be  G=  7.54  dB  at  10.04 
GHz  with  a  bandwidth  of  0.4  GHz.  Fig.  2  shows 
the  gain  versus  frequency  plot  for  the  3  x  3  quasi 
optical  amplifier.  This  calculation  takes  into 
account  the  variation  of  the  array  gain  with 
frequency.  Fig.  3  shows  a  close  match  between 
the  theoretical  and  experimental  H-plane  patterns. 
The  E-plane  pattern  was  also  measured  and  good 
agreement  with  the  theoretically  predicted  pattern 
was  observed. 

2.2.  A  Two  Stage  Double  Layer  Quasi-Optical 

Amplifier  Incorporating  Microstrip-Slot  Line- 
Microstrip  Interconnects 

Here  a  two  stage  quasi-optical  amplifier 
employing  microstrip  line  -  slot  line  -  microstrip 
line  interconnects  is  presented.  The  two  layers  are 
placed  back  to  back  with  a  shared  ground  plane. 
Fig.  4  shows  a  perspective  view  of  the  quasi- 
optical  amplifier.  The  coupling  between  the  two 
stages  is  accomplished  through  microstrip  to  slot 
transitions,  therefore  there  is  no  electrical  contact 
from  one  layer  to  another.  This  facilitates  the 
monolithic  fabrication  of  such  amplifiers. 
Furthermore  this  provides  DC  isolation  between 
the  two  amplifier  stages.  The  common  ground 
plane  provides  a  very  effective  isolation  between 
the  receiving  antenna  array  and  the  transmitting 
antenna  array  and  it  serves  as  a  heat  sink  in  high 
power  amplifier  designs.  Separating  the  two 
stages  may  also  play  an  important  factor  when  the 
excess  heat  density  is  of  concern.  This  type  of 
structure  allows  the  circuit  designer  to  reduce  the 
device  density  by  spreading  the  devices  between 
the  two  layers,  thereby  somewhat  reducing  the 
thermal  power  density.  Therefore,  this  structure 
has  the  potential  to  address  the  heat  flow,  the 
isolation  between  input  and  output  ports  and 
MMIC  compatibility  problems  simultaneously. 
The  same  active  devices  and  substrate  materials 
which  were  used  to  construct  the  amplifier 
described  in  section  2. 1  were  also  employed  here. 
The  design  procedure  follows  the  same  sequence, 
except  that  the  output  of  the  first  stage  and  the 
input  of  the  second  stage  are  matched  to  the  real 
impedance  of  the  slot  line.  Fig.  5  shows  the 
measured  gain  of  a  3x3  two  stage  amplifier  versus 
frequency.  There  are  a  total  of  18  hEMTs  used  in 
this  amplifier.  Maximum  gain  of  18  dB  was 
obtaineu  at  9.95  GHz.  The  calculated  and 


measured  E  plane  patterns  of  the  3x3  amplifier 
array  are  compared  in  Fig.  6. 

3.  CONCLUSION 

Two  quasi-optical  microstrip  circuits  based  on 
different  multilayer  coupled  structures  have  been 
presented.  The  via  hole  coupled  single  stage 
amplifier  had  a  gain  of  7.54  dB  at  10.04  GHz. 
The  slot  line  coupled  two  stage  quasi-optical 
amplifier  had  a  gain  of  18.0  dB  at  9.95  GHz.  Both 
structures  address  isolation  and  heat-sinking  for 
MMIC  implementation  of  these  amplifiers  at  mm- 
wave  frequencies. 
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Fig.l  Single  stage  double  layer  quasi-optical  amplifier  layout  (a)  Input  side,  (b)  Output  side 
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Fig.  2  Gain  versus  frequency  responce  of  Fig.  3  Measured  and  calculated  H-pl^e 

the  single  stage  quasi-optical  amplifier  patterns  of  the  single  stage  quasi-optical 
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Fig.  5.  Gain  versus  frequency  plot  of  Fig.  6  Measured  and  calculated  E-plane 

a  3x3  two  stage  amplifier  array.  patterns  of  a  3x3  two  stage  amplifier  array. 
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ABSTRACT 

Recent  work  in  active  quasi-optical  coupled 
arrays  have  shown  this  technology  to  be  a  fea¬ 
sible  and  promising  method  for  achieving  ef¬ 
ficient  and  high-power  solid  state  power  com¬ 
bining.  Quasi-optical  techniques  should  find 
greatest  use  at  millimeter-wave  frequencies 
where  waveguide  circuits  are  lossy  and  diffi¬ 
cult  to  build.  In  this  paper,  we  examine  some 
of  the  advantages  of  this  technology  and  re¬ 
view  the  recent  developments  in  active  grid 
arrays.  Because  of  their  versatility,  special 
emphasis  is  given  to  transistor  grids  and  their 
use  as  oscillators  and  amplifiers. 


I.  INTRODUCTION 

In  recent  years,  engineers  and  scientists  have  in¬ 
tensified  their  efforts  to  find  efficient  and  reliable 
methods  for  generating  power  at  millimeter  and 
submillimeter  wavelengths.  This  work  has  been 
motivated  largely  by  the  wide  variety  of  mili¬ 
tary,  industrial,  and  research  applications  of  this 
part  of  the  electromagnetic  spectrum.  In  spite  of 
these  efforts,  the  development  and  implementa¬ 
tion  of  systems  operating  in  the  millimeter  wave 
region  has  been  slow  and  costly.  This  is  due,  in 
part,  to  the  difficulties  associated  with  fabricat¬ 
ing  and  integrating  circuit  components  that  can 
operate  at  these  wavelengths.  Perhaps  the  most 
serious  obstacle  to  the  widespread  development 
and  use  of  millimeter  and  submillimeter  wave 
systems,  however,  has  been  the  lack  of  powerful 
and  reliable  solid  state  sources. 

Although  millimeter-wave  semiconductor  device 
technology  has  reached  a  high  level  of  maturity 
over  the  past  decade,  vacuum  tube  sources  re¬ 
main  the  state-of-the-art  with  regard  to  power 
output  and  efficiency.  Consequently,  much  ef¬ 
fort  has  focussed  on  developing  power  combining 
techniques  at  millimeter-wave  frequencies.  Un¬ 
fortunately,  the  traditional  methods  based  on 


waveguide  cavities  and  “corporate”  combiners 
suffer  from  increasing  losses,  decreasing  com¬ 
bining  efficiencies,  and  stricter  tolerances  when 
scaled  to  millimeter-wave  frequencies.  In  great 
part,  these  problems  arise  from  the  application 
of  standard  RF  and  microwave  engineering  prin¬ 
ciples  to  a  region  where  optical  techniques  may 
be  more  appropriate.  As  a  result,  researchers 
have  begun  to  investigate  the  possibility  of  com¬ 
bining  the  outputs  of  large  numbers  of  devices 
using  quasi-optical  techniques. 

A  number  of  distinct  approaches  to  quasi-optical 
power  combining  have  been  demonstrated  over 
the  past  several  years  [1].  Many  of  these  are 
similar  to  classical  spatial  power  combiners  in 
which  active  devices,  along  with  the  necessary 
matching  circuits,  are  integrated  into  an  array 
of  planar  antennas  (such  as  microstrip  patches). 
The  elements  of  these  arrays  typically  interact 
through  guided-wave  coupling,  weak  radiative 
coupling,  or  are  synchronized  to  an  external  sig¬ 
nal.  A  different  approach,  which  is  more  analo¬ 
gous  to  the  laser,  is  the  grid  oscillator  or  ampli- 


Active  Grid 


Figure  1.  Diagram  of  a  typical  grid  oscillator.  A 
grid  of  active  devices  is  placed  in  an  open  res¬ 
onator 

The  essential  difference  between  grid  oscillators 
and  amplifiers  and  the  spatial  power  combiners 
described  above  is  that  the  device  spacing  is  a 
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small  fraction  of  a  wavelength.  These  grids, 
as  a  result,  approximate  a  continuous,  two 
dimensional  gain  medium  in  which  the  active 
devices  are  strongly  coupled.  An  advantage 
of  this  approach  is  it’s  amenability  to  modern 
IC  fabrication  technology  and  the  relatively 
high  device  packing  density.  From  a  circuit 
point-of-view,  a  grid  oscillator  can  be  under¬ 
stood  as  a  frequency-selective  surface  that 
contains  active  devices  and  presents  a  nega¬ 
tive  resistance  to  an  incident  electromagnetic 
wave.  An  oscillator  is  realized  by  placing  the 
grid  in  a  resonant  cavity  to  provide  positive 
feedback  (see  Figure  1). 

II.  CIRCUIT  MODELLING 

To  understand  the  behavior  of  a  quasi-optical 
array,  it  is  necessary  to  determine  the  embed¬ 
ding  impedances  presented  to  devices  placed 
in  the  grid.  For  finite  grids  of  arbitrary  ge¬ 
ometry,  this  is  a  formidable  task.  Complica¬ 
tions  arise  from  the  unknown  behavior  of  the 
fields  at  the  grid’s  edges.  In  addition,  each 
device  in  the  array  couples  to  all  other  de¬ 
vices  through  the  radiated  fields.  Even  grids 
of  moderate  size  can  generate  a  large  number 
of  unknowns,  making  analysis  very  difficult. 

The  problem  is  made  more  tractable  by  as¬ 
suming  the  array  is  infinite  in  extent  and  is 
illuminated  by  a  uniform  plane  wave.  With 
these  assumptions,  the  symmetry  of  the  ar¬ 
ray  can  be  exploited  to  define  an  equivalent 
waveguide  representing  each  unit  cell  of  the 
grid  (Figure  2).  The  boundary  conditions  on 
the  equivalent  waveguide  are  imposed  by  the 
grid  symmetry  and  account  for  the  interac¬ 
tion  of  all  the  devices  in  the  array.  In  effect, 
the  entire  grid  is  reduced  to  a  single  wave¬ 
guide  circuit  that  can  be  modelled  using  any 
number  of  standard  analysis  techniques. 


Figure  2.  Illustration  of  a  symmetric  planar 
grid  showing  the  equivalent  waveguide.  The 
waveguide  has  electric  walls  on  the  top  and 
bottom  and  magnetic  walls  on  the  sides. 


A  simple  technique  for  finding  the  embed¬ 
ding  impedance  for  a  grid  unit  cell  is  the  in¬ 
duced  EMF  method.  This  method  relies  on 
an  assumed  current  distribution  from  which 
the  radiated  fields  are  found.  The  driving- 
point  impedance,  Z,  seen  by  a  device  in  the 
unit  cell  is  then  found  by  applying  Poynting’s 
Theorem  or  reciprocity: 

Z  =  -W  Iv  E(r)-J(r)<iV 

Because  each  device  in  the  grid  effectively 
sits  in  a  TEM  waveguide,  the  radiated  field 
can  be  decomposed  into  a  set  of  TEM,  TM, 
and  TE  modes.  An  equivalent  circuit  model 
representing  the  grid  embedding  impedance 
is  obtained  by  associating  lumped  elements 
with  each  waveguide  mode  that  is  excited 
(Figure  3).  Including  the  device  model  with 
the  grid  equivalent  circuit  allows  the  behav¬ 
ior  of  the  grid  as  an  oscillator  or  amplifier  to 
be  predicted. 


L 


Mkror 


Figure  3.  An  equivalent  circuit  model  for  a 
planar  grid.  Propagating  modes  are  repre¬ 
sented  with  transmission  lines  and  evanescent 
modes  with  reactive  lumped  components. 

III.  GRID  OSCILLATORS 

Although  a  variety  of  signal  processing  func¬ 
tions  can  be  realized  with  active  quasi-optical 
arrays,  a  primary  motivation  remains  the  re¬ 
alization  of  reliable,  high-power  solid  state 
sources.  Consequently,  much  work  has  fo¬ 
cussed  on  the  development  of  grid  oscillators 
for  high-frequency  power  combining.  In  prin¬ 
ciple,  any  active  device  can  be  incorporated 
into  a  grid,  including  transferred  electron  de¬ 
vices  and  tunnel  diodes.  However,  low  DC- 
to-RF  conversion  efficiencies  and  the  difficul¬ 
ties  associated  with  stabilizing  two-terminal 
active  devices  for  coherent  power  combining 
has  led  most  researchers  to  focus  on  planar 
transistor  grids. 

Most  planar  transistor  grid  oscillators  that 
have  been  studied  to  date  are  based  on  two 
basic  configurations.  The  first,  demonstrated 
by  Popovic  et  al  in  1991,  used  packaged 
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MESFET’s  soldered  to  a  grid  of  vertical  and 
horizontal  leads  (Figure  4)  [2],  The  device 
package  restricted  the  vertical  leads  to  be  the 
drain  and  gate,  with  the  source  connected  to 
the  horizontal  (or  common)  lead.  As  a  result, 
this  configuration  is  known  as  a  “common- 
source”  grid  oscillator.  The  grid  was  fabri¬ 
cated  on  a  Duroid  substrate  and  placed  in 
front  of  a  planar  mirror  to  synchronize  the 
devices  and  restrict  radiation  to  the  forward 
direction.  When  biased,  the  array  produced  a 
single  frequency  output  at  5  GHz  which  could 
be  tuned  over  a  10%  bandwidth  with  the  mir¬ 
ror  position.  The  grid  antenna  pattern  had 
a  directivity  of  16  dB  and  radiated  600  mW 
with  a  DC-to-RF  efficiency  of  20%. 


(a)  (b) 

Figure  4.  (a)  Unit  cell  for  a  common-source 
grid  which  uses  packaged  FET’s.  (b)  Unit  cell 
for  a  common-gate  grid  oscillator  with  chip 
FET’s.  The  gate  lead  is  capacitively  coupled 
to  the  incident  electric  field. 

One  potential  disadvantage  of  the  common- 
source  grid  is  the  gate  lead  is  parallel  (and 
strongly  coupled)  to  the  radiated  field.  As 
a  result,  the  array  may  tend  to  oscillate  at 
lower  frequencies  where  the  devices  have  high 
gain. 

In  the  “common-gate”  configuration,  shown 
in  Figure  4,  the  FET’s  are  oriented  so  the 
drain  and  source  leads  radiate  [3].  The  gate  is 
coupled  to  the  radiated  field  through  a  series 
resonant  circuit  (Figure  3).  The  resonant  fre¬ 
quency  of  this  coupling  circuit  is  the  primary 
factor  determining  the  oscillation  frequency 
of  the  grid.  Grid  oscillator  designs  based 
on  the  common-gate  configuration  have  been 
demonstrated  operating  over  a  wide  range  of 
frequencies  and  with  relatively  high  power 
outputs.  Initial  work  on  common-gate  grids 
was  a  proof-of-principle  investigation  using 
16  chip  MESFET’s  that  produced  335  mW  of 
power  at  11.6  GHz  with  an  efficiency  oF20%. 
A  scaled  version  of  this  array  using  36  devices 
produced  235  mW  at  17  GHz  with  7% 


efficiency.  The  first  demonstrated  monolithic 
millimeter-wave  grid  oscillator  was  a  “common- 
base”  heterojunction  bipolar  transistor  (HBT) 
array  operating  at  35  GHz  and  producing  an  ef¬ 
fective  radiated  power  of  170  mW  [4].  Hacker  et 
aLj  in  1992,  built  a  hybrid  100-element  common- 
gate  grid  oscillator  using  Fujitsu  medium  power 
MESFET’s  (FLK052XP)[5J.  This  grid  was  op¬ 
timized  for  power  output  and  produced  10.3  W 
of  radiated  power  at  9.8  GHz  with  a  DC-to-RF 
efiiciency  of  23%.  Thus  far,  this  is  the  highest 
power  produced  from  a  transistor  grid  oscillator. 

Although  the  central  goal  of  quasi-optical  power 
combiners  is  high  power  and  efficiency,  other 
performance  factors  such  as  oscillator  stability 
and  noise,  tuning  bandwidth,  and  phase  con¬ 
trol  are  also  important.  It  is  expected  (and  has 
been  verified  experimentally)  that  the  noise  per¬ 
formance  of  grid  oscillator  arrays  improves  with 
an  increasing  number  of  devices.  This  results 
because  - the  individual  elements  are  mutually 
locked  to  a  single  frequency  whereas  the  noise 
produced  by  each  of  these  elements  in  uncorre¬ 
lated.  Locking  the  grid  to  an  external,  stable 
source  can  further  reduce  the  phase  noise  of  the 
oscillator. 


Because  a  grid  oscillator,  in  essence,  is  an  array 
of  active  devices  embedded  in  a  frequency  selec¬ 
tive  surface,  the  tuning  bandwidth  tends  to  be 
quite  limited  (typically  less  than  10%).  Mader 
et  alf  have  shown  that  the  operating  bandwidth 
of  a  common-source  array  can  be  increased  us¬ 
ing  an  array  of  varactor  diodes  as  a  tuning  ele¬ 
ment  [6].  Oak  demonstrated  an  improvement  in 
the  tuning  bandwidth  of  a  common-gate  grid  by 
integrating  varactors  directly  into  the  gate  lead 
of  the  array  (Figure  5)  [7].  In  this  case,  the  var¬ 
actor  was  used  to  control  the  resonant  frequency 
of  the  gate  feedback  circuit.  Similar  approaches 
to  grid  oscillator  design  may  also  be  useful  for 
beam  steering  applications  or  adjusting  the  im¬ 
pedance  and  frequency  of  individual  unit  cells 
to  improve  power  and  noise  performance. 


Figure  5.  Diagram  of  a  varactor  tuned  common- 
gate  grid  oscillator.  Packaged  varactors  placed  in 
the  gate  bias  line  tune  the  embedding  impedance 
and  frequency. 
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IV.  GRID  AMPLIFIERS 

Much  of  the  recent  work  on  quasi-optical  arrays 
has  focussed  on  the  development  of  amplifier 
grids  as  well  as  oscillators.  Grid  amplifiers,  in 
essence,  are  active  planar  structures  that  accept 
an  incident  beam  and  re-radiate  an  amplified 
version.  They  offer  the  same  advantages  as  grid 
oscillators:  increased  dynamic  range,  elimina¬ 
tion  of  transmission  line  and  waveguide  losses, 
and  robustness  to  device  failure.  Amplifiers, 
however,  provide  greater  flexibility;  with  proper 
feedback  they  may  be  used  as  oscillators. 

Kim  et  aL,  demonstrated  the  first  grid  amplifier 
using  50  packaged  low-noise  FET’s  mounted  on 
a  crossed-dipole  array  [8].  The  FET’s  are  con¬ 
nected  as  differential  pairs  with  the  gate  leads 
parallel  with  the  polarization  of  the  incident  sig¬ 
nal  and  the  drain  and  source  leads  parallel  with 
the  output  signal  (Figure  6).  Bias  is  fed  to  the 
devices 


- a - H 

(a)  (b) 

Figure  6.  (a)  A  unit  cell  of  the  C-Band  amplifier 
grid,  (b)  Bias  lines  run  across  the  back  of  the 
substrate  and  feed  the  FET’s  with  via  holes. 

through  via  holes  from  the  back  of  the  substrate. 
An  advantage  of  this  design  is  that  the  input 
and  output  of  the  amplifier  have  orthogonal  po¬ 
larizations.  This  feature  permits  high  isolation 
between  the  input  and  output  and  is  advanta¬ 
geous  for  stabilizing  the  amplifier.  Elimination 
of  spurious  oscillations  is  perhaps  the  most  diffi¬ 
cult  task  in  designing  a  grid  amplifier.  This  ini¬ 
tial  proof-of-principle  investigation  gave  a  peak 
gain  of  11  dB  at  3.3  GHz  and  a  3dB  bandwidth 
of  90  MHz.  Polarizers  were  used  to  provide  sep¬ 
arate  input  and  output  matching. 

More  recently,  Kim  demonstrated  a  100-element 
grid  amplifier  using  differential  pair  HBT  chips 
manufactured  by  Rockwell  [9].  An  on-chip  re¬ 
sistive  bias  network  was  provided  for  the  bases 
and  allowed  the  elimination  of  the  base  bias  line. 
This  array  produced  a  peak  gain  of  10  dB  at 
10  GHz  and  a  3dB  bandwidth  of  1  GHz.  The 
noise  performance  and  reliability  of  this  grid  am¬ 
plifier  were  also  investigated.  It  was  found  that 


the  grid  noise  figure  was  7  to  8dB,  compara¬ 
ble  to  that  for  a  single-device  amplifier.  De¬ 
vice  failure  was  simulated  by  detuning  the 
matching  for  several  devices  in  the  grid.  The 
grid  amplifier  proved  quite  robust  with  re¬ 
spect  to  device  failure;  with  10%  of  the  device 
detuned,  the  gain  dropped  by  only  1%. 

V.  SUMMARY 

In  this  paper,  we  have  reviewed  some  of  the 
important  features  and  recent  developments 
in  quasi-optical  active  grids.  In  principle,  this 
approach  to  millimeter  wave  circuit  design 
should  offer  new  advantages  over  traditional 
design  techniques  and  be  straightforward  to 
incorporate  into  existing  semiconductor  fab¬ 
rication  processes.  Although  much  of  this 
initial  work  has  demonstrated  the  feasibility 
and  soundness  of  the  approach,  much  work 
remains  before  complete  systems  will  utilize 
this  technology.  More  accurate  design  models 
and  a  better  understanding  of  strongly  cou¬ 
pled  oscillators  are  needed.  In  addition,  the 
effect  of  high  dielectric  substrates  on  finite 
grids  at  millimeter-wave  frequencies  and  the 
thermal  properties  of  high-power  arrays  re¬ 
quire  study.  Nonetheless,  quasi-optical  grids 
remain  a  promising  and  exciting  concept  for 
future  high  power  solid  state  sources. 
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Introduction 

GaAs  layers  can  be  grown  epitaxially  over  a  wide  range  of  temperature  reaching  from  180°C  up  to  the  thermal  stability 
limit.  While  above  4(X)‘’C  growth  temperature  the  stoichiometry  is  preserved  within  the  ppm-range,  at  low  growth 
temperature  surplus  As  can  be  incorporated  into  the  GaAs  matrix  within  the  percentage  range.  The  material  grown  in  this 
lower  temperature  range  is  called  Low-Temperature-Grown  GaAs.  By  high  temperature  annealing  the  solubility  limit  will 
be  surpassed  and  most  of  the  excess  As  precipitates.  Thus,  the  material  can  have  a  variety  of  unusual  properties.  They 
have  stimulated  many  investigations,  also  into  related  materials,  resulting  in  more  than  300  publications  to  date. 

The  first  application  was  in  the  FET  buffer  layer  structure  virtually  eliminating  backgating  and  short  channel  effects. 
Using  LTG-GaAs,  the  stability  of  GaAs  FET  devices  has  been  improved  dramatically  and  the  Schottky  gate  breakdown 
limit  has  been  surpassed  for  the  first  time.  The  second  successful  replication  are  untrafast  THz  optoelectronic  switches 
and  detectors.  Here  the  advances  in  FET's  will  be  reviewed.  However,  the  impact  of  this  material  is  essentially  deeper. 
LTG-GaAs  can  serve  as  raw  model  for  a  new  class  of  materials:  off-stoicheometric  or  composite  semiconductors.  They 
offer  multifunction  properties  with  metallic,  magnetic,  optically  active  or  superconducting  nano-clusters. 

The  A.s-Grown  LTG-GaA.s  Material 

Most  of  the  excess  As-atoms  of  LTG-GaAs  will  be  found  on  interstitional  sites  widening  the  lattice  [1].  However,  a 
small  fraction  of  the  surplus  As  is  located  on  the  AsQa  position,  where  the  As  will  form  a  deep  multiple  donor  near  mid 
gap  similar  to  EL2  [2].  Since  the  concentration  of  surplus  As  is  a  function  of  growth  condition,  the  properties  of  LTG- 
GaAs  material  may  vary  substantially  and  the  lattice  mismatch  with  the  GaAs  substrate  may  be  used  as  a  prime  indicator 
when  comparing  materials. 

At  a  growth  temperature  of  200°C  to  250°C  an  Asoa  anti-site  density  between  lO^^cm'^  and  lO^^cm"^  is  observed  and 
wave  function  overlap  occurs  resulting  in  a  deep  donor  mini  band  and  hopping  conduction.  The  Fermi-level  is  piimed  at 
approx.  0.4eV  below  the  conduction  band  [3,4].  Labeled  o-LT-GaAs  [5],  it  is  the  only  semiconductor  with  an 
energetically  isolated  miniband  and  hopping  conduction.  These  two  features  control  the  electrical  characteristics.  What  are 
the  consequences? 

-  The  carrier  mobility  in  the  miniband  is  low.  Since  the  Fermi-level  is  pinned,  the  conduction  band  is  essentially 
unpopulated.  This  results  in  a  resistivity  between  lOkQcm  and  IMQcm  and  a  characteristic  frequency  of  dielectric 
relaxation  in  the  MHz-range.  The  activation  energy  for  the  hopping  process  is  approx.  70meV.  Therefore,  the  bias  and 
temperamre  dependences  are  weak,  and  a  linear  IV-characteristic  is  observed  up  to  high  fields  (see  fig.l). 

-  Due  to  the  high  deep  donor  density,  the  material  cannot  be  depleted  noticeably,  despite  its  high  resistivity.  2()nm  thin 
surface  layers  have  been  found  conductive;  non-alloyed  tunneling  contacts  can  be  formed  [4]  and  the  interface  potential  to 
bulk  GaAs  is  the  LTG-GaAs  Fermi-level  [5]. 

-  The  material  cannot  be  doped  in  case  the  deep  donor  anti-site  concentration  is  above  the  activation  limit  of  common 
donors  or  acceptors.  Doping  profiles  reaching  into  this  layer  are  suppressed. 

-  Equilibrium  for  injected  carriers  is  reached  within  fs  due  to  fast  trapping  by  the  AsGa-ai*ti-site  deep  donor  or  the  Voa' 
vacancy  deep  acceptor.  Thus,  the  material  is  optically  inactive  [6]. 

As  seen  from  fig.l,  between  100  kV/cm  and  200  kV/cm  breakdown  is  observed.  It  is  thought,  that  carriers  are  emitted 
into  the  conduction  band  by  field  emission  and  the  material  becomes  highly  conductive.  Fig. 2  shows  a  calculation  of  tbe 
density  of  conduction  band  electrons  as  function  of  applied  field  assuming  a  triangular  barrier,  confirming  a  transition  to 
the  hi^ly  conductive  state  in  the  correct  field  range  [M-  The  breakdown  strength  is  therefore  less  than  that  of  bulk  GaAs. 

Annealed  LTG  GaAs 

Subjecting  this  material  to  a  high  temperature  treatment,  like  for  subsequent  growth  of  active  layers,  the  solubility  limit 
for  the  excess  As  is  surpassed.  As  will  precipitate  into  clusters  and  the  lattice  mismatch  disappears  [8].  The  clusters  act 
as  metallic  inclusions  forming  internal  Schottky  barriers  [9].  Accordingly,  the  clusters  are  surrounded  by  spherical 
depletion  regions  in  a  material  with  substantially  reduced  anti-site  concentraticm.  If  the  cluster  density  is  high  enough  and 
the  anti-site  concentration  low  enough,  these  depletion  regions  may  overlap  forming  an  essentially  depleted  material  of 
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high  resistivity  in  the  order  of  10^  flcm.  The  conduction  through  this  material  is  rather  complex,  involving  tunneling 
through  the  internal  barriers  or  thermionic  emission  aaoss  them  and  hopping  conductivity  or  space  charge  limited  current 
between  the  barriers  [10].  Growth  and  annealing  conditions  can  be  tailored  to  combine  high  resistivity  and  breakdown 
field.  However,  the  breakdown  field  in  the  order  of  300kV/cm  is  still  within  the  limit  of  avelanche  breakdown  in  undoped 
GaAs.  But  in  contrast  to  bulk  GaAs,  injected  carriers  recombine  fast  and  the  Fermi-level  is  firmly  pinned  near  midgap. 
Therefore,  premature  injection  instabilities  are  suppressed.  And  indeed  LTG-GaAs  buffers  have  eliminated  side  gating  and 
cross  talk  in  digital  circuits  [1 1]  and  improved  breakdown  and  backgating  in  power  MESFET's  [6]. 

Diffusion  Barriers  and  Contacts 

To  be  implemented  into  device  structures  the  surplus  As  as  well  as  the  As-precipitates  need  to  be  confined  to  thin  layers. 
However,  the  first  experiments  indicated  an  outdiffusion  of  As  into  active  channels  within  the  first  lOOnm.  Thus,  AlAs 
[12,13],  InAlAs  [13]  and  hiGaP  [14]  have  been  deployed  as  diffusion  barriers  with  a  thickness  down  to  5nm.  Fig.3  shows 
a  TEM  cross  section  of  an  active  GaAs  channel  sandwiched  between  an  annealed  LTG-GaAs  buffer  layer  and  an 
unannealed  a-LT-GaAs  passivation  layer,  separated  by  a  top  and  bottom  AlAs  diffusion  barrier.  Also  shown  is  the  planar 
contact  to  this  system  by  through-alloying  of  AuGe  through  the  AlAs.  Other  techniques  used  are  overlapping  contacts 
[15]  or  selectively  re-grown  contacts  [16].  The  requisite  for  these  contacts  is  the  elimination  of  lateral  exposed  free  surface 
areas.  Since  the  free  surface  potential  will  differ  from  that  of  the  LTG-GaAs/GaAs  interface  potential,  these  areas  will 
determine  the  lateral  field  distribution  and  may  cause  severe  high  field  instabilities. 

Using  the  above  heterostructure  materials  systems,  not  only  active  layers  can  be  confined  but  also  layers  containing 
precipitates.  AlGaAs/GaAs  MQW's  show  for  example  precipitate  accumulation  in  the  GaAs  layers  [17].  But  doping  can 
also  be  used  to  stear  precipitation  [18].  This  strongly  indicates  an  influence  of  the  solubility  as  well  as  the  position  of  the 
Fermi-level  on  the  defect  formation. 

Lossy  Dielectric  Passivation  Concept 

Controling  the  electronic  surface  conditions  has  been  one  of  the  main  difficult  tasks  of  GaAs  technology  and  surface 
related  instabilites,  noise  and  breakdown  still  limit  device  performance.  Here  LTG-GaAs  passivation  layers  can  offer 
destinct  advantages.  The  LTG-GaAs/channel  interface  potential  is  well  defined  and  the  channel  can  be  totally  shielded  from 
the  surface.  MESFET  devices  passivated  by  such  layers  have  therefore  shown  reduced  gm-dispersion  with  drain  bias, 
reduced  1/f  noise  and  light  immunity  [19-21].  Used  as  a  lossy  dielectric,  a  o-LT-GaAs  passivation  layer  pins  the  lateral 
surface  potential  distribution  and  smoothens  out  lateral  high  field  regions.  In  essence,  the  donor  depletion  charge  in  the 
channel  can  now  be  vertically  compensated  by  surface  charge  provided  by  the  conductive  o-LT-GaAs  (see  fig.4).  This  has 
two  effects:  (1)  improving  the  Schottky  gate/drain  breakdown  conditions  and  (2)  extending  the  drift  region.  Indeed, 
extremely  high  drain  breakdown  voltages  are  observed  in  LTG-passivated  MESFET’s  [22];  and  an  extremely  small  feed 
back  capacitance  and  ouq)ut  conductance  are  observed  causing  extremely  high  finax  cut-off  frequencies  (see  fig.5). 

These  results  indicate  that,  by  careful  design  of  the  device  structure,  the  power  limit  of  GaAs  FETs  of  approx.  1.2W/nun 
(RF)  can  be  surpassed.  This  limit  is  well  characterized  by  the  lateral  spreading  model  [23],  and  states  that  due  to  lateral 
Schottky  barrier  breakdown  the  maximum  source/drain  voltage  is  inversely  linked  to  the  maximum  open  channel  current 
High  gate/drain  breakdown  voltages  can  thus  only  be  obtained  at  the  expense  of  a  low  open-channel  current. 

A  MISFET  can  overcome  this  Schottky-limit.  However,  GaAs-MISFET's  with  dielectrics  of  high  breakdown  strength 
like  Si3N4  or  Si02  suffer  from  severe  interface  state  charging  instabilities.  To  circumvent  this  problem  a  controlled 
interface  charging  current  needs  to  be  supplied  modifying  the  MISFET  concept  by  the  leaky  dielectric  concept  [24].  A 
conductivity  can  be  introduced  into  the  above  dielectric  materials,  if  they  are  made  Si-rich.  However,  the  IV-characteristics 
are  essentially  exponential  due  to  Pool-Frenkel  activation  and  the  concept  can  only  be  implemented  in  a  narrow  bias  range 
[25].  Here,  LTG-GaAs  offers  a  new  alternative  with  its  essentially  linear  IV-characteristics,  however  with  a  lower 
breakdown  field  than  the  channel  material.  On  the  other  hand,  the  lateral  leakage  will  enable  high  lateral  breakdown 
voltages  essentially  independent  of  open  channel  current  and  it  has  indeed  been  possible  to  surpass  the  Schottky  limit 
essentially  [15].  However,  an  inherent  property  of  the  lossy  dielectric  MISFET  is  a  frequency  dispersion  in  gm  associated 
with  the  characteristic  frequency  of  dielectric  relaxation  [26],  which  needs  to  be  minimized. 

Early  experiments  with  LTG-GaAs  MISFETs  indicated  an  RF-power  handling  capability  of  1.57  WAnm  at  1.1  GHz  [27], 
already  outside  the  Schottky-limit.  Recent  experiments  with  a-LT-GaAs  recessed  gate  MISETs  have  yielded  in  DC 
characteristics  as  shown  in  fig.  6.  Best  results  indicate  an  RF  power  density  of  3.8W/mm,  when  operated  below  the 
dispersion  frequency  at  IMHz.  However,  above  the  dispersion  frequency  (indicated  by  a  redution  in  gm  of  15%  between 
50MHz  and  500MHz),  the  potential  in  the  device  structure  will  redistribute  and  care  has  to  be  taken,  that  the  limiting 
breakdown  fields  in  the  structure  are  not  surpassed  degrading  the  maximum  power.  Since  detailed  models  are  not  yet 
available  for  this  new  device  structure,  the  conditions  have  to  be  identified  experimentally.  This  had  proved  difficult.  An 
analysis  of  the  gate/drain  region  at  2GHz  indicates  that  the  lossy  dielectric  passivation  concept  is  also  valid  above  the 
dielectric  relaxation  dispersion  frequeny,  since  breakdown  did  not  degrade  essentially,  hi  a  separate  experiment  560m/mm 
maximum  current  could  be  modulated  at  IGHz  by  the  MIS  diode.  Combining  both  results,  it  can  be  predicted  that 
properly  designed  LTG-GaAs  FETs  will  handle  RF  power  densities  twice  as  high  as  the  Schottky-limit. 
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LTG-InP  Devices 

InP  grown  at  low  temperatures  contains  a  surplus  of  P  and  also  a  high  Pin-anti-site  density.  However,  in  this  material 
system  the  first  exited  state  of  the  donor  is  in  the  conduction  band  (two  localized  P-ionization  states  are  in  the  band  gap) 
and  LTG-InP  will  be  highly  conductive  [28].  The  material  cannot  be  used  as  MISFET  insulator.  Instead,  active  n-type 
channels  can  be  grown  without  doping  at  low  temperature.  This  might  be  very  attractive  for  optoelectronic  applications. 
Here,  fully  processed  optoelectronic  device  structures  can  be  selectively  overgrown  with  FET  layers  and  the  electronic 
circuit  added  using  a  low  temperature  budget  for  processing  not  degrading  the  optoelectronic  components. 

Conclu.sion 

Low  temperature  grown  off-stoichiometric  materials  offer  many  unusual  properties.  They  may  be  defined  by  the  growth 
process  as  well  as  by  post  growth  annealing.  It  has  been  discussed,  how  LTG-GaAs  layers  can  offer  a  new  degree  of 
freedom  in  the  design  of  GaAs  FETs.  In  perspective,  the  results  shown  also  apply  to  related  materials  like  AlGaAs  and 
PMHEMT  devices.  Other  materials,  like  LTG-InP  will  offer  advantages  in  different  directions.  These  have  not  been 
discussed  in  the  open  literature  yet  and  concepts  have  still  to  be  developed.  The  precipitate  phase  has  up  to  now  been  used 
in  buffer  layer  structures  and  optoelectronic  switches.  However,  using  their  full  potential  means  to  modify  our  picture  of 
a  pure  semiconductor  towards  that  of  a  composite.  This  has  only  been  attempted  to  a  very  limited  extent.  Thus, 
applications  are  still  extensions  of  common  concepts.  However,  composites  offer  multi-function  properties  combining 
semiconductor  properties  with  magnetical,  optical  or  even  superconducting  properties  as  suggest^  for  example  in  the 
superconducting.SUPERFET  [29]. 
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Fig-3  F.ifcft 

T]^  cross  section  of  planar  FET  contact  region,  showing  through-  Equivalent  circuit  representation  of  LTG-GaAs 
alloyed  contact,  LTG-  GaAs-buffer,  AlAs  diffusion  barrier,  GaAs  passivated  drift  region  with  low  conductive  by¬ 
active  channel,  top  AlAs  diffusion  barrier  and  o-LT-GaAs  pass  on  the  surface  [32]. 
passivation  layer. 


fT  and  fmax  data  for  o-LT-GaAs  MISFET  with  DC  output  characteristics  of  o-LT-GaAs  MISFET  indicating 
Lg=l|jin  and  Vd=3.5V,  Vg=-0.3V.  3.3W/mm  power  capability;  Ig  =  l|xni,  w  =  100|4in 
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ABSTRACT 

In  this  paper,  recent  results  concerning  FETs  for  power 
applications  are  presented.  The  study  is  based  on  two 
kinds  of  simulation  which  make  possible  to  perform 
exhaustive  investigations  :  a  physical  two  dimensional 
hydrodynamic  energy  model  and  a  quasi  two 
dimensional  model.  The  study  of  the  potentialities  and 
limitations  of  HEMTs  for  applications  beyond  60  GHz 
and  LT  GaAs  MISFETs  for  low  frequency  ones  is 
proposed  for  examples. 

INTRODUCTION 

The  development  of  new  power  FET 
structures  for  microwave  applications  needs  the 
determination  of  accurate  predictions  which  involves 
improvements  of  the  modeling  due  to  the  introduction 
of  specific  physical  phenomena,  the  device  complexity 
or  high  operating  frequencies.  As  a  first  approach,  the 
output  power  is  proportional  to  the  drain-source 
breakdown  voltage  Vqr  and  the  maximum  current  Ijso- 
Usually,  larger  is  obtained  for  devices  made  with 
high  bandgap  semiconductors.  The  current  density  is 
mainly  depending  on  the  carrier  concentration  in  the 
channel  and  its  transport  properties.  According  to  the 
operating  frequency  application,  different  new  devices 
are  developed  at  the  present  time.  As  an  example,  the 
study  of  the  potentialities  and  limitations  of  two  kinds 
of  structure  is  proposed.  First,  conventional  or  5-doped 
high  electron  mobility  transistors  (HEMTs)  which 
constitutes  the  most  promising  structure  beyond 
60  GHz  applications  [1]  and  the  low  temperature  (LT) 
GaAs  MISFET  which  is  a  new  structure  for  low 
frequency  ones  presenting  high  breakdown  voltage  [2]. 

Due  to  the  complexity  of  the  physical 
phenomena  which  must  be  taken  into  account  and  the 
large  number  of  parameters  to  consider,  two  kinds  of 
simulations  are  used.  A  two  dimensional 
hydrodynamic  energy  model  making  possible  to 
understand  the  physical  mechanisms  and  a  quasi  two 
dimensional  model  from  which  it  is  possible  to 


perform  device  optimizations.  The  microwave 
performance  of  the  devices  is  described  and  the  main 
limitations  are  studied  in  particular  the  breakdown 
phenomena  which  constitute  one  of  the  most  limitative 
effects  for  power  devices. 

MODELING  DESCRIPTIONS 

The  proposed  study  is  based  on  two  different 
physical  simulation  tools  : 

A  two  dimensional  hydrodynamic  energy 
model  which  takes  into  account  a  large  part  of  the 
physical  phenomena  which  occur  in  the  different 
structures.  It  is  based  upon  the  resolution  of  a  set  of 
equations  derived  from  Boltzmann  transport  equation  : 
the  particule,  momentum  and  energy  conservation 
equations  which  are  associated  with  Poisson’s  equation 

[3] .  These  equations  are  solved  numerically  using  a 
finite  difference  method  with  non  uniform  meshes  and 
a  variable  time  step.  The  main  advantage  of  this  model 
is  the  results  accuracy  but  it  needs  very  long  computing 
time  due  to  a  large  number  of  mesh  points  and  the  use 
of  small  time  steps.  For  this  reason,  programming 
techniques  that  include  vectorial  methods  and  parallel 
processors  have  been  developed.  This  model  uses  a 
CRAY  C98  computer. 

A  quasi  two  dimensional  model  where 
simplifying  assumptions  are  introduced,  but  it  accounts 
for  the  non  stationary  electron  dynamic  effects  that 
take  a  particular  importance  in  submicron  gate  devices 

[4] .  The  model  is  based  under  the  successive  resolution 
along  the  transversal  and  longitudinal  axes.  In  a  first 
time,  the  charge  control  law  of  the  device  is  computed 
(transversal  axis)  and  in  a  second  time,  the  average 
values  of  the  physical  parameters  in  the  device  are 
calculated  using  the  drain-source  current  equation, 
momentum  and  energy  conservation  equations  and 
Poisson's  equation  (longitudinal  axis).  The  validity  of 
this  simulation  is  prouved  by  a  comparison  with  the 
results  obtained  from  the  2D  hydrodynamic  model.  Its 
main  advantage  is  the  relatively  short  computing  time 
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which  makes  possible  to  optimise  the  power  structure 
versus  the  different  parameters. 

HEMT  STRUCTURES  RESULTS 

Mainly,  two  kinds  of  structure  are 
investigated.  The  pseudomorphic  AlGaAs/GalnAs 
HEMT  on  GaAs  which  constitutes  the  most  promising 
structure  for  60  GHz  applications  and  the 
AlInAs/GalnAs  HEMT  on  InP  which  has  emerged  as  a 
device  of  choice  for  high  microwave  applications. 


Fig.  1  :  Distribution  of  physical  quantities  in  a 
PMHEMT  gate-recess  device  (Vgg  =  OV  ;  =  4  V). 

For  the  evaluation  of  power  HEMTs 
performance,  recent  improvements  have  been 
introduced  into  the  models.  First,  the  impact  ionisation 
phenomenon  is  taken  into  account.  The  minority 
carriers  are  considered  and  generation  terms  are  added 
in  the  continuity  equations.  In  order  to  simplify  the 
calculation  in  the  quasi  2D  model,  the  hole  effect  is 
neglected  in  the  charge  control  law  [5].  The  gate  recess 
topology  constitutes  an  important  parameter  that  allows 
to  improve  the  breakdown  voltage.  So,  the  real  shape 
of  the  gate  recess  is  introduced  in  the  2D  model.  It 
makes  possible  to  study  various  gate  recess 
configuration,  so  it  constitutes  a  useful  means  for 
improving  the  device  breakdown  voltage  optimisation. 


For  instance,  figure  1  shows  the  distribution  of 
physical  quantities  (charge  concentration,  total  energy 
and  potential)  in  a  device  where  a  double  gate  recess  is 
introduced.  The  energy  variation  in  the  gate-drain  zone 
allows  to  determine  the  location  of  the  ionisation 
effect.  In  this  case,  it  can  be  noted  that  electric  field 
and  average  carrier  energy  remain  uniform  in  the  gate 
recess  offset  zone  which  is  favourable  to  obtain  a  large 
breakdown  voltage.  Figure  2  represents  the  current- 
voltage  characteristics  deduced  from  the  quasi  2D 
model  for  two  devices  with  a  different  gate  recess 
offset.  An  improvement  is  observed  on  the  breakdown 
voltage  for  larger  gate  recess  offsets.  But  it  may  cause 
a  degradation  of  microwave  performance  and  then,  a 
compromise  must  be  found.  For  this  purpose,  are  also 
studied  the  microwave  capabilities.  For  instance, 
figure  3  represents  the  intrinsic  current  gain  cut-off 
frequency  evolutions  for  two  different  structures. 


Fig.  2  :  Current  voltage  characteristics  for  two  gate 
recess  offsets. 


Fig.  3  :  Influence  of  the  gate-recess  offset  on  the 
intrinsic  cut-off  frequency. 
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Recent  improvement  of  the  2D  model  also 
consists  in  the  determination  of  the  gate  current  which 
constitutes  a  strong  limitation  for  forward  gate  bias. 
Figure  4  shows  the  gate  current  versus  the  gate  voltage 
Vgs  for  three  doping  levels  of  the  doped  AlGaAs  zone 
in  a  conventional  HEMT.  The  evolutions  show  that  up 
to  Vgs  =  0.5  V,  the  gate  current  remains  very  small 
which  confirms  earlier  assumptions  of  zero  gate 
current  at  low  Vg^.  Moreover,  the  gate  current  is 
decreasing  when  the  concentration  of  doped  layer 
decreases.  The  main  advantage  of  the 
AlInAs/GalnAs/InP  structure  is  the  possibility  to 
obtain  large  carrier  concentration  Ng  in  the  well  due  to 
larger  conduction  band  discontinuity  between 
Alo48lno52As  and  Gao47lno53As  than  GaAs  based 
HEMTs.  A  concentration  of  4x10^^  cm'^  is  obtained  for 
a  single  5-doped  device  (fig.  5).  Larger  values  can  be 
obtained  by  dual  5-doped  structures.  Due  to  the  higher 
electron  velocity  in  the  channel  large  drain-source 
current  densities  can  be  achieved.  The  main  problem 
consists  in  the  breakdown  voltage  but  recent  results 
show  that  a  large  improvement  is  obtained  when  an 
undoped  Alo6lno4As  layer  is  inserted  under  the  gate 
[6]. 


VGS(V) 

Fig.  5  :  AilnAs/GalnAs/InP  HEMTs  charge  control  law 


LT  GaAs  MISFET  results 

Modifications  have  been  introduced  in  the 
simulations  to  take  into  account  the  specific  physical 
phenomena  which  occur  in  the  low  temperature  GaAs 
MISFET.  The  basic  structure  is  shown  in  figure  6. 
Under  static  conditions,  the  LT  GaAs  zone  is  taken  as 
conductive  and  under  dynamic  conditions  the  LT  GaAs 
zone  behaves  as  an  insulator.  This  effect  has  been 
introduced  in  the  quasi  2D  model  by  a  determination  of 
a  specific  charge  control  in  static  and  dynamic  regimes 
(figure  7).  From  this  simulation,  the  static  and  dynamic 
parameters  can  be  determined.  Figure  8  represents  for 
instance  the  DC  and  microwave  transconductances 
versus  Vgg  and  figure  9  the  Cgg  capacitance  evolutions. 
For  the  two  regimes,  the  behaviour  is  different.  The 
dynamic  transconductance  is  lower  than  the  static  one. 
Concerning  the  capacitances  Cgg,  in  static  the  evolution 
is  similar  to  a  MESFET  structure  but  in  dynamic  a 
saturation  is  observed  for  forward  biases  as  in 
MISFETs.  Note  that  theses  evaluations  are  in  good 
agreement  with  measurement. 


Fig.  6  :  LT  GaAs  MISFET  structure. 


Fig.  7  :  Charge  control  in  static  and  dynamic  for 
different  LT  GaAs  zone  thicknesses. 


113 


Fig.  8  :  Static  and  dynamic  transconductance  for  a  LT 
GaAs  MISFET 
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Fig.  9  :  Static  and  dynamic  capacitance  Cgs  for  a  LT 
GaAs  MISFET. 


0.3/i^  0.99fim _ ^  ^ 

SOURCE  Src 


Fig.  10  ;  Effect  of  the  carrier  generation  on  the 
potential  distribution. 

LT  GaAs  devices  present  higher  breakdown 
voltages  than  similar  GaAs  MESFETs.  This  can  be 
explained  by  tunnelling  of  electrons  from  the  trap 


miniband  located  in  the  LT  GaAs  zone  to  the 
conduction  band.  These  carriers  will  relax  the  local 
electric  field  peaks  which  involves  an  uniformisation 
of  its  distribution  that  can  give  more  favourable 
breakdown  conditions.  This  phenomenon  introduced  in 
the  2D  model  shows  that  an  improvement  of  the 
electric  field  and  potential  distributions  is  obtained  in 
the  gate-drain  zone  (figure  10) 

CONCLUSION 

The  simulation  of  complex  structures  by  2D 
and  quasi  2D  models  allows  to  perform  complete 
investigations  of  the  physical  understanding  and  the 
expected  performance.  Accurated  results  are  obtained 
with  a  good  agreement  with  those  deduced  from  the 
experimental  measurement.  Improvements  must  be 
always  brought  to  take  into  account  new  physical 
phenomena  for  devices  operating  at  very  high 
frequencies  or  close  to  breakdown  (tunneling  effect  at 
the  gate).  In  the  future,  other  new  devices  can  be 
studied  such  as  those  based  on  high  bandgap 
semiconductors  (SiC  or  GaN)  from  which  a  large 
breakdown  voltage  can  be  expected. 
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ABSTRACT:  First,  a  review  of  the 
laboratories,  which  have  studied  the 
GaInP/GaAs  HBT  for  electronic  applications, 
is  made.  Then  the  peculiarities  of  this  device 
are  addressed  compared  to  the  more 
conventional  GaAlAs/GaAs  HBTs.  Finally,  a 
presentation  of  some  recent  Thomson-CSF  X- 
band  power  results  are  presented  and  compared 
to  the  literature. 


1.  INTRODUCTION 

GaInP/GaAs  HBT  is  a  new  emerging 
challenger  in  the  field  of  GaAs  heterojunction 
bipolar  transistors.  Up-to  recently,  AlGaAs 
/GaAs  heterostructures  have  been  mainly  used, 
giving  rise  to  excellent  results.  All  the  current 
studies  are  focused  on  analogic  microwave 
applications  such  as  high  power  amplifiers  and 
oscillators.  The  present  results  using 
GaInP/GaAs  material  are  very  close  to  the  ones 
obtained  using  the  GaAlAs/GaAs  system, 
except  for  the  reliability  aspect,  which  seems  to 
be  better. 

More  than  twenty  laboratories  have  worked  on 
the  GaInP/GaAs  HBT  technology  in  Europe, 
USA,  and  Japan.  These  studies  have  explored 
all  the  topics  required  to  produce  devices  from 
fundamental  bandgap  arrangement  to  MMICs 
and  reliability.  The  table  I  shows  a  summary  of 
the  studies  presented  by  various  laboratories. 

If  one  makes  a  review  of  all  these  papers,  one 
could  discover  the  following  peculiarities  for 
this  heterojunction; 

*  MOCVD  growth  is  mostly  used  thanks  to  its 
high  throughput  capability  and  carbon  doping; 

*  Absence  of  aluminum  and  related  effect; 


*  Very  high  etching  selectivity  between  GalnP 
and  GaAs.  That  eases  the  emitter  mesa  etching 
by  preserving  the  base  layer.  This  is  not  the 
case  for  the  GaAlAsIGaAs  system,  which  does 
not  show  such  high  selectivity. 

*  Favorable  GaInP/GaAs  band  gap  matching. 
The  composition  grading  is  no  longer 
necessary  during  the  base-emitter  interface 
growth,  and  a  high  injection  efficiency  is 
obtained. 

*  Limited  dependence  of  the  current  gain 
versus  collector  current  density  and 
temperature  without  ledge  passivation 
compared  to  GaAlAsIGaAs  HBTs; 

*  Low  1/f  noise  without  any  critical  device¬ 
structure  optimization,  e.g.: 

Noise  characterization  of  microwave  devices 
has  been  performed  by  the  team  of  Pr. 
Graffeuil  at  low  and  high  frequencies.  The  1/f 
noise  is  found  to  be  lower  than  1  x  10-22  A2 
Hz-1  at  10  kHz  for  a  2  x  30  |j.m2  emitter  at  a 
collector  current  of  0.5  mA.  This  is  obtained 
with  a  simple  PECVD  nitride  passivation.  The 
RF  noise  figure  is  about  three  dB  between  2 
and  20  GHz.  The  GaInP/GaAs  HBT  is 
therefore  a  good  candidate  for  making 
microwave  oscillators. 

VCO  MMICs  working  from  15  GHz  to  34 
GHz  have  been  realized.  Wideband  monolithic 
VCOs  are  therefore  achievable.  For  instance,  - 
85  dBc/Hz  at  100  kHz  off  carrier  have  been 
obtained  at  17  GHz  [8]. 

*  Better  reliability  compared  to  GaAlAs/GaAs 
thanks  to  phenomena  not  yet  totally  understood 
[20]. (106  hours  at  200C  under  bias) 
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University  of  California,  Santa  Barbara 

DC  characterization 

MBE 

[1] 

Sony 

DC  characterization 

MOCVD 

[2] 

T.I. 

Physics  of  the  heterostructure 

MOCVD 

[3] 

Power  amplification 

MOCVD 

[4-5] 

Thomson-CSF 

X-Band  Power  amplifiers 

MOCVD 

[6-7] 

MMICs  VCO 

MOCVD 

[8] 

LAAS  &  Thomson-CSF 

1/f  &  RF  noise 

MOCVD 

[9-10] 

University  Paris  VII  &  Thomson-CSF 

Physics  of  heterojunction 

MOCVD 

[11] 

Fraunhofer  &  Siemens  Research 

Device  modeling 

MOCVD 

[12] 

Daimler-Benz  &  Fraunhofer 

RF  performances 

MOCVD 

[13] 

VCO’s 

MOCVD 

[14] 

Daimler-Benz 

X-Band  Power  amplifiers 

MOCVD 

[24] 

University  of  Illinois,  Urbana 

D-HBT 

MOCVD 

[15] 

Martin-Marietta  &  AT&T  Bell  Lab 

RF  performances 

CBE  &  MOCVD 

[16] 

Tektronics 

RF  performances 

MOCVD 

[17] 

CNET-Bagneux 

DC  characterization 

CBE 

[18] 

NRC-Ottawa 

DC  characterization 

CBE 

[19] 

Fujitsu 

Reliability 

MOCVD 

[20] 

University  of  Sheffield 

Optoelectronics 

MBE 

[21] 

Wright  Patterson 

High  efficiency  devices 

MOCVD 

[22] 

University  of  Michigan  &  Thomson-CSF 

DC  characterization 

MOCVD 

[23] 

Table  I:  Review  of  the  main  laboratories  involved  in  the  GaInP/GaAs  HBT  study 
2.  THOMSON-CSF  POWER  AMPLIFIERS 


N-P-N  GaInP/GaAs  heterostructures  are 
grown  on  75mm  wafers  using  LP-MOCVD 
technique.  The  collector  thickness  is  kept  close 
to  1  |i.m  with  a  doping  level  of  2.0  x  10i6cm-3. 
The  current  gain  of  microwave  transistors  is 
close  to  20  which  gives  rise  to  a  common 
emitter  breakdown  voltage  BVceo  of  about 
18V.  This  value  has  been  up-to-now 
sufficiently  high  since  the  present  limitation  of 
HBT  performances  is  mainly  dominated  by 
thermal  behaviour.  The  current  gain  cut-off 
frequency  ft,  which  depends  on  the  base  and 
collector  layer  thicknesses,  is  about  55  GHz  at 
the  optimal  bias  point.  The  elementary  emitter 
is  2  pm  wide  and  30  pm  long.  Two  2-pm  wide 
0.2-pm  thick  base  contact  stripes  are  used  to 
give  rise  to  a  reasonably  low  base  resistance. 


Boron  ion  implantation  is  employed  to  decrease 
the  parasitic  element  contribution  and  to  realize 
the  inter-device  electrical  isolation.  All  three 
electrodes  are  contacted  using  3.5-pm  thick 
gold  air-bridges.  Passive  components  and  via 
hole  technology  are  compatible  with  standard 
microwave  GaAs  MESFET  foundry. 
Substrates  are  100  pm  thick  for  microstrip 
integration.  The  lOGHz  small  signal  power 
gain  of  a  monofinger  HBT  is  close  to  14  dB  at 
20  mA  and  7V,  which  is  our  class-A  bias 
point.  At  10  GHz,  the  output  power  per  finger 
is  close  to  90  mW  and  the  collector  efficiency 
is  60%  for  all  topologies. Figure  1  shows  a 
compilation  of  the  power  performances  of 
GaAlAs/GaAs  and  GalnP/GaAs  HBTs  at  10 
GHz. 
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3.  CONCLUSION 


One  [7]  or  two  stages  X-band  power  amplifiers 
have  been  realized.  Circuit  characterizations  are 
done  either  in  CW  or  using  10  ps  pulses  and  a 
10%  duty  cycle.  Figure  2  shows  the  power 
performance  of  a  5W  amplifier  under  pulsed 
conditions.  5W  are  reached  with  a  power 
added  efficiency  of  33%.  These  results  have  to 
be  compared  to  the  recent  Texas  Instruments 
result  obtained  also  using  GaInP/GaAs,  which 
indicates  a  9.9W  output  power  with  a  peak 
power  added  efficiency  of  33.3%  [5]  or  to 
Daimler-Benz  [24]  ones,  which  has  obtained 
40%  and  8W  CW.  Figure  3  shows  a 
comparison  of  our  results  with  recent  HBT 
publication. 


GaInP/GaAs  HBTs  are  now  able  to  produce 
microwave  oscillators  and  X-band  high-power 
high-efficiency  amplifiers.  Higher  reliability 
compared  to  GaAlAs/GaAs  HBts  obtained  by 
Takahashi  et  al  from  Fujitsu  may  be  one  of  the 
major  advantages  for  the  future  of  this  device. 
The  high  etching  selectivity  between  GalnP 
and  GaAs  is  very  attractive  for  MMIC 
fabrication. 

Thomson-CSF  contribution  is  supported  by 
DRET.  We  want  to  emphasize  our  ever 
growing  with  our  colleagues  from  Daimler- 
Benz  and  DAS  A. 
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Figure  1:  Power  performances  of  discrete  HBTs  at  10  GHz 
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Figure  2:  Output  power  versus  input  power  at  10  GHz  of  a  5W  amplifier 
(10  ps,  duty  cycle:  10%) 

X'Band  HBT  power  amplifier 


Output  power  (W)  at  f  >  9.5  GHz 


Figure  3:  Comparison  of  X-band  power  amplifier  performances 
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ABSTRACT 

High-performance  60GHz-band  MMIC's  for 
wireless  communication  systems  based  on  pseudo- 
morphic  heterojunction  device  technologies  are 
presented.  Dielectrically  stabilized  signal  sources 
for  FM/FSK  modulators  and  LOs,  such  as  a  DRO 
with  low  phase  noise  of  -90dBc/Hz  at  lOOkHz  off- 
carrier,  lO.ldBm  output  power  and  1.6pptn/‘C 
stability,  and  a  stabilized  VCO  with  55MHzW 
modulation  sensitivity,  a  52.5-62. 5GHz  wide-band 
power  amplifier  wi^  14dB  gain  and  IT.ldBm 
output  power,  and  a  low-noise  high-gain  down- 
converter  consisting  of  an  LNA  and  an  active-gate 
mixer  with  25.9dB  gain  and  2.8dB  DSB  noise 
figure,  have  been  developed. 

1.  INTRODUCTION 


systeras[2]  are  presented.  High-performance 
MMIC  chip  sets  for  T/R  modules  of  stabilized 
signal  sources  for  FM/FSK  modulators  and  LOs, 
wide-band  power  amplifiers,  and  low-noise  high- 
gain  downconverters  have  been  developed. 

2.  DEVICE  TECHNOLOGIES 

For  signal  sources  and  power  amplifiers  where 
high  output  power  is  required,  0.15;im  gate  N- 
AlGaAs/InGaAs/N-AlGaAs  double-doped  double 
heterojunction  FET  (HJFET)  with  high  fmax  of 
240GHz  and  high  current  drivability  of  500mA/ 
mm  are  adopted  as  the  active  elements.  Figure  1 
shows  a  cross  sectional  structure  of  an  MMIC 
voltage  controlled  oscillator  (VCO)  for  the  FM/FSK 
modulator.  In  addition  to  the  active  HJFET,  a  drain 
DC-biased  HJFET  is  adopted  as  a  variable  reactance 
element.  The  drain  DC-biased  HJFET  varactor 


Millimeter  wave  communication  systems  are 
attracting  much  attention  for  high  speed  wireless 
LANs,  image  data  transmissions,  contactless  ID 
cards  and  so  on  because  of  high  potential  of 
realizing  very  compact  equipment,  of  fine  beam 
forming  and  of  utilizing  wide  bandwidth[l]. 
60GHz-band  systems  have  an  additional  advantage 
of  reduced  mutual-interference  among  systems 
owing  to  high  atmospheric  attenuation  loss  of  the 
radiated  signals.  In  this  paper,  60GHz-band 


shows  a  reduced  series  resistance  and  gradually 
decreasing  C-V  characteristics  near  pinch-off, 
suitable  for  linear  frequency  control  characteristics 
as  shown  in  Fig.2,  and  can  be  fabricated  in  a  simple 
process,  completely  compatible  with  active  HJFETs 
for  the  oscillator  itself  and  amplifiers[3]. 

For  low  noise  MMIC's,  0.15;im  gate  N-InGaP/ 
InGaAs/GaAs  pseudomorphic  HEMTs  are  adopted, 
which  have  superior  performance  to  AlGaAs/GaAs 
based  HEMT's[4].  The  fmax  is  180GHz  . 
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Fig.  3  MMICDROchip. 


Fig.4  Circuit  schematic  0  f  the  VCO. 


3.  SIGNAL  SOURCES 

A  60GHz-band  MMIC  dielectric  resonator 
oscillator  (DRO)  has  been  developed[5].  Figure  3 
shows  a  chip  photograph  of  a  DRO  with  a  buffer 
amplifier.  The  oscillator  circuit  has  a  series  feed¬ 
back  topology.  The  capacitive  stub  is  connected  to 
the  source  of  the  FET  as  the  series  feedback 
element  and  the  resonance  circuit  at  the  gate  side  is 
coupled  to  a  dielectric  resonator  (DR).  A  rod 
shaped  TEoi  3  mode  DR  is  placed  directly  on  the 


Fig. 6  Oscillation  frequency  and  output  power  vs. 
control  voltage  (Vc)  with  drain  bias  (Vdv) 
of  the  varactor  as  a  parameter  for  the  VCO. 


MMIC  chip.  The  MMIC  chip  size  is  2.22  X 
3.37mm^.  The  MMIC  DRO  exhibits  the  state  of  the 
art  performance  of  10.1  dBm  output  power  and  low 
phase  noise  of  -90dBc/Hz  at  lOOkHz  off-carrier 
with  the  oscillation  frequency  of  59.157GHz.  The 
frequency  stability  is  high  with  the  temperature 
coefficient  of  1.6ppm/'C.  These  performances 
are  nearly  comparable  to  those  of  Ka-band  DROs 
[6],  and  suitable  for  signal  sources  and  LO  s. 

In  addition,  a  dielectrically  stabilized  VCO  has 
been  developed  for  the  FM/FSK  modulator [7].  The 
circuit  schematic  of  the  developed  MMIC  is  shown 
in  Fig.4.  The  circuit  is  composed  of  a  VCO  with  an 
integrated  varactor,  stabilized  with  a  DR,  and  a 
buffer  amplifier.  A  resonance  circuit  is  placed  at 
the  gate  side  of  the  active  FET  with  a  drain  biased 
HJFET  varactor  circuit  and  a  microstrip  line 
coupled  to  a  DR.  The  MMIC  chip  is  shown  in 
Fig. 5.  The  chip  size  is  2.22 X  2.22mm\  Figure  6 
shows  the  oscillation  frequency  control  character¬ 
istics  and  output  power  with  the  drain  DC-bias 
(Vdv)  of  the  varactor  FET  as  a  parameter.  With 
increase  in  Vdv,  the  oscillation  frequency  variation 
and  its  linearity  with  regard  to  the  control  voltage 
(Vc)  are  improved.  The  modulation  sensitivity  of 
55MHzA^  at  60.0GHz  is  obtained  for  0.5V  Vdv. 
The  output  power  is  6.9dBm  and  is  fairly  constant 
over  the  control  voltage  range.  The  phase  noise  is 
-80dBc/Hz  at  lOOkHz  off-carrier  and  the  frequency 
stability  is  -4.1ppm/‘C. 
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Vd.lF 


Fig. 7  Two-stage  wide-band  power  amplifier  chip. 


4.  POWER  AMPLIFIER 

Wide-band  power  amplifiers  have  been  develop¬ 
ed  for  transmitters  and  LO  buffer  amplifiers. 
Figure  7  shows  a  MMIC  chip  of  a  two-stage 
amplifier  with  200;<m  wide  HJFETs.  The  chip  size 
is  1.07  X  2.22mm^  Both  input  and  output  matching 
circuits  basically  consist  of  a  microstrip  trans¬ 
mission  line  and  an  inductive  shunt  shorted  stub 
grounded  by  a  via-hole  through  an  MIM  capacitor. 
Due  to  these  short  length  elements,  the  phase 
deviation  with  the  frequency  is  small,  resulting  in 
wide-band  operation  capability  [8].  An  open  stub 
in  the  input  matching  circuit  and  an  RC  network  in 
the  bias-line  are  introduced  in  order  to  prevent  low 
frequency  bias-line  oscillations.  The  developed 
MMIC  amplifier  exhibits  a  very  flat  gain  of  14dB 
from  52.5GHz  to  62.5GHz  with  high  isolation  of 
30dB  as  shown  in  Fig. 8.  The  output  power  of 
17.1dBm  is  obtained  at  58.0GHz.  This  is  sufficient 
for  the  transmitting  power  in  the  indoor  communi¬ 
cations. 


Frequency  (GHz) 

Fig. 8  Frequency  response  of  the  wide-band 
power  amplifier. 
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90“  hybrid  circuit 

Fig. 9  Circuit  schematic  of  the  SB  gate  mixer. 


LNA  Mixer 

Fig.  10  Downconverter  chip. 


5.  LOW-NOISE  DOWNCONVERTER 

For  receivers,  low-noise  downconverters  consist¬ 
ing  of  an  LNA  and  an  active  HEMT  mixer  have 
been  developed[9]-[ll].  A  60GHz-band  one-chip 
downconverter  integrated  with  a  four-stage  LNA 
and  a  single-balanced(SB)  active-gate  mixer  has 
been  demonstrated  utilizing  0.15;<m  gateN-InGaP/ 
InGaAs/GaAs  pseudomorphic  HEMT's[ll].  The 
circuit  schematic  of  the  SB  mixer  is  shown  in  Fig. 
9.  The  mixer  consists  of  a  pair  of  HEMT's,  a  90° 
hybrid  circuit,  and  a  A  /4  transmission  line  to  make 
a  180°  phase  difference  between  the  RF  and  LO 
signals  for  isolation.  The  MMIC  chip  is  shown  in 
Fig.  10.  The  chip  size  is  1.75  X  2.5mm^ 

Figure  1 1  shows  the  LO  power  dependence  of 
the  active-gate  mixer  conversion  gain.  A  maximum 
conversion  gain  of  1.5dB  at  OdBm  LO  power  is 
obtained.  It  is  noted  that  the  MMIC  active-mixer 
has  a  positive  gain  for  the  first  time  over  the 
millimeter  wave  developments.  Figure  12  shows 
LO  frequency  dependences  of  the  gain  and  noise 
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LO  power  (dBm) 


Fig.  1 1  LO  power  dependence  of  the  mixer 
conversion  gain. 
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LO  frequency  (GHz) 

Fig.  12  Downconverter  conversion  gain  and 
noise  figure. 


figure  of  the  developed  MMIC  downconverter.  The 
IF  frequency  is  150MHz.  The  downconverter  has 
exhibited  a  high  conversion  gain  of  ^5.9dB  and  a 
double  side-band(DSB)  noise  figure  of  2.8dB  for 
low  LO  power  of  -0.27dBm  at  60.8GHz. 

6.  CONCLUSION 

High-performance  MMIC  chip  sets  for  60GHz- 
band  wireless  communication  systems  have  been 
developed.  0.1 5^m  gate  N-AlGaAs/InGaAs/N- 
AlGaAs  HJFET  technologies  are  used  for  power 
MMIC's.  A  DRO  exhibits  low  phase  noise  of 
-90dBc/Hz  at  lOOkHz  off-carrier  and  high  stability 
of  1.6ppm/'C .  As  the  FM/FSK  modulator,  a  dielect¬ 
rically  stabilized  VCO  with  a  drain  biased  HJFET 
varactor  has  realized  55MHzA^  modulation  sensi¬ 
tivity.  Wide-band  power  amplifiers  have  been 
developed  for  transmitters  and  buffer  amplifiers.  A 


two-stage  amplifier  exhibits  a  very  flat  gain  of 
14dB  from  52.5GHz  to  62.5GHz  with  17.1  dBm 
output  power.  For  receivers,  low-noise  down- 
converters  consisting  of  an  LNA  and  an  active-gate 
mixer  have  been  developed  adopting  0.15)<m  gate 
N-InGaP/InGaAs/GaAs  pseudomorphic  HEMT 
technologies.  The  active-mixer  has  a  positive  gain 
for  the  first  time,  and  the  downconverter  has 
exhibited  a  high  conversion  gain  of  25.9dB  and  a 
DSB  noise  figure  of  2.8dB.  These  MMIC's  are 
suitable  for  compact  wireless  communication 
systems  such  as  indoor  wireless  LANs,  in  which  the 
transmitting  power  is  planned  to  be  lOdBm. 
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ABSTRACT 

Blind  identification  consists  of  estimating  the  impulse 
response  of  a  linear,  time-invariant  channel  used  for 
transmission  of  digital  data  by  observing  the  channel 
output  without  knowledge  of  the  transmitted  symbol 
sequence. 

The  aim  of  this  paper  is  twofold.  First  we  compare, 
in  order  to  assess  their  applicability  to  the  equalization 
of  digital  radio  links  affected  by  selective  fading,  some 
recently  proposed  algorithms  based  on  the  second-order 
statistics  of  the  received  signal.  Further,  one  of  these 
algorithms  is  modified  to  cope  with  correlated  noise. 

1.  INTRODUCTION 

By  blind  identification  we  mean  here  the  estimate  of  the 
impulse  response  of  a  linear,  time-invariant  noisy  chan¬ 
nel  used  for  transmission  of  digital  data;  this  estimate 
is  obtained  by  observing  the  channel  output  without 
knowledge  of  the  transmitted  symbol  sequence.  Our 
motivation  for  the  present  work  was  to  assess  the  ap¬ 
plicability  of  blind  identification  to  the  equalization  of 
microwave  radio  links,  to  limit  communication  outages 
caused  by  severe  selective  fading.  However,  the  conclu¬ 
sions  drawn  are  applicable  to  other  channels  as  well. 

The  desirable  features  of  the  ultimate  blind  identi¬ 
fication  algorithm  are: 

♦  Low  identification  error  in  the  presence  of  noise. 

♦  Fast  convergence. 

♦  Computational  simplicity. 

♦  Insensitivity  to  data-symbol  correlation. 

♦  Insensitivity  to  noise  correlation. 

♦  Possibility  to  make  it  adaptive. 

This  research  was  done  in  the  framework  of  the  “Human 
Capital  and  Mobility”  program  sponsored  by  the  European 
Community. 


Recent  work  on  the  subject  was  based  on  exploiting 
the  cyclostationarity  of  digital  signals,  and  was  spurred 
by  [4].  Tong,  Xu,  and  Kailath  [7,  8,  9,  10]  have  devel¬ 
oped  a  blind  identification  algorithm  (herewith  referred 
to  as  TXK  algorithm)  which  is  based  on  an  estimate  of 
the  autocorrelation  function  of  the  observed  channel- 
output  samples.  This  feature  entails  a  convergence 
faster  than  other  blind  algorithms  based  on  higher- 
order  statistics  [12],  which  is  highly  desirable  when  the 
channel  is  time-varying  and  its  variations  have  to  be 
tracked  quickly  in  order  to  compensate  for  them.  This 
algorithm  converges  globally,  can  resolve  the  non-min¬ 
imum-phase  zeros  of  the  channel  transfer  function,  and 
is  robust  with  respect  to  timing  recovery.  However,  it 
suffers  from  some  drawbacks,  viz., 

•  It  is  computationally  intensive,  as  it  requires  two 
singular-value  matrix  decompositions. 

•  It  requires  data  symbols  to  be  uncorrelated. 

•  It  requires  the  noise  to  be  uncorrelated. 

•  It  is  not  adaptive. 

An  algorithm  that  improves  on  TXK  (herewith  re¬ 
ferred  to  as  MDCM)  was  proposed  by  Moulines  ei  al.  [5]. 
The  advantages  of  this  new  algorithm  over  TXK  are; 

•  A  lower  computational  complexity. 

•  Convergence  even  with  correlated  data  symbols. 

•  A  lower  identification  error  for  the  same  observa¬ 
tion  length. 

More  recently,  Baccala  and  Roy  [1,  2]  have  pro¬ 
posed  a  new  algorithm  (herewith  referred  to  as  BR) 
that  presents  a  significantly  lower  computational  com¬ 
plexity  and  an  identification  error  close  to  that  of  TXK 
and  MDCM  algorithms. 

The  aim  of  this  paper  is  twofold.  First  we  com¬ 
pare,  by  combining  analysis  and  simulation,  the  TXK, 
MDCM,  and  BR  algorithms,  in  order  to  assess  their 


0-7803-2516-8/95  $4.00  ©  1995  IEEE 


123 


applicability  to  the  equalization  of  digital  radio  links 
affected  by  selective  fading.  Our  results  show  that  in 
general  these  algorithms  based  on  second  order  statis¬ 
tics  outperform  standard  blind  equalization  in  terms 
of  convergence  speed.  Moreover,  while  the  BR  algo¬ 
rithm  has  a  lower  computational  complexity,  in  this 
specific  application  the  MDCM  algorithm  outperforms 
both  TXK  and  BR  in  terms  of  robustness  to  source- 
data  correlation  and  mean-square  estimation  error. 

Further  we  present  a  modification  of  the  MDCM 
algorithm  in  [5]  in  order  to  achieve  blind  identification 
in  the  presence  of  unknown  correlated  noise.  Our  al¬ 
gorithm  is  based  on  a  matrix  decomposition  method 
proposed  in  [6]. 


2.  CHANNEL  MODEL  AND 
IDENTIFICATION  ALGORITHMS 

We  assume  a  digital  signal  linearly  modulated  by  a 
symbol  sequence  The  complex  envelope  r(^)  of 

the  signal  observed  at  the  output  of  the  channel  can  be 
expressed  as 


oo 

-•«)=  E  Snh{t-nT)  +  b{t),  (1) 

n=:  — OO 


where  {«„}  form  a  sequence  of  identically  distributed 
complex  random  variables  (the  channel  symbols),  h{i) 
is  the  channel  impulse  response,  T  is  the  symbol  period, 
and  b{t)  denotes  additive  noise. 

The  task  of  blind  identification  algorithms  is  to  ob¬ 
tain  an  estimate  of  h{t)  based  upon  the  observation  of 
the  process  r{t)  during  a  finite  time  interval.  We  as¬ 
sume  that  h{  • )  has  a  finite  duration,  say  (M-f  1)T,  and 
that  the  conditions  of  identifiability  based  on  second- 
order  statistics  [1 1]  are  satisfied  by  the  channel  impulse 
response. 

The  basic  step  in  this  approach  consists  of  oversam¬ 
pling  signal  r{t)  with  period  A.  Thus  every  T  we  get 
L  =  T/A  samples  of  the  observed  process,  taken  at  in¬ 
stants  to  iAy  0  <  i  <  L  —  1.  This  generates  a  vector 
process  {r„}. 


2.1.  The  TXK  algorithm. 

The  TXK  algorithm  is  based  on  the  further  assump¬ 
tions  that 

♦  The  symbols  are  uncorrelated,  and 

•  The  noise  process  6(^)  is  white. 


2.2.  The  MDCM  algorithm. 

This  algorithm  does  not  require  the  assumption  that 
the  symbol  sequence  be  uncorrelated.  Blind  identifi¬ 
cation  with  the  MDCM  algorithm  requires  only  one 
singular- value  decomposition,  along  with  the  solution 
of  an  eigenvalue/eigenvector  problem. 

2.3.  The  BR  algorithm. 

This  is  based  on  the  same  assumptions  as  the  MDCM 
algorithm,  and  provides  identification  by  solving  a  sys¬ 
tem  of  linear  equations. 

2.4.  Blind  identification  with  correlated  noise 

None  of  the  above  algorithms  works  in  the  presence  of 
correlated  noise.  We  were  able  to  modify  the  MDCM 
algorithm  (which,  as  we  shall  see  later  on,  provides  the 
best  performance  for  the  problem  we  are  considering 
here)  in  order  to  make  it  applicable  to  the  case  where 
the  noise  is  not  white.  We  use,  with  some  adjustments, 
the  matrix  decomposition  method  proposed  in  [6]. 


3.  RESULTS  AND  COMPARISONS 


To  model  the  fading  radio  channel  transfer  function 
H{f),  we  use  here  the  classical  simplified  three-ray 
model.  Under  normal  propagation  conditions,  H{f)  = 
1,  while  during  a  fading  event 


H{f)  =  a  (l  -  =  a  (l  - 


(2) 

where  a  is  the  amplitude  of  the  direct  ray  and  accounts 
for  flat  fading  (hereafter  we  shall  assume  a  =  1),  0  < 
6  <  1  is  the  relative  amplitude  of  the  delayed  ray,  and 
fn  G  (— l/2r,  l/2r)  is  the  notch  frequency,  and  r  is  the 
difference  of  time  delay  between  the  two  rays. 

At  the  channel  output  the  observed  signal  is  r(<)  = 
x(t)  -h  6(<),  where  b{t)  accounts  for  thermal  noise  (and 
possibly  other  interferences,  which  we  shall  disregard 
here.)  With  no  fading  the  impulse  response  of  the  chan¬ 
nel  is  a  raised-cosine  function  with  roll-off  factor  a,  that 
is. 


p{t)  =  A 


sin  irt/T 
~  irtlT 


cos  ant /T 


while  in  the  presence  of  fading  h{t)  is  given  by 


hit)  =  a  [p(0  -  -  r)]  (3) 


where  fc  denotes  the  carrier  frequency  (note  that  h{t) 
is  generally  complex.) 

The  following  parameters  values  were  selected  as 
typical: 


124 


•  Relative  amplitude  of  the  reflected  ray: 

J5  =  20  log(l  -  6)  =  20  dB 

•  Delay  of  the  reflected  ray: 

r  =  6.3  ns 

•  Difference  between  the  notch  frequency  and  the 
carrier  frequency: 

/n-/c=0 

•  Modulation  scheme:  128-QAM  at  140  Mbit/s, 
corresponding  to  a  symbol  interval  50  ns. 

•  Raised-cosine  waveforms  with  roll-off  a  (in  par¬ 
ticular,  we  have  examined  the  cases  a  =  .11  and 
a  =  ,25.) 

•  Oversampling  factor  L  (we  have  chosen  L  =  3 
and  L  =  7.) 

•  Observation  length  N . 

The  correlations  were  first  estimated  by  using  a 
time  average  over  N  symbol  intervals.  TXK,  MDCM, 
and  BR  algorithms  were  then  used  to  identify  the  chan¬ 
nel  impulse  response.  The  quality  of  the  identification 
was  evaluated  by  computing  the  mean-square  error 

1 

USE  = —Y^\hi -hi\\ 

t  =  l 

We  also  assume  that  the  carrier  phase  is  estimated 
exactly,  that  the  duration  of  the  impulse  response  is 
known,  and  that  the  unknown  scalar  complex  multi¬ 
plying  factor  inherent  to  all  the  algorithms  examined 
here  can  be  recovered  in  some  way. 

We  hasten  to  say  that  the  word  “convergence”  may 
not  be  fully  appropriate  when  describing  the  behavior 
of  these  algorithms,  which  are  not  recursive  in  nature. 
Since  they  depend  on  a  preliminary  estimate  of  corre¬ 
lations,  by  “speed  of  convergence”  we  mean  to  describe 
how  many  samples  are  necessary  for  a  correlation  esti¬ 
mate  that  generates  a  good  enough  identification  of  the 
channel  impulse  response.  Thus  our  “speed  of  conver¬ 
gence”  actually  reflects  the  robustness  of  an  algorithm 
to  the  uncertainties  in  the  knowledge  of  the  correlation. 

3.1.  Equalization  through  channel  identification 

The  estimate  of  the  impulse  response  of  the  channel 
may  be  used  to  evaluate  the  tap  gains  of  a  linear  trans¬ 
versal-filter  equalizer.  It  was  found  in  [3]  that  the  TXK 
algorithm  outperforms  blind  equalization  in  terms  of 
speed  of  convergence,  at  the  price  of  being  computa¬ 
tionally  much  more  intensive  than  the  latter. 


3.2.  Simulations  with  white  noise  and  uncorre¬ 
lated  data 

Table  1  summarizes  the  performance  of  the  three  algo¬ 
rithms  in  the  presence  of  white  noise  and  uncorrelated 
data.  Here  AtxKj  Amdcm  3'^^d  Abr  denote  the  MSE 
of  TXK,  MDCM  and  BR  algorithm,  respectively.  By 
Ai  ^  Aj  we  mean  that  at  convergence  the  MSE  per¬ 
formances  of  algorithms  i  and  j  are  about  the  same. 
By  Ai  <  Aj  we  mean  that  at  convergence  the  MSE  of 
algorithm  i  is  lower  than  for  algorithm  j.  By  Ai  <  Aj 
we  mean  that  at  convergence  the  MSE  of  algorithm  i  is 
significantly  lower  (i.e.,  lower  by  more  than  one  order 
of  magnitude)  than  for  algorithm  j.  Nc  denotes  the 
number  of  symbols  needed  to  reach  the  convergence 
according  to  the  criterion  described  in  [3]. 

We  can  see  that,  for  low  signal- to- noise  ratio  EblNo 
(energy  per  bit  over  noise  power  spectral  density),  the 
MDCM  algorithm  gives  always  better  estimates  than 
the  other  two  algorithms,  and  the  BR  algorithm  gives 
similar  MSE  with  respect  to  the  TXK  algorithm  but 
with  significantly  lower  computational  cost. 

For  high  Eb/No,  MDCM  and  BR  give  better  es¬ 
timates  than  the  TXK  algorithm,  but  always  with  a 
lower  computational  cost  for  the  BR  algorithm. 

In  general,  we  found  that  the  convergence  proper¬ 
ties  of  these  blind  identification  algorithms  are  very 
sensitive  to  the  value  of  the  roll-off  factor  oc.  In  partic¬ 
ular,  for  relatively  small  Eb/No  values  the  algorithms 
exhibit  poorer  convergence  properties  as  a  becomes 
larger.  We  may  interpret  this  by  looking  at  the  shape  of 
the  raised-cosine  pulse  p(f).  If  a  is  large,  p{t)  exhibits  a 
fatter  central  lobe  and  smaller  sidelobes:  therefore,  the 
effect  of  the  second  ray  is  an  almost  complete  cancella¬ 
tion  of  a  portion  of  the  impulse  response  for  low  values 
of  r/T.  Conversely,  for  smaller  a  this  cancellation  ef¬ 
fect  is  correspondingly  reduced.  We  may  say  that  the 
identification  problem  becomes  better  conditioned  as  oc 
decreases, 

3.3.  Simulations  with  white  noise  and  correlated 
data 

The  effect  of  data  correlation  was  examined  by  choos¬ 
ing  a  binary  symbol  sequence  with  correlation 

E[sn5n~A:]  cqual  to  1  for  fc  =  0,  to  —  1  for  fc  =  ±1, 
and  zero  elsewhere.  The  convergence  of  the  MDCM 
algorithm  is  considerably  slowed  down,  particularly  for 
lower  Eb/No.  Roughly,  with  20,000  symbols  the  MSE 
is  greater  than  for  uncorrelated  data  by  a  factor  of  2 
for  Eb/No  =  50  dB,  by  one  order  of  magnitude  for 
Eb/No  =  30  dB,  and  by  two  orders  of  magnitude  for 
Eb/No  =  20  dB,  On  the  other  hand,  the  TXK  and  BR 
algorithms  fail  to  converge,  as  expected. 
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oc 

L 

Ei/Na  (dB) 

Nc 

MSE 

0.11 

3 

20 

6000 

^MDCM  ^  ^TXK  ~  Abr 

0.11 

7 

20 

1500 

Amdcm  ^  Axxk  <  Abr 

0.25 

3 

20 

20000 

Amdcm  Atxk  ^  Abr 

0.25 

7 

20 

8000 

Amdcm  <  Atxk  ^  Abr 

0.11 

3 

30 

<500 

Amdcm  <  Abr  <  Atxk 

0.11 

7 

30 

<500 

Amdcm  <  Atxk  ^  Abr 

0.25 

3 

30 

<500 

Amdcm  <  Abr  <  Atxk 

0.25 

7 

30 

<500 

Amdcm  ^  Abr  <  Atxk 

0.11 

3 

50 

<500 

Amdcm  ^  Abr  -C  Atxk 

0.11 

7 

50 

<500 

Amdcm  ^  Abr  <C  Atxk 

0.25 

3 

50 

<500 

Amdcm  «  Abr  <  Atxk 

0.25 

7 

50 

<500 

Amdcm  ^  Abr  <  Atxk 

Table  1:  Comparison  among  three  blind  identification  algorithms. 


3.4.  Simulations  with  correlated  noise 

The  effect  of  noise  correlation  was  also  examined.  Col¬ 
ored  noise  was  obtained  by  passing  white  noise  through 
a  single-pole  filter.  It  was  seen  that  the  three  algo¬ 
rithms  considered  here  failed  to  converge. 

The  next  step  was  modify  the  MDCM  algorithm  so 
as  to  achieve  blind  identification  with  correlated  noise. 
It  was  seen  from  simulation  that  the  channel  estimates 
obtained  from  this  modified  MDCM  algorithm  are  sat¬ 
isfactorily  good,  although  the  convergence  was  some¬ 
what  slowed  down. 
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ABSTRACT 

This  paper  presents  an  investigation  into  Real¬ 
time  Channel  Evaluation  (RTCE)  from 
predicted  zero  crossing  analysis  of;  (i)  a  pilot- 
tone,  and,  (ii)  Frequency  Shift  Keying  (FSK) 
modem  tones  within  a  HF  (High  Frequency) 
radio  system.  This  work  extends  the  initial 
investigations  from  the  early  1970s  which  used 
an  analogue  implementation  analysing  phase 
perturbation  by  time-differential  phase 
comparisons  of  a  low-level  pilot-tone  inserted 
in-band.  The  results  to  be  presented  in  this 
paper  compare  theoretical  analysis  and 
measured  results  in  Gaussian  and  flat  fading 
conditions,  and  hence  identify  the  close 
relationship  between  phase  threshold,  zero 
crossing  error  rate  and  SNR  (Signal  to  Noise 
Ratio).  This  type  of  non-parametric  system 
measurement  technique  is  widely  applicable  for 
other  constant  envelope  modulation  types,  e.g. 
MFSK,  and  as  such  is  suitable  for  incorporation 
into  frequency  management  systems  for  various 
different  radio  systems  operating  in  the  HF, 
VHF  and  UHF  bands. 


1.  INTRODUCTION  AND  BACKGROUND 
INFORMATION 

The  utilisation  of  a  low-level,  co-channel  pilot 
tone  as  a  diagnostic  probe  to  facilitate 
monitoring  of  current  channel  state  can  provide 
an  insight  into  communication  system 
performance.  Specifically,  both  the  amplitude 
and  phase  of  the  tones  are  affected  by 
underlying  propagation  conditions  and  co¬ 
channel  interference;  with  phase  sensitivity 
providing  the  best  estimates  of  prevailing 
conditions  [1].  Consequently,  analysis  of  phase 
perturbation  by  time-differential  phase- 
comparison  can  be  used  to  estimate  the 
prevailing  SNR, 


An  original  analogue  implementation,  (Darnell, 
1975)  [2]  required  long  observation  times  (e.g, 
>200  seconds)  to  obtain  a  channel  estimate  for 
high  SNRs.  Latterly,  by  concentrating  on 
reducing  the  observation  time,  (Grayson,  1991) 
[3]  identified  that,  by  using  a  Digital  Signal 
Processing  (DSP)  based  implementation, 
complete  channel  observations  could  be 
completed  in  2.5  seconds.  Work  here  with  an 
enhanced  DSP  implementation  optimises 
observation  time  still  further  to  produce 
meaningful  channel  quality  estimations  in  the 
region  of  750  ms. 

Application  of  this  research  is  focused  on 
effective  spectrum  management,  since  efficient 
system  operation  depends  upon  the  ability  to 
react  to  a  range  of  parameters  that  characterise 
both  propagation  and  channel  noise.  The  use  of 
RTCE  in  an  automated  communication  system 
can  reduce  the  use  of  high  radiated  powers  by 
selecting  frequencies  and  transmission  times  for 
which  the  received  SNR  is  maximised  and 
interference  avoided.  By  effectively 
incorporating  appropriate  RTCE  techniques, 
significant  improvement  in  system  performance 
can  be  achieved,  as,  in  many  instances, 
interference  caused  by  congestion  is  the  limiting 
factor  in  a  communications  system  performance, 
rather  than  the  propagation  conditions. 


2.  RTCE  FROM  PREDICTED  ZERO 
CROSSING  ANALYSIS  WITH  A  PILOT- 
TONE 

Pilot-tone  RTCE  uses  a  low  level  tone  placed  in- 
band  close  to  frequencies  of  interest.  The  pilot- 
tone  is  analysed  to  consider  aspects  such  as 
instantaneous  phase,  amplitude,  and  other 
properties  that  are  affected  by  the  noise  and 
channel  perturbations.  In  this  application  the 
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pilot-tone  is  used  to  provide  channel  quality 
information  to  determine  the  required 
transmission  SNR  at  a  given  instant  in  time. 
Although  both  the  amplitude  and  phase  of  a  pilot 
tone  are  affected  by  propagation  conditions  and 
interference,  it  is  the  phase  that  is  more  sensitive 
to  the  prevailing  conditions  [4].  Consequently, 
analysis  of  phase  perturbation  by  time- 
differential  phase  comparison  can  provide 
insight  into  channel  performance. 

In  a  digital  system,  the  incoming  baseband  signal 
is  sampled  and  filtered  to  extract  the  highly- 
tuned  pilot  tone  using  a  narrow-band  FIR  filter. 
The  occurrence  of  a  zero-crossing  is  predicted 
by  adding  a  constant  value  of  onto  the 

previous  detected  crossing,  ( n  being  an  integer 
and  T  representing  the  period  of  the  pilot  tone). 
The  timing  of  the  predicted  crossing  point  is 
used  to  calculate  the  phase  error  of  the  received 
tone  and,  if  this  is  greater  than  a  chosen 
threshold,  a  failure  is  recorded.  The  rate  of  phase 
comparison  failure  is  then  used  to  infer  channel 
quality.  This  system  does  not  require  any  form  of 
decoding  or  demodulation  to  determine  the 
channel  quality  and  in  this  respect  is  non- 
parametric  system,  also  no  synchronisation  of 
the  zero-crossing  of  the  pilot  tone  is  required. 

A  pilot  tone  subject  to  additive  Gaussian  noise 
was  used  in  initial  tests.  The  theoretical  results 
were  derived  from  equation  (3)  in  [1],  given 
below  in  equation  (1). 


where;  is  the  probability  of  zero 

crossing  error  for  a  signal  in  Gaussian- 
distributed  noise,  is  the  pilot-tone  phase 

threshold,  (J^  RMS  noise  amplitude  at  the 
output  of  the  RTCE  filter  (digital 
implementation),  Aq  is  the  signal  amplitude 

(normalised),  and  and  envelope  of 

autocovariance  of  the  output  noise  of  the  RTCE 
filter. 

Practical  results  were  compared  to  the 
theoretical,  and  shown  in  Figure  1.  As  can  be 
seen  the  results  for  both  theory  and  experiment 
are  in  close  agreement  at  all  threshold  values.  It 
was  noted  that  at  high  values  of  SNR,  where  the 

value  of  Pa  „  ,  was  very  small,  the  measured 

results  had  a  tendency  to  diverge  from  the 
theory.  The  general  agreed  reason  for  this 
phenomena  is  that  the  number  of  comparison 
possible  at  high  SNRs  is  not  statistically 
relevant  when  compared  to  the  probability  of 
error  expressed  in  the  theory. 
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3.  OPTIMISING  THE  NUMBER  OF 

PHASE  COMPARISONS 

In  order  to  produce  the  results  given  above  the 
number  of  averaged  phase  comparisons  carried 
out  to  obtain  a  measure  of  the  probability  of 
pilot-tone  phase  error  was  varied  as  a  function 
of  the  SNR  (high  number  of  comparisons  for  a 
low  SNR),  thus  producing  a  statistically  relevant 
result  at  low  probabilities  of  error.  An 
investigation  was  carried  out  into  the  actual 
number  of  phase  comparisons  required  for 
optimal  channel  evaluation.  It  was  found  that  an 
observation  time  of  750  ms  was  required,  this  is 
in  comparison  to  200  seconds  for  the  original 
analogue  implementation. 

4.  RTCE  FROM  PREDICTED  ZERO 
CROSSING  ANALYSIS  WITH  FSK 

MODEM  TONES 

In  order  to  obtain  the  theoretical  phase-error 
rates  for  an  FSK  modem,  above  that  of  a  single 
pilot  tone,  equation  ( 1 )  was  considered, 
detailing  the  probability  of  zero-crossing  error 
for  a  steady  signal  in  Additive  Gaussian  white 
noise  (AGWN).  In  order  to  adapt  the  equations 
given  above  to  determine  the  zero  crossing  error 
rate  for  FSK  modem  tones  with  respect  to  a 
pilot  tone,  each  tone  was  considered  separately, 
and  in  the  limit  equi-probable. 

Results  shown  in  Figure  2  are  similar  to  those 
shown  in  Figure  1,  and  show  a  strong 
correlation  between  calculated  theoretical  and 
measured  results  for  both  steady  signal  and 
Rayleigh  fading  channel  conditions.  It  is  clear, 
however,  that  at  large  window  sizes  the 
correlation  is  reduced.  An  explanation  for  the 
large  window  size  recording  a  lower 

error  rate  than  expected  in  such  in  high  noise 
situations  is  based  on  the  DSP  implementation 
being  indiscriminate  where  subsequent  valid 
zero  crossing  should  occur.  Hence  invalid 
crossings  are  erroneously  counted  as  correct. 


are  numerous:  a  reduction  in  system  complexity 
and  reduced  power  consumption  within  the 
transmitting  system,  RTCE  from  predicted  zero 
crossing  analysis  is  an  important  non-parametric 
channel  quality  and  evaluation  system,  since  it 
is  possible  to  infer  system  performance  without 
demodulation  processes,  e.g.  synchronisation, 
error  control  or  decryption.  This  research  can  be 
extended  using  in-band  and  out-of-band  digital 
filters  for  constant  envelope  signalling  methods 
e.g.  MFSK.  It  is  believed  that  this  research  has 
merits  in  a  broad  range  of  radio  systems  and 
forms  an  important  part  of  channel  spectrum 
management, 
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5.  CONCLUSIONS 

Applications  using  this  zero-crossing  analysis 
RTCE  technique  would  be  best  suited  to  using  a 
narrow  window  size  especially  at  low  SNR 
values.  This  paper  demonstrates  that  RTCE  can 
be  performed  without  the  need  for  an  inserted 
pilot-tone  co-located  with  the  information 
bearing  modem  tones.  The  advantages  of  this 
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Figure  1  Comparison  of  theoretical  and  measured  results  for  zero  crossing  error  rates 
for  a  pilot-tone  at  various  threshold  values  under  Gaussian  conditions 


Figure  2  Comparison  of  theoretical  and  measured  results  for  zero  crossing  error  rates 
for  FSK  modem  tones  at  various  threshold  values  under  Gaussian  conditions 
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ABSTRACT 

This  work  is  concerned  with  the  use  of  Total  Least 
Squares  criterion  in  Forward-Backward  Linear  Predic¬ 
tion  optimization,  yielding  a  new  frequency  estimation 
method,  named  TLS-FBLP.  A  detailed  comparison  of 
our  method  to  Tufts  and  Kumaresan^s  Modified  FBLP, 
revealed  that  TLS-FBLP  is  better  than  Modified  FBLP 
at  low  prediction  orders,  as  well  as  they  are  similar  for 
high  orders. 

1.  INTRODUCTION 

Studies  in  Signal  Processing  have  paid  special  attention 
to  the  detection  of  sinusoids  in  the  presence  of  white 
noise.  The  basic  problem  consists  in  estimating  the 
sinusoids  frequencies  using  a  few  signal  samples. 

This  problem  becomes  critical  either  when  the  fre¬ 
quency  separation  is  smaller  than  the  reciprocal  of  the 
signal  observation  interval  or  when  the  signal-to-noise 
ratio  (SNR)  decreases. 

The  best  performance  for  this  situation  is  achieved 
by  Maximum  Likelihood  (ML)  based  methods.  How¬ 
ever,  its  computational  effort  is  prohibitive.  Tufts 
and  Kumaresan  [6]  introduced  their  Modified  FBLP 
method,  which  solves  Forward-Backward  Linear  Pre¬ 
diction  (FBLP)  in  Least  Squares  (LS)  sense  and  re¬ 
duces  noise  effects  by  using  Singular  Value  Decompo¬ 
sition  (SVD).  This  method  is  computationally  less  ex¬ 
pensive  than  ML,  but  its  peformance  attains  that  of 
ML  for  high  SNR^s. 

In  1980,  Golub  and  Van  Loan  [3]  presented  Total 
Least  Squares  (TLS)  criterion,  which  optimizes  Linear 
Prediction  (LP)  by  taking  into  account  perturbation  in 
both  the  observation  vector  b  and  the  data  matrix  A. 

The  aim  of  this  paper  is  to  present  a  new  frequency 
estimation  method,  named  TLS-FBLP,  which  is  based 
on  the  application  of  TLS  criterion  to  FBLP  optimiza¬ 
tion.  Its  performance  is  compared  to  that  of  Tufts  and 
Kumaresan’s  Modified  FBLP.  Also,  a  new  measure  for 
the  optimum  value  of  prediction  order  is  proposed. 


2.  FREQUENCY  DETECTION  BY  LINEAR 
PREDICTION 

Let  us  consider  N  uniformly  spaced  samples  of  the  sum 
of  M  complex  exponential  signals,  initially  noiseless,  as 
given  by: 

M 

“W  =  + n  =  l,2,...,Ar  (1) 

k=l 

where  the  ajk  are  the  amplitudes,  the  u)]^  are  the  fre¬ 
quencies  and  the  (j)k  are  the  phases  of  the  exponentials. 

Linear  Prediction  allows  us  to  estimate  signal  fre¬ 
quencies  by  using  vector  w  of  coefficients  of  a  predic¬ 
tion  error  filter  (PEF),  optimized  in  order  to  minimize 
the  mean  square  error. 

Ulrich  and  Clayton  [7]  have  shown  that  the  joint  op¬ 
timization  of  forward  and  backward  PEF^s,  subject  to 
the  forward  coefficients  being  the  same  as  the  complex 
conjugate  of  backward  ones  in  reverse  order  (FBLP), 
yields  better  results  when  estimating  the  frequencies  of 
undamped  sinusoids. 

The  related  optimum  vector  w  of  coeflRcients,  for  a 
prediction  order  L,  has  dimensions  L  x  1  and  defines  an 
L  order  polynomial  IV(z),  given  by  the  formula: 

W{z)  =  l-wlz-'^ -w*2Z-'^  -  (2) 

for  the  upper  asterisk  meaning  complex  conjugate. 

Antunes  [1]  demonstrates  that  M  of  W{z)  zeros  lie 
on  the  unit  circle  (UC),  and  are  called  signal  zeros. 
The  remaining  ones  are  called  extraneous  zeros  and  are 
approximately  uniformly  distributed  within  the  UC. 

The  desired  frequencies  can  be  estimated  through  the 
angular  locations  of  the  M  signal  zeros  on  the  UC. 
These  are  the  basic  features  of  FBLP  method. 

The  corruption  of  u[n]  by  white  noise  leads  the  ze¬ 
ros  of  W{z)  to  fluctuate  around  their  ideal  locations. 
This  behavior  is  shown  in  Figure  1,  where  we  superim¬ 
pose  the  zeros  of  W{z)  for  50  experiments  under  the 
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following  conditions: 


iV  =  25, 

ai  =  l.Oj 
02  =  1.0, 


M  =  2,  L  =  24, 
ui  —  (1,  00)7r  and 

W2  =  (lj04)7r  and 


SNR=  10  dB. 

(^i  =  (-^l,00)7r. 
(^2  =  (-0,79)7r. 


Each  experiment  uses  a  new  realization  of  noise,  added 
to  the  same  signal  samples. 


Real 


Figure  1:  FBLP  transfer  function  zeros. 


Figure  2:  Comparison  among  performances  of  ML, 
FBLP  and  Modified  FBLP  methods. 

These  fiuctuations  degrade  the  detection  perfor¬ 
mance.  While  SNR  is  greater  than  about  25  of  dB’s, 
performance  degradation  is  due  only  to  random  fiuctu¬ 
ations  of  the  M  signal  zeros.  However,  the  fluctuations 
of  both  types  of  zeros  grow  as  SNR  decreases,  leading 
to  the  possibility  of  an  extraneous  zero  to  be  located 
closer  to  UC  than  a  signal  zero.  In  this  situation,  this 
extraneous  zero  is  interpreted  as  signal  zero  and  a  large 
error  is  produced.  So,  performance  decreases  rapidly, 
characterizing  a  threshold  effect. 

In  Figure  2,  we  show  the  inverse  of  the  estimates  vari¬ 
ance  versus  SNR.  We  also  show  the  theoretical  bound 
for  the  performance,  namely,  Cramer-Rao  Bound,  and 
the  performance  of  Maximum  Likelihood  [6] . 

Focusing  on  the  dotted  line  curves,  we  observe  that 
they  attain  the  ML  performance  at  high  SNR^s.  How¬ 
ever,  both  curves  and  even  Cramer-Rao  Bound,  decay 


slowly  as  SNR  decreases,  until  the  threshold  is  reached. 
Consequently,  their  performances  degrade  rapidly. 

3.  MODIFIED  FBLP  METHOD 

The  general  solution  for  vector  w,  in  least  squares  sense, 
is  given  by: 

wls  =  A^b.  (3) 

where  #  means  pseudo-inverse. 

Singular  Value  Decomposition  (SVD)  [1]  applied  to 
data  matrix  A,  allows  us  to  write  (3)  in  the  following 
form: 


min[2(iV-L),L]  ^ 

=  E 

where  cr^(jb)  are  the  singular  values,  and  u^(ib), 
the  left  and  right  singular  vectors  of  A,  respectively. 

In  the  noiseless  case,  rank(A)  =  M.  Thus,  only  the 
first  M  singular  values  are  nonzero  and,  obviously,  cor¬ 
respond  to  the  signal.  These  are  the  signal  singular 
values.  In  the  presence  of  noise,  A  has  full  rank  and 
has  others  min[2(7V  —  L),  L]  —  M  nonzero  singular  val¬ 
ues,  related  to  the  noise.  These  are  the  noise  singular 
values. 

In  Modified  FBLP  method,  we  take  into  account  only 
the  M  largest  (TA{k)^  signal  singular  values^ 

truncating  the  sum  in  (4)  to  M  in  order  to  approxi¬ 
mate  the  noiseless  case. 

Continuous  line  curves  in  Figure  2  allows  us  to  ob¬ 
serve  the  lowering  of  threshold  SNR’s  and  the  increasing 
of  estimate  precision,  due  to  Modified  FBLP  method. 

4.  TOTAL  LEAST  SQUARES  CRITERION 

In  the  following,  we  develop  a  mathematical  character¬ 
ization  of  TLS  criterion  and  describe  the  solution  for 
the  corresponding  w. 

4.1.  Analysis  Through  Perturbation  Theory 

When  applying  Perturbation  Theory  to  Linear  Predic¬ 
tion,  we  first  write  the  LP  set  of  equations  as  the  ap¬ 
proximation: 

Aw  «  b  (5) 

In  order  to  have  an  equality,  we  can  add  an  error 
vector  e  to  the  right  side  of  (5).  So,  we  can  write: 

Aw  =  (b  -h  e)  (6) 

The  solution  in  least  squares  sense  amounts  to  finding 
such  a  vector  w  that  minimizes  the  quadratic  norm  of 
e  ,  subject  to  (6). 
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TLS  criterion  takes  into  account  perturbations  in 
both  b  and  A,  in  order  to  have  an  equality  in  (5).  So, 
we  can  write: 

(Ad-E)w^  =  (b  +  e),  (7) 

where  E  is  the  perturbation  matrix  for  A,  with  dimen¬ 
sions  2{N  -  L)  X  L.  We  can  rewrite  (7)  as: 


If  <^p(L+i)  is  a  repeated  singular  value  of  P,  we  have 
infinite  solutions.  In  this  case,  we  can  adopt  minimum 
norm  solution  which  is  given  by  Golub  and  Van  Loan 
[3]  as: 


WtX5  == 


k=M+l  .I3iz^W+l  I^P[(L+l),i]h, 


(14) 


([A|b]  +  [E|e])[-^]  =0  (8) 

Thus,  the  solution  to  TLS  problem  amounts  to  finding 
such  a  vector  w  that  minimizes  the  quadratic  norm  of 
[E  I  e],  subject  to  (8). 

4.2.  Solving  in  Total  Least  Squares  Sense 

Define  P  =  [A  |  b]  and  Q  ^  [E  |  e],  with  dimensions 
2{N  —  L)  X  (L  +  1).  TLS  solution  amounts  to  finding 
such  a  vector  w  that  assures: 

min  llQlIf^,  subject  to  (b  -{-  e)  G  7^(A  +  E),  (9) 

where  7^(A)  denotes  the  range  of  A. 

Following  the  steps  in  [3],  TLS  estimate  can  be  ob¬ 
tained  from  the  SVD  of  P  for  the  case  where  2{N—L)  > 
(L  +  l): 

(L+l) 

p  =  XI  (10) 

jfe=l 

for  <Tp(i)>-  ♦  •>  <Tp(^M)>  crp(M-fi)  =  •  •  •=  crp(L+i)>  0. 

Golub  and  Van  Loan  [3]  show  that  the  minimum  is 
attained  when: 

Q  =  -Pvv^  (11) 

where  v  is  any  unit  vector  in  subspace  Sw,  defined 

by  the  linear  combination  of  {vp(jvf-f-i))  *  *  *  > 

i.e.,  the  right  singular  vectors  of  P  corresponding  to 

For  a  ^  0  being  the  {L  +  l)-st  component  of  vector 
V,  we  make: 


and  substitute  (11)  and  (12)  in  (8).  Then  we  get: 

([A|b]+[E|e])  =  P(I_vv^)(-^) 

=  0,  (13) 

for  v^v  =  1,  i.e.,  v  is  a  unit  vector. 

Therefore,  vector  wtl5  given  by  (12)  satisfies  equa¬ 
tion  (8)  and  assures  that  ||Q||p  is  minimal.  Thus,  it 
solves  TLS  problem.  However,  if  there  is  no  v  in  Sw 
such  that  a  0,  TLS  problem  has  no  solution. 


y^TLS 

-1 


where 


yp(k)  = 


(15) 


For  an  underdetermined  set  of  equations,  i.e.,  2{N  — 
L)  <  (L-1-1),  we  have  more  unknowns  than  equations  in 
(8).  Lawson  [4]  states  that  this  yields  infinite  solutions 
with  null  error.  In  this  situation,  TLS  and  LS  solutions 
are  the  same.  Thus,  we  can  assume  vttls  =  A^b. 


5.  TLS-FBLP  METHOD 


TLS-FBLP  method  consists  in  the  use  of  TLS  criterion 
in  order  to  find  an  optimum  value  for  w  in  forward- 
backward  linear  prediction,  as  stated  by  the  formula: 


=  0  (16) 


where  the  indexes  /  and  6  refer  to  forward  and  to  back¬ 
ward  LP,  respectively. 

To  the  best  of  our  knowledge,  this  approach  is  unpub¬ 
lished  and  yields  a  new  frequency  detection  method. 


5.1.  Performances  Under  Perspective 

In  order  to  compare  the  performance  of  TLS-FBLP 
to  that  of  Modified  FBLP,  we  computed  performance 
curves  for  prediction  orders  from  L  =  2  to  L  =  24.  This 
allowed  us  to  generate  performance  surfaces,  as  shown 
in  Figure  3  and  Figure  4.  We  observe  the  threshold 
SNR’s  through  the  shape  of  the  cutting  edge  of  the  sur¬ 
face.  At  low  prediction  orders,  Modified  FBLP  shows 
deep  valleys  even  at  high  SNR’s.  However,  the  methods 
present  the  same  performance  curves  for  L  >  20. 

The  shape  of  the  threshold  edge  for  TLS-FBLP  is 
smoother  than  that  for  Modified  FBLP.  Threshold  SNR 
values  for  TLS-FBLP  are  almost  the  same  for  L  from  8 
to  18,  whereas  the  best  performance  of  Modified  FBLP 
is  observed  in  the  range  L  =  14  to  L  =  18,  degrading 
significantly  outside  this  interval.  Also,  the  former  is 
better  than  the  latter  at  low  prediction  order  values. 

Therefore,  TLS-FBLP  method  is  less  sensitive  to  pre¬ 
diction  order  changes  than  Modified  FBLP. 

The  lowest  threshold  SNR  value  for  TLS-FBLP  is  8 
dB  at  L  =  16,  whereas  the  lowest  threshold  SNR  for 
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Figure  3:  Performance  surface  for  TLS-FBLP  method. 


Figure  4:  Performance  surface  for  Modified  FBLP  method. 

Modified  FBLP  had  the  value  7  dB  and  occurred  at 
L  =:  17,  as  shown  in  Figure  5. 

Although  the  best  performance  has  been  verified  in 
Modified  FBLP  method,  TLS-FBLP  best  performance 
is  only  1  dB  worse.  So,  we  can  say  that  TLS-FBLP 
shows  performance  comparable  to  Modified  FBLP. 

Both  usual  FBLP  and  TLS-FBLP  do  not  make  sub¬ 
space  restriction  as  Modified  FBLP  does,  but  TLS- 
FBLP  performance  is  quite  close  to  that  of  Modified 
FBLP,  and  significantly  superior  to  that  of  FBLP. 

The  best  performances  of  both  TLS-FBLP  and  Mod¬ 
ified  FBLP  occurred  when  data  matrices  P  and  A,  re¬ 
spectively,  became  almost  square.  So,  the  empyrical 
measure  for  the  optimum  prediction  order  in  Modified 
FBLP,  defined  by  Tufts  and  Kumaresan  as  (3/4)A,  can 
be  substituted  by  the  measure  that  states:  the  opti¬ 
mum  prediction  order  value  is  such  that  it  leads  the  data 
matrix  or  the  extended  data  matrix  to  be  aproximately 
square. 

6.  CONCLUDING  REMARKS 

Total  Least  Squares  (TLS)  criterion  allows  us  to  pro¬ 
pose  a  new  frequency  detection  method,  which  we  called 


-5  0  5  10  15  20  25  30  35  40 

SNR  (dB) 

Figure  5:  Comparison  between  the  best  performances 
for  TLS-FBLP  and  Modified  FBLP  methods. 

TLS-FBLP.  This  method  lowers  the  threshold  SNR’s 
and  its  best  performance  is  only  1  dB  worse  than  the 
best  result  of  Tufts  and  Kumaresan ’s  Modified  FBLP. 
Also,  TLS-FBLP  has  shown  to  be  less  sensitive  to  pre¬ 
diction  order  changes  and  performed  better  than  Mod¬ 
ified  FBLP  for  low  prediction  orders.  Moreover,  best 
performances  occurred  when  the  data  matrix  or  ex¬ 
tended  data  matrix  were  approximately  square. 

TLS-FBLP  is  similar  to  FBLP,  as  they  do  not  make 
subspace  restriction.  However,  its  performance  is  com¬ 
parable  to  that  of  Modified  FBLP. 
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ABSTRACT 

A  novel  rigorous  approach  to  the  spectral  density  es¬ 
timation  problem  based  on  the  trigonometric  moment 
problem  technique  is  considered.  Using  the  trigonomet¬ 
ric  moment  problem  results,  all  possible  extrapolations 
of  the  autocorrelation  function,  which  are  in  agreement 
with  a  set  of  known  values  are  found.  A  wide  set  of 
spectral  estimators  is  described  in  terms  of  polynomi¬ 
als  orthogonal  with  respect  to  the  given  autocorrelation 
sequence.  The  parametric  representation  for  this  set  is 
given. 

1.  Introduction 

At  present,  there  are  several  standard  methods  of  deal¬ 
ing  with  the  classical  problem  in  digital  signal  and  sen¬ 
sor  array  processing.  [1,  2,  3].  These  methods  include 
the  conventional  nonparametric  Fourier  approach  and 
more  recently  developed  parametric  techniques  such  as 
maximum  entropy  (ME),  maximum  likelihood  (ML), 
MUSIC,  ESPRIT  and  others. 

Until  1967,  most  of  the  procedures  used  for  estimat¬ 
ing  the  spectral  density  of  a  stochastic  process  were 
based  on  the  Blackman-Tukey  approach  [1].  The  ex¬ 
pectation  of  an  estimate  is  equivalent  to  the  convolu¬ 
tion  of  the  true  spectral  density  of  the  stochastic  pro¬ 
cess  with  the  spectral  window.  The  statistical  stability 
and  resolution  of  the  spectral  density  estimate  using 
the  Blackman-Tukey  procedure  are  highly  dependent 
on  the  choice  of  the  window  function  [4].  Moreover,  the 
spectral  density  estimator  based  on  Blackman-Tukey 
approach  is  linear  because  it  involves  the  use  of  linear 
operations  on  the  available  time  series.  A  major  prob¬ 
lem  with  the  Fourier  transform  is  that  the  accuracy  of 
the  mode  estimates  is  roughly  inversely  proportional 
to  the  total  simulation  time  interval.  Moreover,  there 

This  work  was  supported  by  the  Russian  Foundation  of  Fun¬ 
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is  problem  with  appropriate  choice  of  the  windowing 
function. 

The  windowing  problem  may  be  overcome  by  using 
the  nonlinear  estimators  of  spectral  density  based  on 
ME  approach,  ML  approach  or  others.  But  these  ap¬ 
proaches  are  not  universal.  Really,  these  approaches  do 
not  permit  one  to  get  other  spectral  estimations  which 
corresponds  to  time  series,  whose  autocorrelation  func¬ 
tions  agree  with  the  same  set  of  known  values,  i.e.  to 
get  other  extrapolations  of  the  correlation  function. 

Besides,  when  the  spectral  estimate  with  good  res¬ 
olution  is  necessary  to  solve  a  more  concrete  prob¬ 
lem,  for  example,  the  power  spectral  density  estimation 
problem  with  additional  a  priori  information  about  the 
spectrum,  it  seems  unreasonable  to  apply  the  known 
methods  of  spectral  estimation,  such  as  ME  method, 
ML  method  or  others  in  their  original  form,  because 
these  methods  of  spectral  estimation  are  not  intended 
to  solve  concrete  problems  [5]. 

So,  in  spite  of  the  progress  in  spectral  estimation 
and  a  list  of  successful  practical  application  of  the  more 
popular  methods  of  spectral  estimation  such  us  ME, 
MUSIC,  ML  and  some  others,  new  methods  for  effec¬ 
tive  spectral  estimation  are  necessary.  Besides,  it  is 
desired  to  have  a  common  approach  both  for  the  con¬ 
struction  of  spectral  estimation  methods  and  for  the 
investigation  of  their  performance. 

The  aim  of  this  paper  is  to  propose  and  develop 
a  novel  rigorous  approach  to  spectral  density  estima¬ 
tion  problem.  We  describe  all  possible  extrapolations 
of  the  autocorrelation  function  which  agree  with  a  set 
of  known  values.  The  set  of  possible  spectral  estima¬ 
tions  is  described  as  a  parametric  family  of  spectral 
estimators  with  paramethzation  function.  The  quanti¬ 
tative  performance  of  the  proposed  spectral  estimator’s 
family  is  established. 
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2.  Spectral  density  estimation  prob-  problem  (i)-(2)  for  sequence  c^,  fc  =  o,  i,..., 

unsolvable. 

lem 


2.1.  Problem  formulation 

Consider  the  problem  of  estimating  a  power  spectral 
density  function  given  only  that  it  is  positive  on  the 
spectral  support,  zero  outside  and  compatible  with  the 
known  values  of  the  autocorrelation  sequence. 

The  aim  of  spectral  estimation  is  to  find  an  estimate 
of  the  spectrum  which  agrees  with  the  set  of  the  known 
values  of  autocorrelation  sequence  c^,  fc  =  1,  2, . . . ,  iV, 
i.e.,  spectral  density  estimation  problem  is  to  determine 
the  nonnegative  function  P{6)  such  that 

P{0)>0,  for  0€[-7r,7r],  (1) 


and 


TT 

—  /  d9  =  Ck,  k^0,l,...,N.  (2) 

27r  J 

—  TT 

Note  that  the  considered  problem  (l)-(2)  may  be  un¬ 
solvable  for  an  arbitrary  sequence  of  the  values  Ck  if  we 
strictly  follow  conditions  (2).  To  become  solvable,  this 
problem  has  to  be  regularized. 

There  are  two  ways  to  attach  a  meaning  to  the  prob¬ 
lem  (l)-(2).  The  first  way  is  to  replace  conditions  (2) 
by  approximate  relations.  In  this  case  the  problem 
arises  of  choosing  a  good  approximate  relation.  An¬ 
other  way  to  attach  a  meaning  to  the  problem  (1)— (2)  is 
to  relax  the  condition  (1)  and  replace  it  by  P{d)  >  “Mi 
for  0  €  [-7r,7r],  where  ^  is  a  positive  value.  Then,  the 
shifted  spectral  estimation  Pn{d)  =  P{9)  +  M  will  be 
nonnegative  and  problem  transforms  to  problem  (1)~ 
(2)  with  modified  autocorrelation  sequence  Cfc  -I-  fxSko, 
k  =  0,  1, . . . ,  N,  where  8ko  is  the  Kroneker  symbol. 

It  is  relevant  to  remark  that  the  problem  for  shifted 
function  P^{9)  is  not  solvable  for  arbitrary  positive 
value  fi.  However,  we  can  always  make  the  problem 
solvable  by  choosing  a  sufficiently  large  value  fx. 

The  appearing  item  uSko  can  be  explained  in  a  dif¬ 
ferent  way.  First  of  all,  the  measurement  data  really 
may  have  a  white  noise  with  dispersion  characteristic 
Ix.  Besides,  by  having  added  a  component  fxSko  to  regu¬ 
lar  autocorrelation  coefficients  Cfc  we  imply  an  obvious 
method  to  impart  regularity  to  the  considered  problem. 
In  this  way  the  value  /x  is  a  regularization  parameter. 
Obviously,  the  regularized  problem  is  equivalent  to  ini¬ 
tial  spectral  density  estimation  problem  (l)-(2)  for  new 
autocorrelation  sequence  Ck  -f  fxSko,  k  =  0,  1, . .  ■ ,  N. 
Therefore,  below  we  consider  the  spectral  density  esti¬ 
mation  problem  (1)— (2)  with  the  regularization  param¬ 
eter  fx,  whenever  the  initial  spectral  density  estimation 


2.2.  Extrapolation  of  autocorrelation  se¬ 
quence 

Let  us  now  assume,  that  a  solution  of  the  spectral  den¬ 
sity  estimation  problem  (l)-(2)  is  known.  Then  we  may 
calculate  the  unknown  terms  of  the  autocorrelation  se¬ 
quence,  i.e,  Cfc  for  all  fc  >  iV  and  so  to  extrapolate  the 
autocorrelation  sequence.  The  inverse  is  also  true,  i.e., 
if  we  have  found  an  extrapolated  autocorrelation  se¬ 
quence  4,  fc  =  0,  1, . . .  such  that  its  TV  -b  1  first  terms 
are  equal  to  the  known  values  of  the  autocorrelation 
sequence,  i.e.,  the  values 

Ck  —  Ojt,  for  A)  =  0,  1,  .  •  • ,  A”, 

then  the  spectral  estimate  can  be  defined  by  means  of 
the  inverse  Fourier  transform 

OO 

k=~~oo 

where  c_fc  =4  for  all  integers  k. 

Let  us  note,  that  the  extrapolated  autocorrelation 
sequence  Ck  can  be  chosen  arbitrary  provided  that  the 
function  P(0)  is  nonnegative.  The  last  condition  gives 
us  a  restriction  for  the  possible  extrapolations  of  the 
autocorrelation  sequence  Ck,  k  =  0,  1, . .  ■ ,  N.  Namely 
the  autocorrelation  sequence  cjt,  fc  =  0,  1, . .  • ,  A  and 
the  extrapolated  autocorrelation  sequences  {4}^o 
to  be  positive-definite  [6,  7].  The  set  of  all  spectral  esti¬ 
mations  written  in  form  (3)  with  an  arbitrary  positive 
definite  extrapolated  sequence  4)  k  =  0,  1,  2, . . .  forms 
the  Carateodory  class  of  function  [8].  The  Catateodory 
class  is  closely  associated  with  the  trigonometric  mo¬ 
ment  problem  [6],  which  plays  an  important  part  for 
the  description  of  all  solutions  of  the  spectral  density 
estimation  problem. 


2.3.  The  trigonometric  moment  problem 

The  trigonometric  moment  problem  consists  of  a  col¬ 
lection  of  complex  numbers 


fc  =  0,  1,...,A,  (4) 


where  P{9)  is  an  unknown  positive  function  of  9.  As¬ 
suming  that  the  moments  Cfe,  fc  ==  0,  1, .  ■  • ,  A  of  P{9) 
are  exactly  known,  the  problem  is  to  determine  the 
function  P{9).  Let  us  note,  not  any  set  of  numbers  Ck, 
k  =  0,  1, .  •  • ,  A  can  be  the  moments  of  nonnegative 
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function,  i.e.,  the  trigonometric  moment  problem  for  an 
arbitrary  sequence  of  moments  c^,  A:  =  0,  1, . . . ,  N  can 
be  unsolvable.  We  will  assume  that  the  Toeplitz  deter¬ 
minants  An  ~  det||cn_A;||  a-re  nonzero  for  all  integer 
n-=0  1,  N. 

Then  there  exists  the  system  {Pn{z)}  of  polynomi¬ 
als  of  the  first  type  and  the  system  {Pn{z)}  of  poly¬ 
nomials  of  the  second  type  orthogonal  with  respect  to 
the  unknown  measure  P{6),  which  correspond  to  given 
moment  sequence  {cfc}j^o  monic  polynomials 

{Pn{z)}  satisfy  the  recurrence  relations  for  the  orthog¬ 
onal  sets  of  polynomials: 


Pn+lfy)  =  zPn{z)^bn^lP:,{z),  (5) 

ciof 

where  R*{z)  =  z^R{l/z)  for  any  polynomial  R{z)  of 
degree  A:;  bk  are  some  constants  for  which  formal  ana¬ 
lytic  relations  in  determinant  form  can  be  obtained. 

The  following  criterion  of  solvability  of  the  moment 
problem  (4)  holds  [9]: 

Theorem  1  The  trigonometric  moment  problem  (4) 
is  solvable  if  and  only  if  the  inequalities 


\ak\  <1,  A;  =  0,  1,...,  iV, 

hold,  where  {afc}^Q  are  the  coefficients  of  the  recur¬ 
rence  relation  of  polynomials  orthogonal  with  respect  to 
the  moment  sequences  {ck}j^^Q. 

To  attach  this  theorem  a  constructive  mean,  a  recur¬ 
rence  algorithms,  for  example,  a  Levinson  algorithm 
has  to  be  used  to  calculate  the  coefficients  of  the 
recurrence  relation  for  orthogonal  polynomials  [9]. 

Let  us  now  assume  that  the  necessary  conditions 
of  solvability  of  the  trigonometric  moment  problem  are 
true.  Then  all  functions  P{0),  for  which  the  relation 
(4)  is  fulfilled  for  all  A:  =  0,  1, , . . ,  A^,  can  be  described 
by  means  of  the  linear-fractional  transformation  [6]. 

Theorem  2  Let  Pn{z)  and  Qn{^)  be  the  monic  poly¬ 
nomials  of  first  and  second  type  orthogonal  with  respect 
to  given  moment  sequence  {cfc}^o.  Let  the  moment 
problem  (4) ,  be  indeterminate.  Then  all  solutions  of 
the  moment  problem  (4)  can  be  described  by  means  of 
the  linear-fractional  transformation 


P,{z)  =  ^.^e 
flN 


zQN{z)e{z)-{-Q*j^{z) 
zPn{z)€{z)  -  Pfj{z)  ’ 


z  =  (6) 


of  the  function  e{z)  €  B,  where  B  is  a  class  of  functions 
holomorphi^in  C  and  bounded  there  in  modulus  by  1; 
P*{z)  =  Z-Pn{l/z),  Q*(z)  =  Z-Q^{l/z),  hn  = 

Due  to  Theorem  2  any  solution  P{6)  of  the  mo¬ 
ment  problem  can  be  expressed  by  means  of  the  linear- 
fractional  transformation  (6)  with  the  holomorphic  func¬ 
tion  e{z)  corresponding  to  this  solution. 


Using  the  properties  of  the  orthogonal  polynomials 
[6]  it  is  easy  to  transform  (6)  to  form 


\zPN{z)e{z)-Pf,{z)r 


3.  Spectral  estimator’s  family 

In  this  section  we  consider  the  spectral  estimation  prob¬ 
lem  as  a  trigonometric  moment  problem. 

As  it  follows  from  the  previous  section  any  solution 
of  the  spectral  estimation  problem  (1)^(2)  for  the  reg¬ 
ularized  sequence  Ck  4-  A:  =  0,  1, . . . ,  iV  can  be 
represented  in  form  (7)  where  e{z)  is  a  parametriza- 
tion  function  corresponding  to  this  spectral  estimate, 
Bn{z)  is  the  polynomial  with  the  coefficients  satisfying 
to  the  Yule- Walker  system  and  P’fjf^z)  is  a  polynomial 
conjugate  to  Pn{z), 

Using  representation  (7)  and  an  effective  algorithm 
for  calculation  of  the  orthogonal  polynomial  Pn{z)  and 
its  conjugate  by  the  given  autocorrelation  values  one 
may  obtain  different  spectral  estimations.  Obviously, 
the  form  and  performance  of  a  concrete  spectral  estima¬ 
tor  depend  on  the  choice  of  the  parametrization  func¬ 
tion  e{z).  Using  an  appropriate  parametrization  func¬ 
tion,  the  known  spectral  estimation  can  be  obtained. 
For  example,  in  the  special  case  where  €(z)  =  0  rela¬ 
tion  (7)  is  simplified  and  passed  to  Burg’s  maximum 
entropy  spectral  estimation 

Pme{0)  =  \PN{e^^)r\ 

where  Pn{z)  is  the  polynomial  with  the  coefficients  sat¬ 
isfying  to  Yule- Walker  system. 

Another  example  is  the  maximum  entropy  solution 
of  spectral  density  estimation  problem  with  a  priori 
information  about  spectrum  [5,  10],  where  the  aim  of 
spectral  estimation  is  to  find  the  function  which  agrees 
with  the  set  of  the  known  values  of  autocorrelation 
function  and  given  a  priori  information.  In  view  of  the 
given  a  priori  information,  the  original  spectral  estima¬ 
tion  problem  reduces  to  spectral  estimation  problem 
with  gaps  in  spectrum  [10].  The  spectral  estimation 
with  desired  properties,  can  be  found  using  relation 
(7).  However,  to  find  the  solution  of  this  spectral  esti¬ 
mation  problem,  we  have  to  choose  the  parametrization 
function  e{z). 

The  algorithm  choosing  the  parametrization  func¬ 
tion  €{z)  to  construct  the  maximum  entropy  solution 
of  the  spectral  estimation  problem  with  a  priori  infor¬ 
mation  about  the  spectrum  can  be  found  in  [5,  10]. 

Let  us  now  explain  the  way  of  choosing  of  the  pa¬ 
rametrization  function  £(z)  to  get  the  linear  spectral 
estimator. 
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Obviously,  any  linear  spectral  estimate  can  be  rep¬ 
resented  as  a  weight  Fourier  transform  from  the  known 
correlation  coefficients  Ck 


N 


P(z)  =  25fte  ^  /OfcCfc;z^  z  =  e*® 


(8) 


fe=o 


where  pk  is  the  weight  coefHcients,  which  are  deter¬ 
mined  by  the  chosen  windowing  function.  In  the  par¬ 
ticular  case  Pit  =  0  for  all  fc  =  0,  1,  . . . ,  N  the  relation 

(8)  transforms  to  ordinary  Fourier  spectral  estimate. 

Prom  relations  (6)  and  (8)  we  have 


Biz) 


Mn{z)  _  i9 

zB2N{zy 


(9) 


where 


A2n{z)  =  ^PUz)Y.CkPkZ^  +  Q*Ni^) 

^  Jfe=0 


N 


and 


B2n{z)  =  —  Piv(^)  -Qn{z) 


N 


are  polynomials  of  power  2N.  The  coefficients  of  this 
polynomials  can  be  easily  expressed  through  the  initial 
correlation  coefficients  Ck  and  the  given  weights  pk  ■ 
From  relation  (9)  it  follows  that  the  parametrization 
function  e  is  determined  by  the  autocorrelation  coeffi¬ 
cients  Ck  and  windowing  function  in  unique  way.  Using 
varies  windowing  functions  one  can  get  any  known  lin¬ 
ear  spectral  estimate.  Thus,  relations  (7)  and  (9)  give 
us  a  description  of  a  class  of  linear  spectral  estimates 
by  means  of  linear  fractional  representation. 


Conclusion 

We  have  considered  a  novel  approach  to  the  spectral 
density  estimation  problem  based  on  the  moment  method 
technique.  Using  this  approach  all  possible  extrapola¬ 
tions  of  the  autocorrelation  function  which  are  in  agree¬ 
ment  with  a  set  of  known  values  were  found.  A  wide 
spectral  estimator’s  family  has  been  obtained  in  terms 
of  polynomials  orthogonal  with  respect  to  a  given  au¬ 
tocorrelation  sequence. 
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ABSTRACT 

This  tutorial-style  talk  will  focus  on  the  practical  side  of 
designing  EA  data  converters.  Topics  to  be  discussed 
include  an  introduction  to  EA  conversion,  a  compari¬ 
son  with  other  converter  architectures,  a  discussion  of 
the  choice  of  circuit  types,  the  design  of  circuit  blocks, 
board  level  considerations,  simulation  and  testing,  and 
debugging  of  these  converters. 

1.  INTRODUCTION  AND  MOTIVATION 

There  have  been  many  papers  published  on  EA  con¬ 
version,  most  of  which  have  focused  on  theory  and  ar¬ 
chitectural  design.  Some  papers  have  also  been  pub¬ 
lished  on  particular  IC  implementations  of  these  de¬ 
signs.  There  have  been  very  few  publications  dealing 
with  the  practical  implementation  of  these  converters, 
discussing  the  wide  variety  of  tradeoffs  involved.  This 
tutorial  attempts  to  fill  this  need  by  discussing  these 
converters  from  a  practical  point  of  view. 

2.  THEORY  OF  EA  CONVERSION 

In  this  section  of  the  tutorial,  the  theory  of  quanti¬ 
zation  and  noise  shaping  will  be  discussed,  followed  by 
time-  and  frequency- domain  views  of  these  loops.  Con¬ 
siderations  for  the  design  of  higher  order  loops  will  also 
be  presented. 

3.  EA  VS.  TRADITIONAL  CONVERSION 

This  section  of  the  tutorial  will  compare  EA  conver¬ 
sion  with  more  traditional  approaches  such  as  succes¬ 
sive  approximation  and  flash  conversion.  The  compar¬ 
ison  will  focus  on  the  issues  of  non-idealities  and  appli¬ 
cations,  and  will  include  a  demonstration  of  the  effects 
of  differential  non-linearity  in  an  audio  recording. 

This  work  was  completed  with  much  help  from  Robert  W. 
Adams  of  Analog  Devices. 


4.  EA  A/D  CONVERTER 
IMPLEMENTATION 

This  section  will  discuss  the  implementation  of  E  A  an- 
alog-to-digital  converters.  The  discussion  will  include: 

1.  Design  of  the  loop  filter. 

(a)  Continuous-  vs.  discrete-time  filtering. 

(b)  Breadboarding. 

(c)  Switched- capacitor  implementation. 

(d)  Differential  vs.  single-ended 
implementation. 

(e)  Noise  and  error  tolerance  in  the  loop. 

2.  Opamp  design  considerations  (with  examples). 

3.  Integrator  design  considerations. 

4.  One-bit  DAC  design. 

5.  Comparator  design. 

5.  INTERFACING  TO  THE 

EA  CONVERTER 

This  section  will  present  considerations  in  board-level 
design  around  a  EA  analog-to-digital  converter.  Dis¬ 
cussion  will  focus  on  driving  switched-capacitor  signal 
and  reference  inputs  and  avoiding  analog/ digital  inter¬ 
ference. 

6.  SIMULATION,  TESTING  AND 
DEBUGGING  A  EA  CONVERTER 

This  final  section  will  explore  the  tradeoffs  in  the  sim¬ 
ulation,  test,  and  IC  debug  phases  of  the  converter 
design.  Again,  the  focus  will  be  on  the  analog-to- 
digital  converter  with  obvious  extension  to  the  digital- 
to- analog  converter. 
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ACTIVE  GRIDS  FOR  QUASI-OPTICAL  POWER  COMBINING 

David  Rutledge^  Jung-Chih  Chiao,  Michael  DeLisio,  Jeff  Liu 
Department  of  Electrical  Engineering,  Caltech,  Pasadena,  CA  91125 


I.  Introduction 

Active  grids  are  periodic  structures  loaded  with  transis¬ 
tors  or  diodes  that  interact  with  electromagnetic  beams. 
These  new  quasi-optical  components  may  make  possi¬ 
ble  a  new  generation  of  low-cost,  high-power  solid-state 
millimeter- wave  communications,  broadcast,  and  radar 
systems.  Quasi-optical  power  combining  allows  the  out¬ 
put  powers  from  large  numbers  of  individual  transis¬ 
tors  and  diodes  to  be  combined  in  free  space  with¬ 
out  transmission-line  losses.  These  components  can  of¬ 
ten  be  made  as  planar  structures  that  are  suitable  for 
large-scale  monolithic  integration.  This  should  allow 
these  systems  to  have  low  cost.  Often  these  devices 
are  multi-mode  devices  that  work  with  beams  at  differ¬ 
ent  angles,  or  even  with  several  beams  simultaneously. 
This  should  make  it  possible  to  use  these  components 
in  electronically-scanned  systems.  The  devices  can  also 
tolerate  high  failure  rates — a  10%  transistor  failure  rate 
may  only  reduce  the  gain  by  1  dB.  This  could  give  ad¬ 
vantages  in  increasing  the  fabrication  yield  to  lower  the 
cost,  or  enabling  the  construction  of  ultra-reliable  sys¬ 
tems  for  space  applications,  or  allowing  a  device  to  sur¬ 
vive  partial  destruction  in  military  use.  These  circuits 
are  analyzed  by  equivalent  waveguide  circuits.  The  di¬ 
ameter  of  the  equivalent  waveguide  is  determined  by  the 
period  of  the  structure,  which  is  fixed  by  photolithog¬ 
raphy,  rather  than  by  fabrication  of  a  metal  waveguide. 
This  makes  it  possible  to  make  grids  for  the  terahertz 
frequency  range. 

A  variety  of  grids  have  been  demonstrated,  including 
detectors,  phase  shifters,  multipliers,  oscillators,  and 
more  recently,  amplifiers  and  switches.  In  these  grids, 
the  power  is  proportional  to  the  area,  while  the  circuit 
impedances  are  determined  by  the  dimensions  of  the 
unit  cell.  This  allows  great  design  flexibility.  One  can 
achieve  high  power  and  high  efficiency  simultaneously, 
or  large  dynamic  range  and  low  noise  at  the  same  time. 
I  will  discuss  results  for  several  recent  grids.  The  first  is 
a  a  doubler  with  an  output  power  of  330  //W  at  1  THz. 
The  second  is  a  10  x  10  pHEMT  hybrid  grid  amplifier 
with  a  3-dB  noise  figure  and  3.7-W  output  at  10  GHz. 
Finally  I  will  discuss  current  results  on  monolithic  grid 
amplifiers  for  millimeter  wavelengths. 


II.  Millimeter-waves  and  Quasi-optics 

This  is  an  exciting  time  for  millimeter- wave  systems. 
For  many  years  the  major  applications  have  been  in 
military  radar  and  radiometry.  Millimeter  waves  are  of 
special  interest  to  the  military,  because  millimeter- wave 
systems  offer  better  resolution  and  ground-clutter  rejec¬ 
tion  than  microwave  systems,  and  better  fog  and  smoke 
penetration  than  infrared  systems.  More  recently,  com¬ 
mercial  applications  are  becoming  important.  The  FCC 
is  considering  a  variety  of  allocations  up  to  140  GHz, 
and  at  28-GHz  television  distribution  systems  are  be¬ 
ing  established.  Japanese  companies  are  building  60- 
GHz  computer  networks.  In  Europe,  a  large  number  of 
millimeter- wave  communications  links  have  been  estab¬ 
lished  at  frequencies  between  31  GHz  and  65  GHz  [1], 
Here  in  the  United  States,  the  Millitech  Corporation 
has  made  excellent  progress  in  passive  imaging  systems 
for  detecting  concealed  weapons  and  for  landing  air¬ 
craft  in  the  fog  [2].  Several  corporations  are  also  mak¬ 
ing  car  radars  for  76  GHz.  At  the  same  time  as  these 
new  applications  have  appeared,  there  has  been  excel¬ 
lent  technology  development  in  monolithic  millimeter- 
wave  integrated  circuits,  much  of  it  sponsored  by  the 
MIMIC  program.  Monolithic  amplifiers  that  operate 
at  120  GHz  have  been  demonstrated  [3],  and  the  Lock¬ 
heed/Martin  Corporation  has  built  a  90-GHz  radiome¬ 
ter  on  a  chip  [4]. 

In  millimeter- wave  research,  quasi-optical  devices  have 
become  a  major  focus.  Quasi-optics  research  has  a  long 
history,  going  back  to  the  development  of  beam  waveg¬ 
uides  by  Goubau  and  Schwering  [5],  and  a  quasi-optical 
oscillator  by  Wandinger  and  Nalbandian  [6].  Most  of 
the  early  development  was  in  passive  devices  like  beam 
waveguides,  diplexers,  and  frequency  selective  surfaces. 
This  technology  has  reached  a  sophisticated  level,  and 
has  seen  applications  in  radio-astronomy  telescopes  and 
in  .NASA’s  Deep  Space  Network.  Over  the  last  ten 
years,  however,  active  quasi-optical  devices  have  devel¬ 
oped.  This  new  era  of  quasi-optics  has  attracted  the 
attention  of  the  microwave  community,  with  a  special 
issue  of  the  IEEE  Transactions  on  Microwave  Theory 
and  Techniques  [7],  and  regular  sessions  on  quasi-optics 
at  the  MTT  Symposium.  In  addition,  Robert  York  has 
written  an  excellent  review  paper  of  work  [8]. 
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Ill,  Oscillators 


James  Mink’s  1986  paper  on  quasi-op tical  power  com¬ 
bining  of  solid-state  millimeter- wave  sources  [9]  is  often 
taken  as  the  starting  point  for  modern  quasi-optical  de¬ 
vices.  Popovic  ei  ai  demonstrated  the  first  large-scale 
qiiasi-optical  oscillator  with  a  100-MESFET  oscillator 
grid  [10]  that  produced  a  half  watt  at  5  GHz.  Kim  ei  al 
also  demonstrated  a  monolithic  Ka-band  oscillator  grid, 
showing  that  these  oscillators  can  be  made  in  mono¬ 
lithic  form  [11].  However,  the  output  power  was  small, 
in  the  range  of  30  mW  at  35  GHz,  and  the  pattern  was 
poor,  indicating  that  the  oscillation  really  was  a  sub¬ 
strate  mode  oscillation  [12],  Hacker  ei  aL  recently  built 
a  grid  with  a  10-watt  output  at  10  GHz,  showing  that 
the  grid  approach  is  capable  of  substantial  power  (Fig¬ 
ure  1).  The  next  challenge  is  to  produce  watt-level  out¬ 
puts  from  a  monolithic  millimeter- wave  grid  oscillator. 


Figure  1.  Photograph  of  the  10- watt  grid  oscillator 
demonstrated  by  Hacker  ei  al  [13].  The  beam  propa¬ 
gates  normal  to  the  surface,  and  is  vertically  polarized. 
The  horizontal  leads  are  bias  connections,  and  the  ver¬ 
tical  leads  are  the  radiators, 

IV.  Multipliers 

The  first  grid  multiplier  was  demonstrated  by  Jou  ei 
ai  (Figure  2)  [14].  These  multiplier  grids  are  periodic 
structures  loaded  with  planar  Schottky  diodes.  These 
grids  may  contain  thousands  of  diodes  and  can  have 
substantial  pulsed  power.  A  grid  by  Liu  ei  ai  pro¬ 
duced  5W  at  99  GHz  [15].  This  approach  has  recently 
been  extended  to  the  terahertz  region  by  Chiao  ei  a/., 
who  demonstrated  a  pulsed  output  of  330  /iW  at  1  THz. 
However,  the  efficiency  of  the  terahertz  devices  is  only 
10“"^,  and  it  will  have  to  be  greatly  improved  before 
these  devices  are  usueful  for  continuous  operation. 


Figure  2.  The  grid  multiplier  [14].  The  fundamental 
frequency  wave  enters  on  the  left  as  a  beam,  passes 
through  a  filter,  and  is  incident  on  a  diode  grid.  The 
grid  acts  as  a  non-linear  surface,  and  produces  a  beam 
at  a  harmonic  frequency  that  passes  through  filters  on 
the  right. 

V.  Amplifiers 

In  1991,  Moonil  Kim  ei  ai  demonstrated  the  first  grid 
amplifier  [17].  The  grid  amplifier  is  an  active  quasi- 
optical  device  that  amplifies  a  beam  as  it  passes  through 
it  (Figure  3).  The  grid  had  a  gain  of  11  dB  at  3.3  GHz. 
Our  experience  has  been  that  it  helps  in  several  ways  to 
have  the  input  and  output  beams  cross-polarized.  It  is 
easier  to  stabilize  the  grid  against  oscillations.  In  addi¬ 
tion,  the  polarizers  can  tune  the  input  and  output  cir¬ 
cuits  independently.  We  have  also  found  that  gain  mea¬ 
surements  are  easier  if  the  input  and  output  beams  are 
cross-polarized,  because  interference  is  reduced.  This 
first  grid  was  small,  with  only  25  differential-pairs  el¬ 
ements,  and  the  wiring  arrangement  was  complicated, 
with  radiating  elements  on  one  side  of  the  board,  bias 
lines  on  the  other,  and  connecting  feedthroughs.  DeLi- 
sio  ei  ai  recently  extended  the  original  grid  amplifier 
idea  to  a  100-element  X-band  HEMT  amplifier.  They 
reported  11  dB  of  gain  at  9  GHz,  3-dB  noise  figure,  and 
3.7  W  of  output  power  [18].  Moreover,  it  was  found  that 
a  grid  amplifier  can  amplify  beams  at  angles  up  to  ±30^, 
so  that  it  could  be  used  in  an  electronic  beam-steering 
system. 
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Figure  3.  A  grid  amplifier  [18],  In  the  figure,  hori¬ 
zontally  polarized  power  is  incident  from  the  left,  and 
passes  through  the  polarizer.  The  grid  amplifier  am¬ 
plifies  the  beam  and  radiates  it  as  vertically  polarized 
power,  which  passes  through  the  output  polarizer  on 
the  right. 


Amir  Mortazawi’s  group  has  built  a  spatial  amplifier 
based  on  the  concept  of  extended  resonance,  with  re¬ 
ceiving  and  transmitting  antennas  on  both  sides  of  the 
printed  circuit  board  [19].  Professor  Mortazawi’s  design 
is  particularly  flexible  with  respect  to  input  and  output 
polarization,  Zoya  Popovic’s  group  has  demonstrated  a 
spatial  amplifier  with  patch  antennas  on  two  sides  of  a 
printed-circuit  board  [20].  This  approach  is  particularly 
flexible  in  the  phasing  of  the  radiating  elements,  so  that 
the  beam  can  be  focused.  Robert  York’s  group  has  built 
a  spatial  amplfier  with  folded  slots  [8].  An  excellent 
feature  of  Professor  York’s  grid  is  that  the  connecting 
transmission  line  is  coplan ar  waveguide,  and  this  allows 
MIMIC  amplifiers  to  be  integrated  conveniently. 

VI.  Future  work — monolithic  grid  amplifiers 


Figure  4.  Photograph  of  a  36-element  monolithic  grid 
amplifier  compared  with  a  dime  [21]. 
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The  goal  of  much  current  work  is  to  demonstrate  mono¬ 
lithic  millimeter- wave  quasi-optical  amplifiers.  Liu  ei 
ai  reported  5dB  gain  at  41  GHz  from  a  6  x  6  HBT 
amplifier  (Figure  4)  [21],  Research  in  monolithic  quasi- 
optical  amplifiers  is  just  beginning,  and  the  field  is  wide 
open,  with  several  competing  groups.  Heat  dissipation, 
high-power,  and  efficiency  are  major  challenges. 
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1.0  Introduction 

Transferred  Electron  Oscillators  (TEOs)  are  com¬ 
monly  used  for  local  oscillator  (LO)  power  in  Het¬ 
erodyne  receivers.  These  sources  can  be  used  to 
directly  pump  mixer  circuits  up  to  160  GHz  or  to 
drive  varactor  frequency  multipliers  for  applications 
that  require  a  higher  frequency  LO.  The  advantages 
of  TEOs  over  other  sources  include  low  noise  oper¬ 
ation,  moderate  power  (10-50  mW)  up  to  at  least 
160  GHz,  broad  band  operation,  relatively  low  cost, 
high  reliability,  and  easy  integration  with  the 
receiver  system.  The  disadvantage  of  these  oscilla¬ 
tors  is  the  mechanical  tuning  for  the  frequency  and 
power  backshorts.  The  TEOs  described  in  this  pre¬ 
sentation  are  all  second-harmonic  oscillators  to 
insure  reduced  load  pulling  and  lower  noise  than 
fundamental  TEOs.  Also,  Efficient  power  genera¬ 
tion  from  fundamental  current  oscillations  is  diffi¬ 
cult  to  achieve  at  frequencies  above  100  GHz  with 
short  InP  TEDs.  This  report  will  focus  on  the  theo¬ 
retical  and  experimental  results  for  a  continuously 
tunable  115-145  GHz  (20-50  mW)  low  noise 
source,  and  some  preliminary  theoretical  and  exper¬ 
imental  results  at  160  GHz. 

2.0  Results  and  Discussion 

2.1  Experimental  Results 

The  115-145  GHz  second-harmonic  TEO  is  similar 
to  that  described  in  reference  1.  Half-height  wR-8 
wave  guide  perpendicularly  intersects  a  similar 
diameter  coaxial  cavity.  There  is  an  adjustable  back- 
short  at  one  end  of  the  WR-8  waveguide.  The  pack¬ 
aged  Transferred  Electron  Device  [2]  is  screwed 
into  the  waveguide  floor  and  serves  as  the  bottom 
wall  of  the  coaxial  cavity.  Bias  voltage  is  supplied  to 
the  diode  through  the  coaxial  cavity  central  conduc¬ 
tor.  A  typical  bias  choke,  center  pin,  and  disc  reso¬ 
nator  connects  to  the  top  of  the  diode  and  serves  as 
the  cavity  central  conductor.  The  bias  choke  forms 
the  top  wall  of  the  cavity.  The  coaxial  cavity  length 
is  adjusted  by  mechanically  sliding  the  bias  choke 
up  or  down  over  the  pin.  By  adjusting  the  coaxial 
frequency  backshort  the  cavity  length  is  varied  over 
50  mils  and  the  oscillation  frequency  is  smoothly 


and  continuously  adjusted  from  115-145  GHz.  At 
the  same  time,  the  waveguide  backshort  is  adjusted 
to  optimize  the  output  power.  20-50  mW  of  output 
power  is  achieved  over  the  band,  and  the  cavity 
operation  is  extremely  reliable  and  reproducible. 
The  cavity  length  essentially  controls  the  frequency 
of  oscillation  at  the  fundamental.  Since  the  funda¬ 
mental  (57.5-72.5  GHz)  is  cutoff  by  the  output 
waveguide  and  the  waveguide  back  short  section, 
the  fundamental  is  essentially  reactively  terminated. 
This  leads  to  a  strong  fundamental  voltage  oscilla¬ 
tion  that  is  decoupled  from  the  output  waveguide 
and  the  waveguide  backshort.  The  strong  non-lin¬ 
earity  of  the  diode  combined  with  the  large  funda¬ 
mental  voltage  oscillation  creates  efficient  second 
harmonic  power.  The  diode  impedance  at  the  second 
harmonic  is  tuned  using  the  waveguide  backshort 
and  power  is  coupled  to  the  output  section.  Tuning 
the  second  harmonic  impedance  for  peak  power  has 
little  effect  on  the  fundamental  impedance  and  oper¬ 
ating  frequency. 

2.2  Numerical  Simulations 

Detailed  simulations  of  the  cavity  have  been  com¬ 
pleted  using  HP’s  High  Frequency  Structure  Simu¬ 
lator.  The  entire  oscillator  cavity  including  the 
package  have  been  simulated  to  extract  the  imped¬ 
ance  seen  by  the  diode  at  the  fundamental  and  sec¬ 
ond  harmonic  frequencies  (55-145  Ghz).  These 
results  are  used  in  our  harmonic  balance/numerical 
device-simulation  code  [3]  to  fully  simulate  the 
oscillator  behavior.  A  detailed  understanding  of  the 
oscillator  operation  and  the  tuning  characteristics 
has  resulted  from  these  advanced  simulation  and 
will  be  explained. 
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1.  INTRODUCTION 

Recently  there  has  been  a  resurgence  of  interest  in 
bolometers  as  heterodyne  mixers  at  submillimeter 
wavelengths.  This  is  due  primarily  to  two  new  and 
innovative  concepts  [1,2]  which  result  in  bolometers  with 
response  times  fast  enough  to  allow  for  intermediate 
frequencies  (IF)  of  1  GHz  -  10  GHz,  as  well  as  low  mixer 
noise  temperatures.  These  IFs  are  high  enough  for  practical 
spectroscopy  applications  and  thus  these  bolometers  need  to 
be  seriously  considered  as  heterodyne  sensors.  In  this  paper 
I  will  review  briefly  review  the  basics  of  bolometer  mixers. 
Then  an  overview  will  be  given  of  the  basic  operation  of  the 
new  high  speed  bolometers,  along  with  a  few  recent  results 
which  demonstrate  the  performance.  Finally,  the  role  these 
sensors  may  be  expected  to  fulfill  will  be  discussed.  I  should 
emphasize  that  this  paper  is  meant  to  provide  a  brief 
introduction  to  these  new  bolometer  mixers,  and  reference 
will  be  made  to  the  recent  literature  for  the  interested  reader 
who  wishes  to  delve  more  deeply  into  the  details.  This  is  not 
a  review  article,  but  rather  an  overview  of  recent 
developments,  so  no  attempt  is  made  to  give  a  complete 
listing  of  results  and  publications. 


2.  BOLOMETER  MIXERS 

Bolometers  have  been  used  occasionally  as  heterodyne 
mixers  primarily  because  of  the  advantages  of  high 
frequency  operation  (bolometers  can  be  operated  at 
millimeter  through  submillimeter  wavelengths)  as  well  as 
high  sensitivity  with  near-quantum  limited  noise. 
Additionally,  bolometers  are  simple  square-law  or  total- 
power  detectors.  There  is  no  instantaneous  response  at  the  rf 
as  with  an  electronic  mixer,  such  as  a  Schottky  diode  or  SIS 
tunnel  junction.  There  is  also  no  harmonic  response.  The 
principle  disadvantage  of  the  bolometer  mixer  is  the  slow 
thermal  response  time.  This  limits  the  IF  to  low  values, 
usually  of  order  MHz.  In  general  this  is  too  low  to  be  useful 
for  many  remote-sensing  applications  involving  molecular 
line  spectroscopy  such  as  radioastronomy,  atmospheric 
chemistry,  and  planetary  science. 

The  limitation  placed  on  the  IF  by  the  thermal  response 
time  can  be  understood  by  considering  the  basic  operation 
of  a  bolometer  mixer.  The  two  basic  elements  of  a 


bolometer  are  shown  in  figure  1.  There  is  an  element  which 
absorbs  the  incident  rf  power;  typically  an  absorbing  film. 
The  absorber  has  a  thermal  heat  capacity  C,  and  as  it 
absorbs  power  its  temperature  T  increases.  The  absorber  is 
connected  to  a  thermal  bath  temperature  Tbath  by  a  thermal 
conductance  G.  The  thermal  response  time  is  given  by  = 
C  /  G,  and  represents  the  characteristic  time  over  which  the 
temperature  of  the  bolometer  can  change  for  a  sudden 
change  in  incident  rf  power. 


Pin 


Absorber 


Fig.  1:  Basic  elements  of  a  bolometer.  C  is  the  heat  capacity  of  the 
absorber,  typically  a  thin  film.  G  is  the  thermal  conductance  to  the  bath 
temperature.  Pjnc  is  the  incident  rf  power. 

To  operate  the  bolometer  in  a  heterodyne  mode,  a  local 
oscillator  voltage,  Vlq,  at  frequency  COlo  and  a  signal 
voltage,  Vj,  at  frequency  COj  are  applied  to  the  absorber, 
which  for  simplicity  we  take  to  be  a  thin  film  of  resistance 
R„.  The  resulting  dissipated  power  is: 

P(t)  =  Plo  +Ps  +  2(PloPs)''^  cos(C0ip  t)  (1) 

where  co,p  =  ®lo  -  ®s  is  the  IF  frequency,  and  Plo  =  (^lo^  ! 
2R„)  and  P3  =  /  2R„)  are  the  LO  power  and  signal  power 

respectively  dissipated  in  the  bolometer.  The  bolometer  is 
not  fast  enough  to  follow  the  rf,  so  the  power  dissipated  at 
these  frequencies  is  the  time  averaged  value.  However,  if 
the  IF  is  low  enough,  the  bolometer  can  follow  this 
variation,  so  there  can  be  a  time  dependent  term  at  this 
frequency.  The  Voltage  Responsivity,  S,  of  the  bolometer  [3] 
gives  the  change  in  voltage  across  the  bolometer  for  a 
change  in  absorbed  power,  and  hence  can  be  used  to 
estimate  the  IF  voltage  amplitude: 
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Vjp  =  S*2(PloPs)‘'^  (2) 

The  responsivity  is  given  by: 

S  =  KdR  /  dT)  /  [G*(l  +  cOiF^  (3) 

where  I  is  the  bias  current  through  the  resistive  film,  and 
dR/dT  is  the  derivitive  of  film  resistance  with  temperature. 
Thus  for  low  enough  IF,  that  is  cOjf^  Xih^<  1»  the  bolometer 
can  follow  the  IF  power  swing;  but  for  cOif^  1,  the  IF 
voltage  will  decrease  and  hence  the  conversion  efficiency  of 
the  mixer  will  decrease.  As  mentioned  above,  for  many 
conventional  bolometers  is  large  enough  that  the  IF  is 
limited  to  undesirably  low  values.  However,  the  two  new 
bolometers  discussed  below  address  this  issue. 


3.  NEW  APPROACHES 

Recently,  two  new  approaches  [1,2]  have  been 
proposed  for  bolometers  with  very  short  thermal  response 
times  and  hence  a  high  IF  rolloff  frequency.  Both 
bolometers  utilize  the  resistive  transition  in  a 
superconductive  thin  film  which  results  in  a  large  dR/dT  and 
hence  high  responsivity.  What  is  new  however,  is  that  these 
devices  make  use  of  very  thin  films:  about  10  nm  for  Nb  or 
NbN.  Such  thin  films  have  a  very  high  scattering  rate  due  to 
surface  effects  and  hence  a  short  electron  mean  free  path  £, 

which  is  about  1  nm  -  10  nm.  In  these  films,  it  is  found  that 
the  electron-electron  interaction  is  enhanced,  resulting  in  a 
short  electron-electron  interaction  time:  TegOc  and  the 

electron-phonon  interaction  is  weakened:  [2],  Hence 

the  electrons  can  reach  thermal  equilibrium  at  a  temerature 
different  from  the  lattice  temperature.  Thus  when  absorbing 
rf  power,  the  electrons  can  warm  up  relative  to  the  lattice 
temperature.  The  electrical  resistance  in  the  film  depends  on 
the  electron  temperature  and  such  a  device  is  known  as  a 
hot-electron  bolometer.  Since  only  the  electrons  are  heated, 
the  heat  capacity  C  can  be  very  small,  especially  for  a 
submicron-sized  device.  In  addition,  the  hot-electron 
bolometer  mixers  discussed  below  employ  novel 
mechanisms  for  cooling  the  electrons  which  results  in  a  high 
thermal  conductance  and  hence  an  overall  short  thermal 
relaxation  time. 

Before  describing  the  details  of  the  bolometers,  it  is 
useful  to  list  the  important  advantages  of  these  novel 
devices: 

1)  The  thermal  response  time  is  very  fast:  «  lO's  ps.  Thus 
IPs  of  1  GHz  -  10  GHz  can  be  achieved. 

2)  These  bolometer  mixers  should  operate  well  to  very 
high  frequency:  several  THz.  There  is  no  energy  gap 
limitation  as  in  an  SIS  mixer.  In  fact,  rf  power  is 
absorbed  more  uniformly  above  the  energy  gap 
frequency. 


3)  The  mixer  noise  temperature  is  very  low:  near-quantum 
limited. 

4)  Very  low  LO  power  is  required:  nW’s  -  |iW's.  This  is 
comparable  to  the  requirements  for  SIS  mixers  and  is 
an  important  issue  at  high  submillimeter  wave 
frequencies  where  LO  power  is  difficult  to  generate. 

5)  The  rf  impedance  of  the  device  is  essentially  resistive 
and  is  determined  by  the  geometry  of  the  film;  that  is, 
the  number  of  squares  in  a  small  strip.  Typical  values 
range  from  20  O  to  200  Q.  This  greatly  simplifies  the  rf 
circuit  design.  Unlike  a  Schottky  diode  or  SIS  tunnel 
junction,  there  are  no  parasitic  reactances  to  tune  out. 
The  real  rf  resistance  of  the  bolometer  should  be 
independent  of  frequency  from  about  the  energy  gap 
frequency  up  to  a  frequency  corresponding  to  the 
inverse  electron-electron  elastic  scattering  time  (about 

10“^^  sec)  which  is  approximately  160  THz  [1]. 

6)  In  addition,  these  devices  make  use  of  existing 
materials  and  fabrication  techniques:  Nb,  NbN,  YBCO; 
micron-scale  photolithography,  and/or  submicron  E- 
beam  lithography. 


4.  DIFFUSION-COOLED  HOT-ELECTRON 
BOLOMETER  MIXER 

Figure  2  shows  the  basic  geometry  of  this  bolometer 
mixer  which  was  proposed  by  D.  Prober  in  1993  [1].  The 
unique  feature  of  this  device  is  that  it  uses  the  rapid 
diffusion  of  hot  electrons  out  of  a  submicron  length  strip  (or 
microbridge)  of  superconductor  into  normal  metal  contacts 
as  the  cooling  mechanism,  or  thermal  conductance.  In  order 
for  diffusion  to  dominate,  over  electron-phonon  interactions, 
as  the  cooling  mechanism,  it  is  necessary  for  the 
microbridge  to  be  short.  The  appropriate  length  L  can  be 
estimated  [1]  from  the  expression: 

L  =  (4) 

where  D  is  the  diffusion  constant,  and  is  the  electron- 
electron  inelastic  ("energy  sharing")  interaction  time. 
Basically  when  an  electron  absorbs  energy  from  an  rf 
photon,  it  shares  its  energy  in  a  time  x^^  and  also  diffuses  a 
distance  L/2.  A  hot  electron  in  the  middle  of  the  bridge  can 
thus  go  L/2  left  or  right.  At  that  point,  it  encounters  the 
normal  metal  contact  which  serves  as  a  heat  sink.  These 
pads  at  the  end  of  the  microbridge  must  be  normal  metal 
since  Andreev  reflection  [4]  at  the  energy  gap  in  a  thick 
superconducting  film  would  trap  the  hot-electrons  inside  the 
microbridge  and  substantially  slow  the  response  of  the 
device.  The  electron-electron  interaction  time  can  be 
estimated  from  [5]: 

(5) 
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where  is  the  surface  resistance  and  is  the 
superconductive  transition  temperature  of  the  thin  film.  For 
the  very  thin  ( dirty  limit )  film  used  here,  is  larger  and 
is  smaller  than  the  bulk  values.  For  a  10  nm  Nb  film  for 
example,  ~  20  Q/sq,  ~  5  K,  and  the  width  of  the 
transition  is  AT^.  ~  0.5  K.  Substituting  these  values  into 
eqn  (4)  and  eqn  (5),  yields  a  length  L  «  0.2  \im.  The 
microbridge  can  be  somewhat  longer  (with  a  corresponding 
increase  in  but  should  be  less  than  2(DXep)^^^  where  T^p  is 
the  electron-phonon  interaction  interaction  time.  For  this 
length  and  longer  the  electrons  will  remain  in  the  bridge 
long  enough  to  produce  phonons  which  can  alter  the  mode 
of  operation  (see  section  5). 


Fig.  2:  Basic  geometry  of  the  diffusion-cooled  hot-electron  bolometer 
mixer. 

The  thermal  response  time  can  be  calculated  from  the 
usual  expression: 

Tth  =  C  /  G  (6) 

The  thermal  capacitance  is  given  by  the  electron  specific 
heat  [6]: 

C  =  yTV  (7) 

where  y  =  700  J/K^w?  for  Nb,  T  is  the  electron  temperature 
(which  is  always  about  T^  in  a  transition  edge  device),  and 
V  is  the  device  volume.  The  thermal  conductance  G  is  given 
by  the  Wiedemann-Franz  Law  [6],  This  law  states  that  the 
ratio  of  thermal  conductance  to  electrical  conductance  is 
proportional  to  temperature,  if  the  electrons  carry  both  the 
electrical  current  and  the  thermal  current,  which  is  the  case 
for  the  diffusion-cooled  microbridge.  Thus: 

G  =  (ji^/3)(kB/qe)"(T/Reff)  (8) 

where  kg  is  Boltzman's  constant,  q^  is  the  charge  on  the 
electron,  T  is  the  temperature  («  T^)  and  R^ff  is  the  effective 
electrical  resistance  of  the  microbridge:  ~R^/  12  [1].  R„ 

is  the  DC  resistance  of  the  microbridge  for  T  just  above  T^. 
The  factor  1/12  arises  because  heat  flows  symmetrically  out 
both  ends  of  the  microbridge. 


For  an  actual  bridge  [7]  with  dimensions  width  = 
0.14  |Ltm,  length  =  0.28  |Lim,  thickness  =  10  nm  and 
Rn  =  30  Q,  the  estimated  response  time  is  30  psec.  The  3  dB 
IF  rolloff  frequency  for  the  mixer  conversion  efficiency  is 
then  given  by: 

IFr<,;to#=  (2*7fTu,)-’  (9) 

which  is  about  5  GHz  in  this  case.  However,  due  to  self¬ 
heating  effects  [3]  (  and  to  a  lesser  extent  by  IF  impedance 
mismatch  effects  [8] )  the  thermal  conductance  is  reduced  to 
a  lower  effective  value: 

Geff  =  G(l-A)  (10) 

where  typically  0.1  <  A  <  0.9  for  thermally  stable  operation. 
Thus  the  value  of  IProiio/f  can  actually  be  in  the  range  of 
about  2-5  GHz,  which  allows  for  IF's  commonly  used  in 
heterodyne  instruments. 

Now  we  will  estimate  the  mixer  conversion  efficiency 
q,  the  double-sideband  receiver  noise  temperature  Tr(DSB), 
and  the  LO  power  Plo-  The  conversion  efficiency  for  a 
bolometer  mixer  was  originally  worked  out  in  detail  by 
Arams  et  al  [9]  in  1966  and  recently  discussed  in  connection 
with  these  new  hot-electron  bolometers  [8].  For 
convenience,  we  will  follow  the  simplified  approach  given 
by  Prober  [1].  The  conversion  can  be  estimated  using  the 
voltage  responsivity  of  the  bolometer  and  the  IF  power 
amplitude.  The  result  is: 

11  =  P,p  /  Prf  =  [  r  (dR/dT)'  /  GM  •  [  Plo  /  2R  ]  (11) 

(  note:  this  expression  is  valid  for  (OiFTth  «  1  ).  For  equal 
DC  power  (  P^c  =  I^R)  and  LO  power  dissipated  in  the 
bolometer,  q(SSB)=  1/8  =  -9  dB  (  note  that  q  >  0  dB  may 
be  possible  under  some  conditions  [8] )  It  can  be  shown  that 
the  conversion  efficiency  in  this  case  is  independent  of  L, 
the  length  of  the  microbridge  [1],  However,  the  response 
time  Tth  Ihus  IF^^//^j^oc  1/L^.  Hence,  as  expected  for  a 
device  that  relies  on  diffusion  out  the  ends,  shorter  is  better 
for  high  speed.  However,  there  is  no  sacrifice  in  conversion 
efficiency.  While  decreasing  L  will  increase  G  and  hence 
reduce  the  bolometric  responsivity,  in  the  heterodyne  mode 
this  can  be  compensated  by  increasing  the  LO  power.  Thus 
it  is  possible  to  have  a  bolometer  mixer  which  is  both  fast 
and  sensitive. 

The  best  optimization  of  LO  and  DC  power  is  a  topic  of 
current  discussion  [1,10].  In  practice,  the  bolometer  is 
thermally  biased  at  a  temperature  somewhat  below  T^,  then 
a  combination  of  DC  and  LO  power  is  used  to  heat  the 
electrons  up  to  T^,  Prober  [1]  suggests  setting  «  T^- 
ATJ2  (that  is,  just  below  the  transition  width)  and  then 
applying  enough  LO  to  heat  the  electrons  up  by  ATc/4  along 
with  an  equal  amount  of  DC  power.  The  LO  power  can  then 
be  readily  approximated  by  Plq  ~  ^T^  G  /  4  which  is  4  nW 
for  a  50  Q  microbridge  with  AT^  =  0.5  K.  Karasik  and 
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Elantev  [10]  however  suggest  that  the  bath  temperature 
should  be  made  low  and  then  the  LO  power  should  heat  the 
electrons  from  Tbaui  up  to  Tc-ATc/2  and  the  DC  power  should 
then  raise  the  temperature  by  ATc/2  in  order  to  reach  T^. 
This  implies  an  LO  power  of  -  70  nW  for  the  same 
microbridge  with  Tbath=2.5  K  and  T  =5  K.  Since  both  the 
conversion  efficiency  and  mixer  noise  improve  with 
increasing  LO  power  [10],  the  higher  value  may  be 
desirable.  In  either  case,  the  LO  power  requirement  is 
extremely  low;  equal  or  less  than  that  required  for  a 
submillimeter  wave  SIS  mixer. 

It  is  currently  accepted  that  the  main  contributions  to 
the  noise  in  a  bolometer  mixer  are  due  to  Johnson  noise  and 
electron  temperature  fluctuation  noise  [11].  The  latter  being 
dominant.  The  Johnson  noise  temperature  is,  of  course,  T^ 
and  arises  from  the  film  resistance  at  the  transition 
temperature.  The  electron  temperature  fluctuation  noise 
arises  from  the  thermodynamic  energy  fluctuations  in  an 
electron  gas  at  an  average  thermodynamic  temperature  T^. 
The  RMS  electron  temperature  fluctuation  can  be  expressed 
as  [8]: 

ATe  =  4kBT^/G  (12) 

This  yields  a  DSB  mixer  input  noise  temperarture  of 
Tc^  G/Plo  according  to  Karasik  and  Elantev  [10].  For  the 
LO  power  level  considered  by  Prober,  this  reduces  to 
TcVATc.  The  resulting  DSB  receiver  noise  temperature  can 
be  expressed  as  [1]: 

Tr  (DSB)  =  [  ( AT,)  +  T,  +  T.P  ]  /  T1  (13) 

where  T,f  is  the  IF  amplifier  system  noise  temperature.  Thus 
for  Te  =  5  K,  ATe  =  0.5  K,  Tjp  =  4  K,  and  r|(DSB)  =  -7  dB 
(ie:  6  dB  intrinsic  mixer  conversion  loss  and  1  dB  of 
receiver  optical  path  loss),  we  get  Tr(DSB)  =  295  K. 
However,  reducing  Tc  just  1  K  ( a  slightly  thinner  film ) 
yields  Tr(DSB)  =  200  K.  This  performance  becomes 
competitive  with  current  state-of-the-art  SIS  receivers  near 
frequencies  of  500  GHz  -  600  GHz.  However,  for  the 
bolometer  mixer  there  is  no  inherent  frequency  dependence 
of  the  performance  above  the  energy  gap  frequency  (except 
of  course  for  the  linear  frequency  dependence  set  by  the 
quantum  limit  [12]  ).  The  same  noise  temperature,  200  K, 
should  be  possible  at  0.5  THz  or  several  THz  (mechanisms 
which  may  ultimately  limit  the  high  frequency  performance 
will  be  briefly  discussed  below). 

To  date,  the  diffusion-cooled  hot-electron  bolometer 
mixer  has  been  rapidly  developed  by  the  Jet  Propulsion 
Laboratory  in  collaboration  with  Yale  University  [7,13,14] 
(no  other  groups  have  yet  reported  results  on  this  new 
device).  A  Nb  device  with  dimensions  0.15  |im  wide, 
0.28  |Xm  long,  and  10  nm  thick  was  tested  in  a  waveguide 
niixer  mount  in  a  receiver  at  an  LO  frequency  of  530  GHz 
( see  references  7  and  14  for  a  complete  discussion  of  these 
results  ).  This  LO  frequency  is  well  above  the  gap  frequency 


of  ~  400  GHz  for  this  thin  Nb  film  with  T^^53  K  and 
ATc~0.5-l  K.  The  DSB  receiver  noise  temperature  is  650  K 
at  an  IF  of  1.4  GHz.  The  estimated  DSB  mixer  noise 
temperature  is  560  K  and  mixer  conversion  efficiency  is 
about  -11  dB.  The  calculated  receiver  noise  temperature  is 
570  K  -  750  K  (depending  on  the  exact  choice  of  AT^.  from 
the  R-T  curve  and  using  rj  =  -11  dB),  and  similarly  the 
predicted  mixer  conversion  efficiency  is  about  -9  dB  [15]. 
Thus  the  calculated  values  agree  well  with  experiment.  The 
IF  rolloff  frequency  was  measured  to  be  about  2  GHz, 
which  represents  superior  performence  for  a  low-noise 
bolometer  mixer.  This  performance  is  comparable  to  the 
more  mature  submillimeter  wave  SIS  receivers.  It  is  thus 
clear  from  these  results  that  this  device  works  well  as  a 
submillimeter  wave  heterodyne  mixer. 


5.  ELECTRON-PHONON  COOLED 
HOT-ELECTRON  BOLOMETER  MIXER 

Figure  3  shows  the  basic  geometry  of  this  bolometer 
which  was  proposed  by  E.  Gershenzon,  et  al  in  1990  [2]. 
The  basic  operation  of  this  bolometer  can  be  understood 
from  fig.  3(b).  An  incident  rf  photon  imparts  its  energy  to  an 
electron  in  the  film.  In  a  short  time  this  electron  shares  its 
energy  with  other  electrons.  The  cumulative  effect  of 
absorbed  rf  power  and  the  energy  sharing  process  (ie:  the 
enhanced  electron-electron  interaction)  in  these  ultra-thin 
films  is  to  create  a  hot-electron  distribution.  Then  in  a  time 
Tgp  a  hot  electron  creates  a  phonon  which  then  escapes 
ballistically,  for  a  sufficiently  thin  film,  to  the  substrate  in  a 
short  time  Trs.  In  order  for  the  hot-electron  bolometric 
mechanism  to  proceed  in  this  manner,  there  are  some 
constraints  which  must  be  met.  First  t^e  «  lo  allow  the 
electrons  to  heat  up  from  absorbed  power.  This  is  usually 
satisfied  for  a  thin  (dirty  limit)  film  at  low  temperatures 
T<10K  (Tee  ~  lO'^^sec  to  10'^^  sec)  [16].  Next  it  is 
important  for  Tes  «  Tpg,  where  Tpe  is  the  phonon-electron 
interaction  time  .  That  is,  the  phonons  must  escape  to  the 
substrate  before  they  interact  back  with  other  electrons.  This 
requires  that  the  film  be  very  thin  since[2,16,17]: 

TEs  =  4dCe/(vaCph)  (14) 

where  d  is  the  film  thickness,  and  Cph  are  the  electron  and 
phonon  specific  heats  respectively,  v  is  the  velocity  of 
sound,  and  a  is  the  coefficient  of  transmission  of  a  phonon 
through  the  film-substrate  interface.  The  thermal  response 
time  of  the  bolometer  can  then  be  written  as: 

tu.  = -tep  +  Tes  (15) 

Thus  the  limiting  speed  of  this  type  of  bolometer  is  set  by 
Tgp  ( this  limit  requires  Tes  «  Tgp  and  thus  the  films  should 
be  thin,  the  phonon  transmission  at  the  substrate  interface 
high,  and  the  thermal  conductivity  of  the  substrate  should 
also  be  high  so  that  it  remains  a  constant  temperature  bath). 
For  a  Nb  bolometer,  Tgp «  1  ns  and  hence  IP roihff-  160  MHz, 
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which  is  higher  than  most  bolometer  mixers  but  still  too  low 
for  many  practical  applications.  For  NbN  operated  at  7  K  - 
8  K,  Tep  «  15  ps  and  hence  IFroUoff-  10  GHz.  This  is  of 
course  high  enough  to  be  of  practical  value. 


Nb 


(b) 

Fig.  3:  Basic  geometry  of  electron -phonon  cooled  bolometer  mixer,  (a)  Top 
view  showing  geometry  of  microbridge,  (b)  Cross  sectional  view  showing 
schematically  the  electron-phonon  interactions  described  in  the  text. 


The  LO  power  requirements  can  be  estimated  in  the 
same  way  as  discussed  in  the  previous  section  except  G  is 
now  given  by  the  electron-phonon  thermal  conductance 
( G  =  4xlO^»T^»V  for  thin  Nb  films  [2],  for  example).  It 
should  be  noted  that  G  in  this  case  depends  on  the  volume  V 
of  the  microbridge  and  thus  it  is  advantageous  to  keep  the 
volume  small.  The  width  of  the  microbridge  should  be  less 
than  about  1  |im  to  avoid  a  backflow  of  phonons  from  the 
substrate  [2].  Then  for  a  given  film  surface  resistance  Rg  the 
length  is  chosen  to  give  the  appropriate  resistance  for  an  rf 
match  to  the  mixer  embedding  circuit  (waveguide,  planar 
antenna,  etc...).  A  typical  size  is  1  [im  x  5  jim.  The  LO 
power  then  required  for  a  Nb  bolometer  mixer  is  about  SO¬ 
TO  nW  (  T,  =  4  K  and  ATe  =  0.5  K  );  and  for  NbN,  Plo  « 
0.5-1  |xW  ( Tj,  =  7X  and  AT^.  =  0.5  K ).  These  are  extremely 
low  and  therefore  very  desirable  for  submillimeter 
operation. 


The  mixer  noise  temperature  and  conversion  efficiency 
are  calculated  in  the  same  way  as  previously  discussed. 
However,  the  more  detailed  analysis  in  [8]  suggests  that 
under  certain  conditions,  conversion  efficiency  greater  than 
unity  is  possible.  Also,  as  mentioned  above,  the  exact  choice 
of  DC  and  LO  power  to  optimize  the  mixer  is  still  being 
theoretically  analyzed. 

Recent  results  have  shown  that  the  electron-phonon 
cooled  bolometer  mixer  performs  very  well.  Measurements 
on  a  Nb  bolometer  mixer  [8]  at  an  LO  frequency  of  20  GHz 
gave  conversion  efficiencies  between  -1  dB  and  -7  dB,  an  IF 
rolloff  of  80-100  MHz,  and  an  LO  power  of  about  40  nW. 
These  results  are  in  close  agreement  with  expected  values. 
In  addition,  an  NbN  bolometer  mixer  operated  in  a 
waveguide  mount  at  100  GHz  [18]  has  demonstrated  an  IF 
rolloff  as  high  as  1.5-2  GHz,  and  a  DSB  receiver  noise 
temperature  of  450  K.  This  is  extremely  good  performance, 
and  is  suitable  for  practical  low-background  spectroscopy 
applications. 

Also,  high-Tc  yttrium-barium-copper  oxide  (YBCO) 
bolometers  have  been  investigated  [19-21]  at  wavelengths 
of  0.8  \im  (375  THz),  1.56  \xm  (192  THz),  and  10.6  \im 
(28  THz).  The  goal  of  these  investigations  was  to 
demonstrate  the  high  speed  hot-electron  response  and  the 
heterodyne  mixing  process  (no  attempt  was  made  to 
produce  a  fully  optimized  mixer  at  such  high  frequencies,  so 
the  estimated  conversion  efficiency  was  low).  Heterodyne 
mixing  was  clearly  demonstrated  in  thin  YBCO  films  with 
an  IF  up  to  18  GHz  (the  limit  of  the  measurement  system). 
Since  these  bolometers  operate  at  80-90  K,  the  phonons  play 
a  more  significant  role  which  changes  the  device  physics  as 
compared  to  the  low-Tc  case  (see  references  10,  18-20  for  a 
more  detailed  discussion). 


6.  DISCUSSION 

The  recent  innovations  in  transition-edge  hot-electron 
bolometers  (micron  and  submicron  sized,  thin,  dirty  films) 
have  led  to  ultra-fast,  sensitive  devices  which  are 
competitive  as  heterodyne  mixers.  Recent  measurements 
have  proven  the  concepts  and  even  shown  competitive 
performance.  However,  the  most  complete  mixer 
measurement  to  date  have  been  mainly  below  1  THz 
( except  for  the  YBCO  bolometer  mixer  tests  )  where  there 
already  exist  state-of-the-art  SIS  receivers  with  very  low 
noise.  These  new  bolometer  mixers  will  play  an  important 
role  at  frequencies  well  above  1  THz  where  SIS  and 
Schottky  receivers  become  either  extremely  difficult  or 
impossible  (ie:  above  the  energy  gap  frequency  of  NbN  for 
example)  to  operate.  In  addition,  the  high-Tc  bolometer  will 
be  useful  in  applications  where  sensitivity  can  be  traded  off 
against  cooling  requirements,  as  in  a  space-based  mission. 

Important  development  issues  must  be  addressed  for 
each  bolometer.  The  diffusion-cooled  bolometer  will  need 
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improved  submicron  definition  and  alignment  of  the  normal 
metal  contacts.  Also,  low-resistance  normal-to- 
superconductor  contacts  are  required.  For  the  electron- 
phonon  cooled  bolometer,  the  phonon  reflection  at  the  film 
substrate  interface  must  be  minimized.  This  will  be 
particularly  important  for  materials  like  NbN  (which  have  a 
high  upper  limit  to  the  IF  rolloff)  which  can  react  strongly 
with  common  rf  substrates  such  as  quartz,  thereby 
producing  a  poor  interface.  In  addition,  both  types  of 
bolometers  are  resistive  at  the  rf,  so  a  very  broadband 
match,  to  a  planar  antenna  for  example,  should  be  easily 
achieved.  The  rf  bandwidth  can  however  be  so  broad  that 
the  mixer  will  be  easily  saturated.  Unlike  the  common 
situation  for  an  electronic  mixer  with  reactive  parasitics 
where  the  goal  is  to  achieve  a  broadband  rf  match,  these 
bolometer  mixers  will  probably  require  bandpass  filters  at 
high  submillimeter  wave  frequencies. 

Finally,  since  the  likely  role  for  these  bolometer  mixers 
will  be  at  very  high  frequencies,  it  is  important  to  test  as 
soon  as  possible  the  prediction  that  the  performance  is 
independent  of  frequency.  Certain  mechanisms  may  affect 
the  high  frequency  performance.  A  high  energy  rf  photon 
(several  THz)  will  produce  a  high  energy  electron  in  the 
superconductive  film.  This  hot  electron  can  then  either  share 
its  energy  with  other  electrons,  or  break  Cooper  pairs,  or 
produce  hot  phonons.  In  any  case,  the  film  will  absorb  rf 
power,  but  if  the  energy  escapes,  for  example  by  the  hot 
phonons  rapidly  leaving  the  film,  before  it  is  shared  with  the 
electron  gas,  then  the  electrons  will  not  heat  as  efficiently 
and  the  sensitivity  of  the  mixer  will  decrease.  However, 
investigations  of  phonon  processes  in  thin  Nb  films  [22] 
suggest  this  will  not  be  the  case.  The  expectation  is  that  the 
performance  will  not  degrade  until  at  least  several  THz. 
These  detectors  should  thus  have  a  significant  impact  on  the 
field  of  THz  heterodyne  sensors. 
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Abstract 

Millimeter-wave  (MMW)  mixers  employing  Schottky 
diode  (SD)  devices  are  the  only  mixers  implemented,  to 
date,  for  spacebome  superheterodyne  radiometry 
applications.  Spacebome  missions  impose  high  technic^ 
standards  on  such  radiometers,  lending  special 
significance  to  MMW  mixers  employed  for  such 
applications.  This  paper  reviews  milestones  in  the 
developments  of  SDs  and  circuit  technologies  for 
spacebome  MMW  mixers.  The  paper  also  suggests 
possible  and  desired  future  developments  necessary  to 
maintain  and  extend  the  dominance  of  mixers  with  SDs 
even  to  sub-millimeter-wave  (SMMW)  frequencies. 

1*0  Introduction 

Spacebome  observations,  of  the  Earth*s  atmosphere  and 
its  surface,  can  be  performed  with  radiometers  employing 
superheterodyne  receivers.  Such  radiometers  require 
sensitivity  often  achievable,  as  shown  in  Figure  1,  with 
*'room-temperature”  mixers  employing  SDs  at  MMW  and 
SMMW  frequencies  [1]. 

Space-to-Earth  MMW  radiometry  missions  -  typically 
unmanned,  often  of  extended  duration  (several  years), 
and  always  costly  -  have  successfully  driven  the 
development  of  high  quality,  robust  SD  devices  and 
mixer  circuits.  Evidence  of  this  success  are  the  mixers  in 
dozens  of  different  MMW  radiometer  instmments[2], 
listed  in  Figure  2,  some  of  which  have  already  been 
launched  into  space  and  several  others,  even  at  higher 
SMMW  frequencies,  to  be  launched  in  the  future.  Figure 
2  also  illustrates  the  mixer  technology  as  a  ladder  with 
device  and  circuit  technologies  comprising  its  legs. 
Combinations  of  mixer  device  and  circuit  milestone 
accomplishments  yield  the  steps  of  this  ladder  -  climbing 
to  ever  higher  frequencies  with  improved  sensitivity, 
increased  instantaneous  bandwidth,  improved  reliability, 
and  reduced  complexity  and  costs  -  all  through  the 
MMW  and  the  SMMW  spectra. 

Up-to-date  developments  of  SD  devices  and  mixer 
circuits  are  reviewed  separately  in  the  following  two 


sections.  The  concluding  section  attempts  to  predict  the 
likely  and  desired  future  developments  in  devices  and 
circuits  for  MMW/SMMW  mixers. 

2,0  Schottky  Diode  -  Mixer  Device  Milestones 

The  recent  half  century  of  evolution  in  devices  for  MMW 
mixers,  have  led  to  improvements  in  the  intrinsic  non¬ 
linear  resistor  device  required  for  mixer  frequency 
convertors.  Simultaneously,  device  improvements  have 
optimized/minimized  its  packaging  elements,  which  are 
neither  a  part  of  the  intrinsic  non-linear  resistor  nor  an 
intended  part  of  the  mixer  circuit. 

Crystal,  i.e.  solid-state  semiconductor,  diodes  emerged  in 
the  40's  as  superior  substitutes  to  vacuum  tubes  for 
mixers.  In  the  late  50's,  GaAs  had  replaced  Si  and  Ge 
crystals  yielding  superior  high  frequency  characteristics 
[3],[4].  Early  mixer  crystal  devices  had  employed 
sharply  pointed  whisker  wires  to  form  a  point-contact 
diode,  at  the  whisker/crystal  contact,  and  to  couple  this 
diode  with  the  mixer  circuit.  After  establishing  the 
whisker-semiconductor  pressure  contact,  a  forward  bias 
to  the  diode  generated  sufficient  heat  to  weld  the  whisker 
and  the  semiconductor,  de  facto,  forming  a  metal- 
semiconductor  Schqtdcydiqd^unction  [5], 

The  progress  in  vacuum  metal  deposition  and 
photolithography  technologies,  in  the  60’s,  facilitated  the 
production  of  semiconductor  chips  populated  with  small 
(--2pm)  discs  of  metal  deposits  -  a  group  of  SD  junction 
anodes  in  a  ”honevcomb”  layout,  sharing  one  large  ohmic 
contact  cathode  [6].  Each  such  preformed  SD  yields  a 
mixer  device  with  quality,  predictability,  and 
repeatability  consistent  with  space  application  standards. 
However,  the  SD  still  requires,  for  coupling  to  the  mixer 
circuit,  a  mechanically  problematic  and  costly  whisker 
pressure  contact.  For  many  years,  SD  “honeycomb" 
chips  maintained  only  a  180°  (lateral)  anode/cathode 
orientation,  compatible  with  machined  coax/waveguide 
MMW  mixers. 

In  the  70’s,  device  manufacturers  provided  SDs  whisker 
precontacted  in  “picopills”  to  simplify  the  assembly  of  up 
to  100  GHz  machined  mixers. 
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Also  in  the  70’s,  device  technology  matched  new  MMW 
mixer  printed-circuits  with  SDs  compatible  with  planar 
circuits:  whisker  contacted  “honeycombs”  were  modified 
to  include  MMW  notch-front  SDs  [7]  with  90° 
(orthogonal)  anode/cathode  orientation,  which  improved 
the  reliability  of  SD-circuit  whisker  contact  on  planar 
dielectric  substrates;  and,  planar  beam-lead  SDs  [8] 
altogether  eliminated  the  need  for  whisker  contacts  for 
printed  circuit  mixers  below  150  GHz. 

In  the  80*s,  beam-lead  SDs  evolved  [9]  into  planar  air¬ 
bridge  finger,  whiskerless  SDs,  successfully  challenging 
the  dominance  of  whisker  contacted  MMW  mixers  even 
above  150  Ghz  [10]. 

In  the  90*s,  planar  whiskerless  SDs  with  sub-micron 
diameters  promise  to  further  reduce  the  role  of  whisker 
contacted  SDs,  even  for  SMMW  mixers  [11]. 

3.0  Mixer  Configuration  -  Mixer  Circuit  Milestones 

Mixer  circuits  embody  the  linear  circuit  residency  for 
non-linear  mixer  devices.  A  mixer  circuit  provides 
filtering/matching  interfaces  for  widely  separated 
frequencies:  at  the  highest  frequencies,  the  mixer  circuit 
interfaces  between  the  mixer  device  and  external  RF  and 
local-oscillator  (LO)  input  sources;  while  at  much  lower 
frequencies,  a  mixer  circuit  interfaces  between  the  mixer 
device  and  an  external  output  IF  load.  Such  circuit 
functions  have  governed  developments  in  mixer  circuit 
technology  for  more  than  fifty  years. 

One  diode,  single-ended  mixers,  with  fundamental 
frequency  pumping  LOs,  have  traditionally  employed 
high-Q  directional  cavities  to  channel  the  RF  and  the  LO 
to  the  diode,  and  to  suppress  the  LO  noise.  Balanced 
mixers  employing  two  identical  diodes,  were  introduced 
in  the  mid-40's  [12],  requiring  balanced/unbalanced 
(balun)  circuits  for  feeding  the  RF  and  the  LO  differently. 
The  balun  enables  RF/LO  isolation  by  virtue  of 
orthogonality,  and  also  the  suppression  of  LO  amplitude- 
modulated  (AM)  noise  at  frequencies  of  the  RF. 
Fundamentally  pumped,  up  to  190  GHz  [13],  balanced 
mixers  were  implemented  with  whisker  contacted  diodes. 
The  same  paper  [12]  also  describes  all  sixteen  possible 
mixers  with  two  diodes,  some  of  which  feature  high 
sensitivity  [14]  and  efficient  subharmonic  pumping. 
Such  MMW  mixers  [15]  have  bridged  the  gap  created  by 
a  lagging  MMW  LO  technology,  and  an  inefficient  and 
costly  MMW  frequency  multipliers  technology.  For 
example,  subharmonic  x2  MMW  mixers,  requiring  LOs 
at  frequencies  half  that  of  the  RF,  were  implemented  in 
the  80’s  for  MMW  spacebome  applications  [16].  They 
achieved  wider  instantaneous  bandwidth  relative  to 
harmonic  single-ended  mixers  [17],  while  requiring 
similar  low  frequency  LOs  as  the  latter. 


Diverse  MMW  mixer  circuit  configurations  have  been 
implemented  in  a  variety  of  transmission-line  media.  The 
first  single-ended  MMW  mixers  consisted  of  machined 
coaxial  and  waveguide  filters.  Early  balanced  mixers 
included  a  four-port  waveguide  magic-T  with  E  and  H 
ports  as  RF  and  LO  inputs.  The  two  remaining  ports 
were  terminated  with  coax/waveguide  single  ended 
mixers  with  their  IF  outputs  feeding  a  coaxial  power 
combiner. 

Printed  circuit  mixers  proliferated  in  the  60's  and  the 
70's,  yielding  attractive  planar  compact  balanced  mixers: 
stripline  and  microstrip  versions  of  3-dB  couplers  and 
rate-race  circuits  replaced  the  waveguide  magic-Ts; 
ultimately,  the  junction  between  a  slotline  and  coplanar 
line  yielded  a  wideband  planar  magic-T  [18],  variations 
of  which  are  commonly  used  in  monolithic  mixers;  and 
also  inspired  a  particularly  successful  version  of  a 
machined  strip  guide  MMW  balanced  mixers  [19]  with 
“picopiir  SDs.  Planar  versions  of  MMW  mixers  were 
transformed  into  MMW  monolithic  integrated  circuit 
(MMIC)  mixers  [20]  eliminating,  altogether,  the  circuit 
device  mixer  interconnects,  and  poising  for  penetration 
into  SMMW  frequencies. 

4.0  Predictions  and  Conclusions 

Mixer  technology  will  continue  to  penetrate  new  and 
higher  frequencies,  and  to  improve  mixer  performance. 
However,  GaAs  device  high  frequency  characteristics  and 
circuit  miniaturization  capabilities  are  approaching  their 
limits  at  SMMW  frequencies.  At  the  same  time 
oscillator  technology  continues  to  lag  in  meeting 
MMW/SMMW  LO  requirements.  Hence,  it  is  critical  for 
future  progress  to  correctly  identify,  in  advance,  agents 
and  patterns  which  will  most  likely  succeed  in  carrying- 
out  future  changes  in  MMW/SMMW  mixer  technology. 
This  review,  which  has  thus  far  outlined  portraits  for 
agents  and  patterns  of  past  changes  in  mixer  technology, 
can  also  serve  as  an  effective  reference  for  such 
predictions. 

Semiconductor  submicron  processes  and  materials  with 
faster  characteristics,  e.g.  InP,  fiirther  developments  of 
MMW/SMMW  subharmonic  mixers,  and  advances  in 
MMIC  technology  are  agents  also  likely  to  remain  active 
in  shaping  future  mixer  technology. 

MMK^s  and  whiskerless  SDs  are  technologies  which  can 
simplify  the  mixer  assembly  process,  thereby,  rendering 
practical  MMW/SMMW  mixer  configurations  otherwise 
considered  too  complex  for  implementation. 

Whiskerless  planar  SDs  have  already  rendered  practical 
MMW  subharmonic  x2  mixers  with  two  SDs  in  anti¬ 
parallel,  and  are  considered  for  600  GHz  space  missions 
[21]  to  operate  with  a  frequency  doubler  and  available 
150  GHz  LO.  Subharmonic  x4  600  GHz  mixers  [22] 
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eliminate  the  requirement  for  frequency  doublers.  Since 
mixer  performance  degrades  with  higher  order  LO 
subharmonic.  Subharmonic  x3  [23],  with  two  SDs 
connected  in  series,  promises  new  and  superior  mixers  up 
to  ~1  THz,  requiring  only  a  single  doubler  and  available 
LOs. 

MMICs  have  eliminated  bond  wire  interconnects  between 
various  mixer  elements,  enhancing  reliability  and 
facilitating  improved  sensitivity  and  bandwidth. 
However,  conventional  MMW/SMMW  MMICs  yield 

relatively  high  effective  8r  lossy  circuit  media  and 
support  undesired  modes.  New  variations  of  MMICs 
integrated  with  thin  (~l-5p.m)  membranes  [24], 
particularly  in  suspended  configurations,  yield  lower 

effective  8r  and  suppress  undesired  modes  through 
higher  and  wider  frequency  bands.  The  continuing  trend 
depicted  in  Figure  2  -  toward  MMW/SMMW  multi¬ 
channel,  wide  bandwidth,  spacebome  radiometers  -  is 
driven  by  cost,  packaging,  and  performance 
considerations  favoring  advanced  MMIC  technology. 
This  technology  promises  to  render  practical  complex 
mixer  configurations,  like  image  rejection  subharmonic 
mixers  [25]  and  two  SD  subharmonic  comer-cube  mixers 
[26].  Such  complex  and  advanced  mixer  configurations 
are  important  for  meeting  the  future  challenges  awaiting 
MMW/SMMW  mixer  technology. 
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Figure  1:  Possible  SSB  noise-temperature  and  achievable  sensitivity,  for 
practical  MMW  radiometers  and  mixers  with  Schottky  diodes. 

(Tif  =  lOOK;  Tm  =  Mixer  Nouse-Temperature;  Lm  =  Mixer  Conversion-Loss; 
B  =  Predetection  Bandwidth;  t  =  Integration  Time) 
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Figure  2:  Radiometers  launched  and  to  be  launched  into  orbit,  and 
their  implemented  MMW  mixer  technology. 
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Abstract 

Electronic  circuits  and  systems  which  can  operate 
at  temperatures  up  to  250°C  or  even  higher 
become  more  and  more  important.  It  can  be 
shown  that  Si  as  semiconductor  material  can  be 
used  up  to  this  temperature  provided  that  circuit 
structures  are  optimized  to  handle  the  parasitic 
effects  rising  with  temperature.  For  even  higher 
temperatures  up  to  350°C  GaAs  can  be  used. 

Other  problems  going  along  with  high 
temperature  applications  arise  from  material 
characteristics  as  melting  point,  mechanical 
stability,  insulating  characteristics  and  migration 
problems. 

This  paper  reports  on  a  German  Joint  Research 
project  dealing  with  the  problems  mentioned 
above.  Some  of  them  could  be  solved  satisfactory, 
some  problems  are  still  existing. 

1.  Introduction 

Some  years  ago  High  Temperature  Resistant 
Electronic  Circuits  were  a  problem  of  some 
scientists  and  engineers  for  special  sophisticated 
applications.  But  during  the  last  years  the 
situation  has  completely  changed.  The  area  of 
applications  for  high  temperature  electronic 
circuits  and  systems  has  dramatically  increased. 
For  example  there  is  a  great  demand  on  high 
temperature  electronics  in  the  fields  of 
automotive-,  air-  and  spacebome  applications  and 
especially  in  powerplants  and  harsh  environment 
applications.  In  all  these  fields  the  main 
requirements  are  -  additional  to  the  Jiigh 


temperature  conditions  -  that  the  components  and 
the  circuits  have  to  be  both  reliable  and  cheap. 


To  get  the  optimal  solution  for  all  fields  of 
application  it  is  necessary  to  know  their  different 
temperature  ranges.  Fig.  1  gives  an  overview  of 
these  applications  as  a  function  of  temperature.  In 
the  same  diagram  we  plot  the  intrinsic  carrier 
concentration  for  different  semiconductor 
materials  versus  temperature,  which  is  a  measure 
for  leakage  currents  and  other  parasitic  effects  to 
be  compensated  by  special  circuit  structures. 
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Figure  1:  Temperature  ranges  for  industrial 
applications. 

If  we  consider  that  we  can  build  circuits  with  a 
reliable  operation  up  to  an  intrinsic  carrier 

•I  c  "1 

concentration  of  10  /cm  ,  we  see  that  it  should 
be  possible  to  use  silicon  (bulk  and  SOI)  as 
semiconductor  material  up  to  250°C  and  GaAs  up 
to  350°C.  This  means  that  it  should  be  possible  to 
cover  the  main  application  areas  of  car 
electronics,  powerplant  electronics,  air-  and 
space  applications  and  the  main  part  of 
petrochemistry. 
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2.  The  German  Joint  Research  Project  for 
High  Temperature  Electronics 

In  summer  1992  a  German  Joint  Research  Project 
for  High  Temperature  Electronics  was  established 
using  only  silicon  and  gallium  arsenide  as 
semiconductor  materials.  Silicon  carbide  and 
diamond  were  not  taken  into  further  consideration 
for  until  now  they  do  not  fulfill  the  two  limiting 
conditions  mentioned  above:  to  be  reliable  and 
cheap. 

In  this  project  10  partners  (4  companies,  2 
Fraunhofer  Institutes,  3  University  Institutes  and 
1  Goverment  Research  Institute)  came  together  to 
cooperate  xmder  the  title  „Microsystem 
Components  for  High  Temperature  Appli- 
cations“.  The  total  financial  volume  is  24  Million 
DM,  sponsored  by  the  German  Ministry  of 
Research  and  Technology.  So  this  project  gives  a 
large  variety  of  results  some  of  which  will  be 
presented  in  this  conference. 


Fig.  3  shows  in  comparison  the  transfer  functions 
of  a  Bulk  and  a  SOI  N-MOSFET  at  a  temperature 
of  300°C.  It  can  be  seen  that  the  SIMOX-type  is 
still  in  its  active  area  while  the  bulk-type  is  out  of 
control.  Some  other  papers  of  this  conference  will 
report  on  operational  amplifiers  which  were  built 
in  Bulk  and  SIMOX  technologies. 


Figure  3:  Comparison  of  Bulk-  and  SIMOX- 
Transistors. 


3.  Some  results  achieved  until  now 

I  will  now  report  on  some  of  the  results  which 
were  achieved  in  this  joint  project.  For  silicon 
investigations  were  made  in  Bulk  technology  as 
well  as  in  SOI  technology  (SIMOX  and  BESOI). 
The  principles  of  these  technologies  are  shown  in 
Fig.  2.  As  a  result  of  the  reduced  leakage  areas  by 
application  of  SOI  technologies  a  reasonable 
improvement  of  the  high  temperature  behavior 
could  be  achieved. 


Bulk  Silicon  On  insulator  (SOI) 

(up  to  250  “C)  (up  to  300  X) 


SIMOX  BESOI 

Separation  by  Implanted  Oxygen  Back  Etched  Silicon  On  Insulator 


implanted 


silicon  substrat  | 


Figure  2:  Silicon  Technologies 


For  even  higher  temperature  applications  GaAs 
should  be  used  because  of  its  wider  band  gap. 
Fig.  4  shows  the  DC  output  characteristic  of  a 
GaAs-MES-FET  in  the  temperature  range  of 
-196°C  up  to  +500°C.  It  can  be  seen  that  the 
transistor  can  be  used  over  this  large  range 
although  there  is  a  reasonable  reduction  in  gain. 
As  the  main  field  of  applications  for  GaAs 
components  is  in  the  high  frequency  domain 
Fig.  5  shows  the  S -parameters  at  different 
temperatures.  The  first  demonstrator  will  be  a 
FMCW-Radar  system  for  high  temperature 
application.  It  can  be  used  for  high  temperature 
level  measurement  in  chemical  process  control. 

Regarding  the  complete  problem  of  high 
temperature  electronics  one  easily  finds  out  that 
the  semiconductor  problem  is  only  one  of  several 
others  of  equal  importance.  Fig.  6  shows  that  for 
example  many  materials  used  for  passive 
components  and  devices  as  well  as  for 
metallisation  and  mounting  and  bonding 
technology  in  the  normal  temperature  range 
cannot  any  longer  be  applied:  Plastics  as 
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dielectric  and  insulating  material  melt.  Normal 
solders  melt,  glas  becomes  conductive,  flexible 
boards  are  distroyed  and  most  of  the  materials 
corrode.  Also  material  migration  cannot  be 
neglected. 
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0  - ► 


Figure  6:  Characteristic  temperature  ranges  for  some 
materials. 

This  means  that  for  example  the  usual  AlSiCu  for 
chip  metallisation  has  to  be  replaced  for  example 
by  W  or  Ti  and  the  current  density  has  to  be 
reduced.  All  this  requires  more  space  on  chip  and 
so  reduces  package  density.  For  interconnection 
of  different  devices  thick  film  and  thin  film 
technologies  have  been  proved  to  be  reliable. 
Film  resistors  have  a  good  temperature  behavior 
and  a  good  tracking,  especially  when  being 
overglazed.  Fig.  7  shows  the  long  term  behavior 
of  a  family  of  thick  film  resistors  at  a  temperature 
of300°C. 


Figure  7:  Irreversible  resistance  drift  at  300°C  up  to 
1,000  hours;  survey  of  the  HS80  thick  film  resistor 
family. 


Wire  bonding  becomes  a  migration  problem  when 
different  materials  have  to  be  connected.  Fig.  8 
shows  the  change  in  contact  resistance  at  different 
high  temperatures  over  a  period  of  thousand 
hours. 


1o'  10^  f/h  10® 


Figure  8:  Irreversible  contact  resistance  drift  up  to 
1,000  hours  for  different  temperatures;  AlSil  bond  on 
DuPont's  9910  conductor. 

Migration  can  be  reduced  by  applying  sandwich 
methods.  Nevertheless  pulltests  and  sheartests 
show  that  the  stability  of  the  connection  is 
reduced  to  more  than  1/2  by  temperature 
treatment  up  to  300°C.  Fig.  9  gives  an  example. 
All  these  effects  can  only  be  investigated  in  long 
term  testloops  of  several  thousand  hours  duration. 
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Figure  9:  Pull  strength  drift  after  300°C  aging  of  32  pm 
AlSil-wires  on  Au-thick-  and  thin  film  metallisation. 

This  list  of  problems  going  ahead  with  high 
temperature  applications  cannot  be  closed  without 
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mentioning  the  question  of  cheap  and  reliable 
housing  and  packaging  of  high  temperature 
devices.  The  well  known  Covar  and  Cofired 
Ceramic  Packages  are  reliable  but  so  expensive 
that  they  could  never  be  used  in  automotive  or 
similar  applications.  But  high  temperature 
resistant  cheap  plastic  packages  are  not  available 
until  now. 

In  conclusion  we  see  that  only  the  combination  of 
high  temperature  components  (active  and  passive 
ones)  as  well  as  a  high  temperature  reliable 
mounting  and  bonding  technology  and  last  but  not 
least  high  temperature  circuit  design  can  offer  the 
possibility  of  building  a  high  temperature 
electronic  system  (Fig.  10). 


Figure  10:  Activities  for  the  development  of  high- 
temperature  sensor-systems. 

Discussing  the  question  which  circuits  for  high 
temperature  application  are  needed  let  us  look  to 
a  conventional  control  loop  in  the  high 
temperature  environment  (Fig.  11). 

We  see  that  there  are  two  ways  to  correspond 
with  the  high  temperature  range  of  the  loop.  The 
one  is  the  sensor  way  to  get  information  from  the 
high  temperature  loop  and  the  other  one  is  the 
actor  way  to  send  control  signals  that  means  to 
send  information  to  the  high  temperature  area.  If 
you  want  to  do  so  with  a  minimum  of  information 
loss  in  both  directions  you  have  to  use  intelligent 
sensors  (smart  sensors)  and  intelligent  actors 
(smart  power  actors).  This  means  that  in  both 
ways  you  have  to  have  high  temperature  resistant 
electronics. 


Figure  11:  Process  Control  System. 


Coming  back  to  the  German  Research  Project  we 
here  limited  to  the  left  area  of  Fig.  11,  to  the 
sensor  area.  Fig.  12  shows  the  smart  sensors  and 
the  necessary  sensor  electronics  which  we  have 
chosen  as  demonstrators. 

4.  Conclusions 

The  design  of  optimal  circuits  provides  a 
coordinated  research  in  three  areas: 

•  Material  and  component  selection  including 
process  optimization  for  the  production  of 
active  and  passive  components, 

•  circuit  structure  optimization  to  tolerate  the 
high  temperature  depending  parasitic  effects 
and,  last  but  not  least, 

•  moimting  and  packaging  methods  for  reliable 
and  cheap  high  temperature  application. 

These  three  areas  of  research  and  development 
are  strongly  influencing  each  other. 

It  could  be  shown,  that  for  temperatures  up  to 
250°C  Silicon  in  Bulk  and  SOI  technologies  can 
be  used  as  a  semiconductor  material  and  circuit 
design  enables  the  production  of  most  of  the 
system  components  necessary  to  build  high 
temperature  resistant  smart  sensor  systems.  A 
similar  situation  is  given  for  the  use  of  GaAs  up 
to  temperatures  of  350°C  with  the  limitation  that 
GaAs  semiconductor  technologies  have  some 
lack  of  experience  compared  with  Silicon 
technologies. 
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-  temperature  sensors  -  electrometer  amplifier  -  analog-to-digitai 

converters 


-  mass  flow  sensors  -  differential  amplifier 

operational  amplifier  -  buffers  and 

-  level  sensors  drivers 

-  current-voltage  converter 

-  pressure  transducer 

and  acceleration  -  charge-sensitive  amp. 

pick-up 
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Figure  12:  High-temperature  sensor-system 
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ABSTRACT 

We  describe  new  heterodimensional  technology 
suitable  for  ultra  low  power  applications.  This 
technology  uses  Schottky  barrier  contacts  between 
three-dimensional  metal  and  two-dimensional  electron 
gas.  The  low  power  performance  is  due  to  a  small 
capacitance  of  the  2D-3D  junction,  the  concentration  of 
the  depletion  layer  electric  field  streamlines  in  the 
active  channel,  suppression  of  parasitic  resistance, 
small  leakage  current,  and,  most  of  all,  due  to  the  total 
elimination  of  the  narrow  channel  effect  which  allows 
us  to  scale  the  device  width  to  submicron  dimensions. 
We  present,  compare,  and  discuss  measured  and 
simulated  I-V  and  C-V  characteristics  for  the  2D-3D 
Schottky  diode,  2D  MESFET  and  Schottky  Gated  2D- 
3DRTT. 

1.  INTRODUCTION 

A  major  emphasis  in  IC  technology  over  the  last 
decade  has  been  to  increase  the  microprocessor  speed 
and  reduce  memory  access  times.  While  speed 
performance  continues  to  be  an  important  metric  for 
advances  in  ICs,  increasing  importance  is  given  to 
overall  energy  efficiency,  particularly  for  battery- 
powered  applications  such  as  wireless 
communications.  An  important  figure-of-merit  for  low 
power  electronics  is  the  power-delay  product.  The 
power-delay  product  of  a  FET  is  determined  by  the 
energy  (C+CpAV^  stored  by  the  gate  capacitor.  Here, 
C  is  the  gate-channel  capacitance,  Cp  is  the  parasitic 
capacitance  and  AV  is  the  gate  voltage  swing  required 
to  switch  between  on  and  off  states  (AV  is  proportional 
to  the  supply  voltage  V^.  Future  low  power  transistor 
technologies  therefore  must  achieve  the  lowest 
possible  reduction  of  both  the  switching  capacitance 
and  the  supply  voltage.  Direct  Coupled  FET  Logic 
(DCFL)  and  Complementary  Heterostructure  FET  (C- 
HFET)  technology  using  GaAs-based  FETs  are  among 
the  most  promising  alternatives  to  conventional  CMOS 
technology,  due  primarily  to  higher  carrier  mobilities  in 


Schottky 


Fig.  1.  Schematic  structure  of  the  heterodimensional 
Schottky  diode. 

GaAs  and  InGaAs  where  high  carrier  velocities  can  be 
achieved  at  lower  voltages.  While  significant 
performance  advances  are  being  made  by  utilizing 
faster  materials,  further  improvement  may  be  achieved 
by  optimizing  the  device  geometry  to  simultaneously 
reduce  power  consumption  and  increase  speed.  In  this 
paper,  we  review  heterodimensional  Schottky  devices 
in  which  the  Schottky  contact  is  between  the  3-d  metal 
and  the  2-d  electron  gas.  The  3-d/2-d  Schottky  diode 
as  well  as  two  new  field  effect  transistors  will  be 
discussed  and  recent  results  reported  on  each.  These 
novel  high  speed  devices  offer  unique  and  new 
approaches  to  the  design  of  advanced  semiconductor 
devices  and  circuits. 

2.  HETERODIMENSIONAL  SCHOTTKY  DIODE 

A  schematic  of  the  heterodimensional  3-d/2-d 
Schottky  diode  and  it’s  non-linear  capacitance-voltage 
characteristic  are  shown  in  Fig.  1.  The  Schottky 
contact  is  formed  by  plating  metal  into  a  trench  etched 
in  a  heterostructure  (e.g.  Al_25Ga_75As/In,2(pa,8oAs/ 
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Fig.  2.  Capacitance-voltage  characteristic  of  the 
100  |Xm  wide  heterodimensional  Schottky  diode. 

GaAs)  [1-3].  The  Schottky  barrier  results  in  a  depletion 
layer  within  the  2-dimensional  quantum  well.  An 
important  consequence  of  the  direct  contact  is  the  2- 
dimensional  (vertical)  spreading  of  the  electric  field  in 
the  depletion  region.  This  leads  to  a  stronger  voltage 
dependence  on  the  depletion  depth  and  to  a  larger 
breakdown  voltage  [4].  Also,  due  to  a  small  effective 
junction  area,  the  capacitance  is  very  low,  0.1  -  0.3  fF/ 
micron  width,  as  shown  in  Fig.  2  (for  comparison,  a 
state-of-the-art  0.2  micron  length  heterostructure  field 
effect  transistor  has  a  capacitance  of  about  1.0  fF/ 
micron).  Together  with  low  series  resistance  (0.8  QJ 
mm)  and  large  breakdown  voltage  (up  to  20  V),  the 
planar  3-d/2-d  Schottky  diode  should  be  ideal  for 
frequency  multiplier  applications  in  the  millimeter/ 
submilUmeter  wavelength  regions.  Tripler  efficiencies 
greater  than  1%  at  225  GHz  have  been  obtained  and 
much  higher  efficiencies  should  be  possible  with  better 
diode-to-circuit  impedance  matching  [5]. 

3.  SCHOTTKY-GATED  RESONANT 
TUNNELING  TRANSISTOR 

A  resonant  tunneling  transistor  utilizing  the 
heterodimensional  Schottky  gate  contacts  is  illustrated 
in  Fig.  3.  The  room  temperature  transistor  character¬ 
istics  are  shown  in  Fig.  4.  The  Schottky  gate  metal  was 
electroplated  onto  the  side  of  a  AlGaAs/GaAs  double 
barrier  heterostructure.  The  heterodimensional  gate 
modulates  the  drain  current  by  modulating  the  area  of 
the  quasi-2-dimensional  accumulation  layer  which 
forms  in  the  source-drain  resonant  tunneling  diode 
under  bias.  This  device  exhibited  record  room 
temperature  transconductance  of  218  mS/mm  with  a 
peak  current  of  225  mA/mm  [6].  The  cutoff  frequency 
was  estimated  to  be  18.5  GHz. 


Fig.  3.  Schematic  of  Schottky- gated  Resonant 
Tunneling  Transistor  ( SG-RTT). 


Drain-source  Voltage  (V) 


Fig.  4.  Drain  current  versus  drain-source  voltagi 
characteristics  of  SG-RTT  (300K). 

4. 2-D  MESFET  FOR  LOW  POWER 
ELECTRONICS 

Low  power  digital  ICs  use  device  widths  as  small 
as  3-5  micron  but  rarely  are  smaller  widths  used.  This 
is  due  to  parasitic  effects,  known  as  the  narrow  channel 
effect  (NCE),  related  to  the  poor  control  of  charges  at 
the  edges  of  the  channel  [7].  These  parasitics  cause 
poor  subthreshold  ideality  factor  (which  dictates  larger 
gate  voltage  swing  and  therefore  higher  power 
consumption)  and  parasitic  capacitances  which  limit 
switching  speed.  To  address  these  problems,  we  ate 
developing  a  new  transistor  technology,  based  on  the 
heterodimensional  Schottky  sidegate  technology, 
which  permits  width  scaling  to  sub-micron  dimensions 
without  degradation  due  to  the  NCE.  The  2-D  MEtal- 
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Fig.  6.  Drain  current  versus  drain-source  voltage 
characteristics  of  the  1.0  micron  wide  device. 


Fig.  7.  Drain  current  versus  drain-source  volta, 
characteristics  of  0.4  micron  wide  device. 
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Fig.  8.  Drain  current  versus  gate-source  voltage 
characteristics  of  0.4  micron  wide  device. 


Semiconductor  Field  Effect  Transistor  (2-D  MESFET) 
[2,8,9]  is  shown  schematically  in  Fig.  5.  The  gate 
electric  field  streamlines  are  focussed  onto  the  2-DEG 
and  pinch-off  occurs  when  the  two  depletion  regions 
merge  together.  Note  that  the  gate  metal  physically 
eliminates  the  side  channels,  thereby  eliminating  the 
origin  of  the  narrow  channel  effect.  The  two  gates  may 
be  independently  biased  (for  a  "fan-in"  of  two)  for 
greater  functionality  in  digital  applications. 

Both  enhancement  (E)  and  depletion  (D)-mode 
devices  have  been  fabricated.  The  measured  room 
temperature  Ij  -V(js  characteristics  of  a  1  micron  wide 
D-mode  2-D  MESFET  are  shown  in  Fig.  6.  The 
threshold  voltage  was  -1.0  V,  the  peak  drain  current  and 
transconductance  values  were  370  |J.A/|i,m  and  225  |a.S/ 
|J.m,  respectively  [9].  The  and  ^8  -  V  char¬ 

acteristics  of  a  0.5  micron  wide  E-mode  2-D  MESFET 


are  shown  in  Fig.  7  and  8,  respectively.  The  threshold 
voltage  was  0.3  V,  the  subthreshold  ideality  factor  was 
1.1,  the  output  conductance  was  less  than  1  mS/mm, 
the  ON/OFF  current  ratio  was  greater  than  10^  and 
there  is  virtually  no  Vj  shift  for  0  <  <  1  V  [10]. 

The  DCFL  2-D  MESFET  transfer  characteristics 
(using  E-mode  devices)  are  shown  in  Fig.  9  where  a 
noise  margin  of  0.26  V  was  obtained.  Using  accurate  2- 
D  MESFET  I  -  V  and  C  -  V  models  implemented  in 
AIM-SPICE  [II],  we  simulated  a  three-stage  ring 
oscillator  and  obtained  per  gate  power-delay  product  of 
less  than  0.1  fJ,  shown  in  Fig.  10,  which  is  about  two 
orders  of  magnitude  lower  than  that  for  competing 
technologies.  The  dramatic  reduction  in  power-delay 
product  is  due  to  the  fundamentally  superior  electrical 
properties  of  the  heterodimensional  Schottky 
technology  in  sub-micron  dimension  devices. 
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Fig.  9.  Measured  2-D  MESFET  DCFL  inverter 
transfer  characteristics  at  0.8  V  supply  voltage. 
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Fig.  10.  Simulated  2-D  MESFET  ring  oscillator 
has  much  lower  power-delay  product  compared 
with  existing  transistor  technologies. 

5.  SUMMARY 

We  have  described  the  operation  of  several  new 
devices  based  on  the  novel  heterodimensional  metal/2- 
d  electron  gas  junction.  The  Schottky  diode  has  low 
junction  capacitance  and  high  breakdown  voltage,  due 
to  2-dimensional  spreading  of  the  electric  field  in  the 
depletion  region.  The  Schottky-gated  Resonant 
Tunneling  Transistor  uses  the  heterodimensional  gate 
technology  and  achieved  a  record  room  temperature 
transconductance.  The  novel  2-D  MESFET  also  uses 
the  novel  gate  technology,  to  eliminate  the  narrow 
channel  effect  and  drain-inducted  barrier  lowering  in 
devices  scaled  to  sub-half  micron  channel  widths.  The 
2-D  MESFET  was  used  to  demonstrate  a  DCFL 
inverter  gate  having  a  noise  margin  of  0.26  V  at  0.8  V 
supply  voltage  and  voltage  gain  of  10.  Finally,  a  power- 
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delay  product  of  less  than  0.1  fJ  was  predicted  for  a  3- 
stage  2-D  MESFET  DCFL  ring  oscillator. 
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I.  Introduction 

For  years,  circuit  simulators  have  used 
idealistic  models  to  describe  the  device 
characteristics  ignoring  the  non-ideal  behaviors  such 
as  the  contribution  of  "warm”  and  "hot"  electrons  to 
the  gate  currents  in  Heterostructure  Field  Effect 
Transistors  (HFETs)[l].  These  non-ideal  behaviors 
may  affect  the  gate-swithching  characteristics 
introducing  faults  in  high  performance  circuits. 
The  purpose  of  this  paper  is  to  present  experimental 
gate-current  data,  over  a  wide  temperature  range, 
and  to  introduce  for  the  first  time  an  analytical 
model  that  describes  the  contribution  of  heated 
electrons  to  the  increase  in  gate  current  with  drain- 
to-source  voltage.  The  experimental  data  have  been 
obtained  at  temperatures  ranging  from  198  K  to 
450  K  from  Heterostructure  Insulated  Gate  Field 
Effect  Transistors  (HIGFETs)  fabricated  using 
Alo.TsGao  25As/Ino.2Gao.8As/GaAs  and 
Alo.48li^o.52As/Ino.53Ga()47As/InP.  This  regime  of 
operation  should  be  considered  in  the  design  of 
Heterostructure  FETs  intended  for  low  power,  high 
density,  applications  as  well  as  in  devices  operated 
in  the  Charge  Injection  Transistor  (CHINT-like) 
mode. 

n.  Experimental  Observations 

Figure  1  shows  the  experimental  gate 
current  characteristics  as  a  function  of  the  drain-to- 
source  voltage  for  a  1.4  x  10  pm^  HFET  at  room 
temperature.  Three  regions  of  importance  can  be 
identified  when  the  gate  current  is  observed  as  a 
function  of  drain-to-source  voltage.  The  first 
region  corresponds  to  low  where  the  electric 
field  parallel  to  the  heterointerface,  F^.,  is  linearly 
dependent  on  position,  and  the  electron  temperature, 
is  nearly  uniform  everywhere  in  the  channel. 


drain-to-source  voltage  (V) 


Figure  1  Gate  current  characteristics  as  a 
function  of  drain-to-source  voltage  for  a  1.4  x  10 
pm^. 


remaining  close  to  the  lattice  temperature.  In  this 
case,  the  gate  current  is  a  contribution  of  the  gate- 
to-source  and  the  gate-to-drain  current  components, 
with  the  gate-to-source  current  component  nearly 
independent  of  the  parallel  electric  field  in  the 
channel. 

The  experimental  data  suggest  that  at  low 
drain-to-source  voltages,  the  electron  contribution, 
from  the  drain  to  the  gate  current,  decreases  with  a 
reverse  in  the  drain-to-gate  potential.  A  further 
increase  in  drain  voltage  increases  the  electron 
energy  distribution  causing  the  2-D  electrons  to 
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accelerate  in  the  channel,  increasing  their  kinetic 
energy  and  therefore,  contributing  to  a  sharp 
increase  in  the  gate  current.  Finally,  the  gate 
current  saturates  at  high  drain-to-source  voltages. 
These  abrupt  changes  in  the  gate  current  also  affect 
the  drain-current  characteristics  of  the  device  and 
can  not  be  reproduced  with  current  device  models 
[2,3,4].  Similar  qualitative  behavior  is  oKserved  in 
both  GaAs-  and  InP-ba.sed  HFETs  over  a  wide 
temperature  range. 

Figure  2  shows  a  qualitative  description  of 
the  electric  field,  electron  temperature,  and  barrier 
potential  for  each  of  the  three  regions  described 
above.  At  very  high  drain-to-source  voltages,  the 
barrier  increases  near  the  drain,  shifting  the 
scattering  distribution  toward  the  middle  region  of 


Figure  2.  Qualitative  description  of  the  electron 
temperature,  electric  field,  and  the  potential 
barrier  under  the  gate  for  an  HFET  oprated  in 
the  hot  electron  regime. 


the  channel.  In  that  case,  the  contribution  of  hot 
electrons  to  the  gate  current  decreases  in  the  drain 
side  of  the  channel  but  is  maintained  in  the  source- 
side  and  at  the  mid  channel  region.  Therefore,  the 
contribution  of  "superheated"  electrons  decreases 
with  an  increase  in  the  drain-to-source  voltages  due 
to  the  increase  in  the  barrier  potential  [5]. 


Charge  Control  Model  for  FETs,  and  the  CHINT- 
like  theory  of  operation  and  real-space  electron 
transfer  in  which  the  saturation  of  the  gate  current 
at  high  drain-to-source  voltages  is  explained  based 
on  the  rapid  drop  in  the  effective  electron-energy 
relaxation  time  caused  by  the  real-space  transfer  of 
hot  electrons  [6,7,8].  In  our  model,  electrons  are 
accelerated  by  the  electric  field  parallel  to  the 
heterointerface.  During  the  acceleration  process, 
non-linear  electric  potential  and  electron 
temperature  distributions  are  established  in  the 
channel  area  under  the  gate.  These  distributions  are 
strongly  dependent  on  the  gate  and  drain-to-source 
voltages. 

For  drain-to-source  voltages  below  the 
saturation  voltage  <  V^),  the  electric  potential 
and  the  electric  field  distributions  are  expressed  as 
quadratic  and  linear  functions  of  position.  Then, 
the  electron  temperature  can  also  be  estimated  as  a 
function  of  position  and  is  given  as 

r,(x)  =  To  ^ 


where  is  the  lattice  temperature,  q  is  the  electron 
charge,  K^,  is  the  Boltzman  constant,  xjx)  is  the 
energy  relaxation  time  as  a  function  of  position, 
and  fjjx)  is  the  mobility  as  a  function  of  position). 

For  drain-to-source  voltages  above 
saturation  (y^,>  the  channel  is  divided  in  two 
separate  regions,  sub-critical  and  critical  regions. 
The  critical  region  under  the  gate,  is  where  most  of 
the  electron  transfer  occurs.  The  position  of  the 
critical  region  is  estimated  as 


^  12) 


where  is  the  saturation  voltage,  L,  is  the  gate 
length,  Fj  is  the  saturation  field,  and  X  is  a  fitting 
prameter  dependent  on  the  gate  length. 

In  the  subcritical  region,  the  potential  and 
electric  field  are  given  as 


ni.  Theoretical  Models 

In  order  to  understand  the  "hot"  eletron 
regime  of  operation  in  HFETs,  we  were  motivated 
to  develop  an  analytical  model  that  can  describe  the 
dependence  of  the  gate  current  on  the  drain-to- 
source  voltage.  This  model  is  based  in  both  the 
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where  x^,^,  is  estimated  from  equation  2,  and  7*,^  is 
the  total  drain  samration  current  given  as 
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where  is  the  gate  width,  and  Q  is  the  insulator 
capacitance. 

In  the  critical  region,  (for  x  >  x^,),  the 
potential  and  the  electric  field  are  given  as 

V{X)  =  + 


and 
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Fix)  =  ^  ' 
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where  V^,  is  the  gate-to-source  voltage,  V(x)  is  the 
potential  as  a  function  of  position,  K,,  is  the 
Boltzman  constant,  AE^  is  the  conduction  band 
discontinuity,  T/x)  is  the  electron  temperature  as  a 
function  of  position,  T„  is  the  lattice  temperature, 
and  T|  is  the  ideality  factor  of  the  potential  barrier. 

IV.  Re.sults 

The  re.sults  of  our  models  were  compared 
with  experimental  data.  Figure  1  shows  the  room 
temperature  gate  current  characteristics  as  a 
function  of  drain-to-source  voltage  for  a  1.4  x  10 
pm^  Al7jGa25A.s/In2Gaj,A.s/GaAs.  In  tliis  figure, 
the  gate  current  was  measured  at  a  constant  gate-to- 
.source  voltage  of  2.7  volts.  As  shown  in  this 
figure,  our  mtxlel  is  able  to  reproduce  the  three 


For  (V.k  >  K«/).  the  electron  temperature  is 
e.stimated  as 


Tgix)  =  ro+j-:|^]Tg(j£-)  p„(x)F(x)F, 

V  (  8  ) 

Therefore,  the  gate  current  as  a  function  of 
drain-to-, source  voltage  can  be  estimated  by  adding 
the  contribution  of  scattered  "hot"  electrons  under 
the  gate  and  its  given  as 

I^  =  A'T/Wgj  [Aix)  *3ix)  -cix)]dx 
0 

(9) 

where 


Figure  3.  Comparison  between  the  experimental 
and  theoretical  "hot"  electron  current  as  a 
function  of  gate  voltage  for  a  1.4  x  10  pm^ 
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regions  observed  in  the  experimental  data.  In  the 
linear  region  <  0.85  V),  the  experimental  and 
theoretical  data  correlate  well.  The  same  results  are 
obtained  in  the  saturation  region,  >1.5  V).  In 
the  transition  region,  our  model  usually  fails  to 
describe  the  gate  current  accurately.  The  reason  for 
this  discrepancy  between  the  theoretical  and 
experimental  data  is  due  to  the  number  of 
competing  mechanisms  in  this  transition  region 
which  are  not  accounted  for  in  our  model.  A  more 
sophisticated  simulation,  such  as  Monte  Carlo, 
would  be  more  accurately  to  describe  the  gate 
current  in  this  region. 

Similar  results  are  obtained  when  the 
experimental  and  theoretical  data  as  a  function  of 
gate-to-source  voltage  are  compared.  Figure  3 
shows  the  family  of  curves  as  a  function  of  gate-to- 
source  voltage  for  the  same  device.  As  shown  in 


(lrain-!()-s<>urce  vohage  (V) 


Figure  4.  Comparison  between  the  experimental 
and  theoretical  gate  current  as  a  function  of 
temperature  for  the  same  device. 

this  figure,  a  good  agreement  between  the 
experimental  and  theoretical  data  is  maintained  in 
the  linear  and  saturation  region  over  the  entire 
range  of  gate  voltages,  in  which  the  hot-electron 
effects  are  predominant. 

As  previously  mentioned,  modeling  of  the 
hot-electron  gate  current  was  also  performed  for  a 
wide  temperature  range,  and  the  theoretical  results 
were  also  compared  with  the  experimental  data. 
Figure  4  shows  the  gate  current  as  a  function  of 
drain-to-source  voltage  for  temperatures  ranging 
from  198  K  to  398  K.  As  shown  in  this  figure, 
good  agreement  between  the  experimental  Jind 


theoretical  results  was  obtained  over  the  entire 
temperature  range. 

V.  Conclusions 

An  analytical  model  to  describe  the  hot- 
electron  regime  of  operation  has  been  presented. 
This  model  was  built  on  a  Charge  Control  Model 
and  a  CHINT-like  model  in  which  the  electron 
temperature  is  limited  by  tlie  intervalley  transfer 
and  real-space  transfer  of  electrons.  Good 
agreement  between  the  models  and  the  experimental 
data  was  found  over  a  wide  temperature  range, 
from  198  K  to  398  K. 

Understanding  of  these  complex  mechanisms  is 
important  to  effectively  reduce  the  gate  leakage 
current  in  Complementary  HFETs. 
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ABSTRACT 

A  Dual-Gate  PM  HFET  monolithic  mixer  has  been 
developed,  fabricated  and  measured  at  60  GHz.  This 
circuit  uses  a  0.15  jim  gate  length  AlGaAs/InGaAs/GaAs 
pseudomorphic  HEMT  technology.  The  design 
optimization  is  performed  from  a  complete  extraction 
method  of  the  dual-gate  device  and  a  non-linear  electrical 
modeling.  The  mixer  has  a  maximum  conversion  gain  of 
-  5  dB,  a  minimum  noise  figure  of  1 1  dB  at  5  dBm  LO 
power,  an  output  IMPS  of  -2  dBm  and  a  LO  to  RF 
isolation  close  to  35  dB. 

1.  INTRODUCTION 

The  fabrication  of  GaAs  MMIC,  for  applications 
in  50-75  GHz  frequency  range  has  been  developed  these 
last  years  for  linear  and  non-linear  devices.  In  this 
perspective,  we  have  developed  a  microstrip 
AlGaAs/InGaAs/GaAs  dual-gate  HEMT  mixer.  The  main 
advantage  of  the  dual-gate  device  is  the  suppression  of 
the  external  wave  guide  coupler,  required  in  single  gate 
mixer,  for  the  combination  of  LO  and  RE  signals  [1]. 
Furthermore,  due  to  the  natural  isolation  between  the  two 
gates,  LO  and  RF  signals  can  be  combined  with  a  good 
isolation.  The  first  results  of  this  MMIC  shown  in  this 
paper  give  a  conversion  gain  close  to  -5  dB  for 
Frf  =  59.8  GHz  and  Flq  ^  ^ 

bandwidth  in  the  58.5  -  62  GHz  frequency  range.  The 
output  third  order  intercept  point  (IMPS)  is 
about  -  2  dBm.  These  results  constitute  a  promising 
performance  for  a  monolithic  dual-gate  PM  HFET  mixer 
operating  at  V  band. 


2.  MODELING  OF  THE  DEVICE 

The  dual-gate  mixer  design  needs  a  good 
knowledge  of  the  active  device  behaviour.  For  this  aim, 
due  to  its  complexity,  a  specific  modeling  has  been 
developed  in  order  to  determine,  in  particular,  the 
equivalent  scheme  of  the  transistor  which  can  be 
considered  as  two  equivalent  single  gate  devices  in 
cascode  configuration  [2].  The  method  is  divided  in  two 
main  sequences  where  the  access  and  intrinsic  elements 
are  successively  determined  [3].  For  this  extraction,  on 
wafer  three-ports  S  parameter  measurements  are 
performed  using  a  specific  test  bench  developed  in  our 
laboratory.  From  this  method  it  is  possible  to  obtain  with 
a  good  accuracy  the  main  parameter  evolutions  versus  the 
gate  biases  when  the  equivalent  transistors  are  operating 
in  linear  or  saturated  regimes.  Then,  from  this 
determination  a  dual-gate  nonlinear  electrical  model  has 
been  performed  for  the  mixer  design. 

3.  CIRCUIT  DESIGN  AND  FABRICATION 

The  mixer  operates  under  low-noise  conditions 
(LNM)  [4].  In  this  configuration,  the  first  equivalent 
transistor  is  linear  and  the  second  is  saturated.  Mixing 
occurs  in  the  first  transistor  and  the  second  one  is 
working  as  an  IF  amplifier.  The  RF  signal  is  applied  on 
the  gate  1  and  the  LO  signal  is  applied  on  the  gate  2.  RF 
and  LO  matching  are  assumed  by  open  stubs.  The  second 
gate  is  grounded,  using  a  resonant  circuit,  in  order  to 
obtain  an  IF  common  gate  amplifier.  Two  open  stubs 
achieve  LO  and  RF  short-circuits  at  the  drain.  IF 
matching  consists  of  a  combination  of  a  lumped  spiral 
inductor  and  a  capacitor.  This  circuit  provides  a  low-pass 
filter  and  a  real  impedance  at  IF  frequency.  Gate  bias 


0-7803-2516-8/95  $4.00  ©  1995  IEEE 


171 


circuits  using  X/4  lines,  include  50  Q  resistances  for  the 
stabilization. 


The  MMIC  was  fabricated  by  THOMSON  TCS 
Foundry  using  the  0.15  jim  gate  length  pseudomorphic 
AlGaAs/InGaAs/GaAs  HEMT  technology  with  three 
finger  of  25  pm.  The  PM  HFET  is  a  one  5-doped  layer 
device  of  SxlO'^cm'^  with  a  InGaAs  well  of  120  A 
depth.  A  photograph  of  the  fabricated  chip  is  shown  in 
figure  1. 


Fig.  1 :  Photograph  of  dual-gate  PM  HEMT 
mixer 


3.  PERFORMANCE 


The  monolithic  chip  is  mounted  in  a  50-75  GHz 
test  fixture  which  has  been  developed  in  our  laboratory 
[5]  and  that  we  have  modified  to  include  a  specific  insert 
with  a  third  access  for  the  IF  output.  Its  characteristics  are 
insertion  losses  lower  than  0.75  dB  and  return  losses 
lower  than  -20  dB  at  each  port. 


Vg1s 


Fig.  2  :  Dependance  of  the  conversion  gain 
versus  Vgls  (RF=62.5  GHz  ;  LO=56. 7  GHz) 


Fig.  3  Dependance  of  the  conversion  gain 
versus  Vg2s  (RF-=62.S  GHz  ;  LO=56. 7  GHz) 


The  dual  gate  mixer  optimization  is  very  complex  due  to 
the  two  gate  biases  influence.  The  criteria  chosen  is  the 
conversion  gain.  Figures  2  and  3  represent  its  dependance 
versus  the  gate  biases  Vgls  and  Vg2s.  It  can  be  noted 
that  the  optimum  condition  is  obtained  for 
0  <  Vgls  <  0.15  V  and  0  <  Vg2s  ^  0.2  V. 

But  the  conversion  gain  is  not  the  only  parameter  to 
consider.  In  a  second  step,  a  good  compromise  has  to  be 
found  between  the  isolations  RF  to  LO  and  LO  to  RF 
associated  with  a  good  matching  impedance  on  the  two 
input  ports. 


The  optimum  conditions  are  shown  in  figure  4  by  the  Sy 
mixer  parameters  analysis.  It  can  be  noted  that  in  this 
case,  the  LO  to  RF  isolation  is  close  to  30  dB  and  the  RF 
to  LO  isolation  is  better  than  35  dB.  These  results  are 
confirmed  with  measurement  made  with  a  LO  signal 
pumped.  The  input  return  losses  are  about  15  dB  on  the 
RF  port  and  6  dB  on  the  LO  port 

In  this  case,  the  LNM  gate  biases  conditions  are 
^gls  ^  0  V  and  Vg2s  =  0. 1 5  V  and  the  drain  bias  is  set  to 
3  V.  Figure  5  gives  the  conversion  gain  evolution  versus 
the  LO  power  where  Frp  =  59.8  GHz,  Flq  =  54  GHz 
and  Prp  =  -  23.5  dBm.  A  maximum  conversion  gain 
close  to  -5  dB  is  obtained  with  LO  drive  of  5  dBm. 
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Fig.  4 :  Sg  parameters  measurement 
(between  50  GHz  and  75  GHz  step  2.5  GHz) 


Fig.  5  :  LO  power  dependance  of  conversion 
gain 


For  this  LO  power  and  Flq  =  54  GHz  figure  6  shows  the 
evolution  of  the  conversion  gain  versus  RF  frequency.  A 
flat  bandwidth  is  obtained  from  58.5  to  62  GHz. 


Fig.  6 :  Conversion  gain  versus  RF  frequency 


Intermodulation  measurements  are  also  made.  The 
evolution  of  IMPS  versus  the  LO  power  is  represented  in 
Figure  7.  A  maximum  value  of  -2  dBm  is  obtained  at  the 
output  of  the  mixer. 
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CONCLUSION 


LO  Power  (dBm) 


Performance  of  dual-gate  microstrip  PM  HFET 
monolithic  mixer  has  been  reported  in  this  paper. 
Operating  under  Low  Noise  Mixer  configuration,  this 
circuit  exhibits  -5  dB  conversion  gain  with  5  dBm  LO 
power  from  58.5  to  60.5  GHz  and  an  output  third  order 
intercept  point  of  -2  dBm.  This  dual-gate  device  makes 
possible  to  obtain  good  isolations  between  LO  and  RF 
signals  avoiding  any  external  circuit  wich  constitutes  its 
main  advantage.  These  results  confirm  the  great  interest 
of  the  pseudomorphic  AlGaAs/GalnAs  on  GaAs  based 
MMIC  for  V  band  mixer  applications. 


Fig.  7 :  Output  IMPS  versus  LO  power 


The  dual  gate  mixer  noise  figure  is  represented  in 
figure  8  versus  LO  power.  A  minimum  value  close  to 
1 1  dB  is  observed. 
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Fig.  8  :  Noise  Figure  versus  LO  power 
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ABSTRACT 

Low-frequency  noise  in  SiO/InP  metal-insulator- 
semiconductor  field-effect  transistor  (MISFET)  is  studied 
systematically.  In  ohmic  regime,  the  drain  noise  power 
density  is  proportional  to  the  squared  drain-source  voltage, 
as  1/f  resistor  fluctuactions.  Hooge's  constant  a„  of  4x10'^ 
is  obtained.  The  1/f  noise  of  the  device  attributed  to 
number-fluctuation,  mobility-fluctuation,  and  to 
trapping/detrapping  in  space-charge  region  and 
oxide/semi  conductor  interface  are  examined  under  various 
bias  conditions.  Moreover,  a  generation-recombination  (G- 
R)  noise  corresponding  to  a  deep  energy  level  of  0.4  eV  is 
observed. 

L  INTRODUCTION 

The  InP  metal-insulator-semiconductor  field-effect 
transistors  (MlSFETs)  are  promising  for  the  use  in  high- 
power  devices,  high-speed  devices,  digital  circuits,  and 
monolithic  optoelectronic  circuits  due  to  its  high  saturation 
drift  velocity,  low  ionization  coefficient,  and  high  mobility. 
However,  the  performance  of  MlSFETs,  critically  depends 
upon  the  quality  of  MIS  gate  interface.  To  date,  the  best 
results  for  the  InP  MIS  structure  have  been  achieved  with  a 
deposited  dielectric  layer,  silicon  dioxied  (SiO,)  has  been 
the  most  widely  used  gate  dielectric  in  InP  MlSFETs, 
because  of  its  low  leakage  current,  high  breakdown  field, 
and  moderate  interface  state  density. 

Low-frequency  spectroscopy  technique  is  a  very 
powerful  tool  to  investigate  the  theoretical  and  practical 
information  about  the  material  and  the  devices  [1]. 
Although,  for  high  frequency  or  microwave  linear 
applications  we  do  not  worry  about  the  noise  at  low 
frequency,  the  low  frequency  noise  does  influence  the 
performance  of  microwave  nonlinear  applications  like 
diode,  or  FET  mixers  and  oscillators  [2].  Since  low- 
frequency  noise  induces  phase  noise  or  frequency 
modulation  (F.M.)  and  amplitude  modulation  (A.M.)  noise 
microwave  circuits,  it  is  important  to  try  to  reduce  the  low- 
frequency  noise  of  these  devices  by  investigating  its  origins 
and  mechanisms.  Several  low-frequency  noise  properties 
associated  with  these  devices  have  been  studied.  Duh  et  al. 
studied  the  f  drain  current  noise  in  modulation-doped 
FETs  (MODFETs)  under  a  gate-biased  condition  [3], 
Loreck  et  al.  showed  the  generation-recombination  (G-R) 
noise  arising  from  the  deep  levels  in  AlGaAs  [1],  Zhang  et 


al.  the  burst  noise  in  do.uble-hetero  bipolar  transistors  [4] 
and  Liu  et  al.  the  gate  f  '  noise  in  AlGaAs/GaAs  MODFETs 
[5]  and  InGaAs  quantum-well  MQDFETs  [6].  However,  to 
our  knowledge,  there  is  no  investigations  for  the  low- 
frequency  noise  performances  of  Si02/InP  MISFET 
structure.  In  this  letter,  we  have  systematically  studied  the 
low-frequency  drain-voltage  noise  in  Si02/InP  MISFET 
under  various  bias  conditions  for  the  purpose  of 
determining  the  mechanism  of  the  noise. 

2.  EXPERIMENTS 

The  n-channel  depletion-mode  SiO^/InP  MISFET 
structure  is  sketched  in  Fig.  1  and  its  transconductarice  is 
about  63  mS/mm  for  a  2x200  pm  gate.  The  gate  oxide  was 
deposited  by  direct  photo-enhanced  chemical  vapor 
deposition  (CVD)  under  the  irradiation  of  deuterium  (IX) 
lamp  at  low-temperature  range  (<250  ”C).  The  reactant 
gases  were  silane  (SiHJ  and  oxygen  (O2).  The  detailed 
description  of  process  and  a  full  summary  of  Si02/InP 
MISFET  characteristics  have  been  published  elsewhere  [7]. 
For  measurement  purposes,  selected  device  chips  were 
mounted  on  TO-5  headers  and  bounded  with  a  thermo¬ 
compression  ball  bonder.  The  noise  was  measured  in  a 
brass-shielded  box  at  room  temperature,  and  a  bias  d.c. 
current  was  supplied  by  batteries  through  a  1 00-K  resistor. 
The  voltage  noise  across  the  current  terminals  was 
amplified  by  a  low-noise  preamplifier,  and  the  outputs  were 
fed  to  an  HP  3652A  spectrum  analyzer.  The  analyzer  was 
controlled  automatically  by  a  personal  computer  (PC).  The 
smallest  detectable  circuit  noise  of  the  setup  was  about  -1 72 
dBV/Hz'^  between  10  Hz  and  100  kHz.  The  spectrum 
analyzer  averaged  2^  noise  spectra  of  every  decade,  and  the 
computer  further  averaged  the  data  and  subtracted  the  a.c. 
line  noise. 

3.  RESUI.TS  and  DISCUSSION 

The  drain  noise  power  density  of  Si02/lnP 
MISFET  were  measured  by  biasing  the  device  in  ohmic 
region.  The  measured  noise  spectra  of  the  device  with 
2x200  pm  gate  for  a  variety  of  drain  voltages  from  0.1  tp 
0.6  V  and  a  fixed  gate  voltage  of  0  V  are  shown  in  Fig.  1 . 
We  observed  the  noise  spectrum  density  Svd{f)  of  MISFET 
decreases  with  frequency  at  the  rate  of  f  below  60  Hz, 
and  f  '“  above  it.  This  is  tyical  of  f  *  noise  superimposed  on 
the  G-R  noise.  The  turnover  frequency  of  60  Hz  is 
observed.  This  has  been  previously  reported  for  GaAs 
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MESFET  [8,9].  Furthermore,  the  noise  power  density  at 
frequencies  lower  than  1  kHz  is  at  least  ten  times  larger  than 
the  thermal  noise  and  the  background  noise  of  the  system. 
Figure  2  shows  Sy/f)  versus  drain  voltage  Vj  at  a  frequency 
of  40  Hz  calculated  from  Fig.  1 .  The  noise  spectra  density 
has  a  squared  voltage  (V/)  dependence  expected  of  a 
resistive  mobility-fluctuation  mechanism.  The  mobility- 
fluctuation  noise  is  caused  by  interaction  of  electrons  with 
slowly  fluctuating  longitudinal  acoustical -phonon 
populations  [10].  Already  in  1969,  Hooge  had  postulated 
[10],  based  on  data  obtained  for  many  different  materials, 
that 


where  is  the  voltage  across  the  device,  f  and  N  are  the 
frequency  and  the  total  carrier  number,  respectively,  and 
is  the  Hooge  noise  parameter.  Recent  measurements  on  a 
GaAs  MESFET  showed  the  value  of  5.6x10*^  [11],  and 
those  on  a  quarter-micron  GaAs  Hall  device  a  value  of 
an=4.5xl0^  [12].  The  proportionality  constant  an  become 
thus  a  parameter  to  give  the  magnitude  of  the  f  '  noise 
levels.  Fitting  the  noise  curve,  a  new  empirical  formula  for 
f was  then  proposed  based  on  these  experiments  with  the 
value  a„  of  4x10*’. 

Fig.  3(a)  and  3(b)  show  the  noise  power  spectra 
versus  frequency  for  various  negative  and  positive  gate 
voltages,  respectively,  in  the  drain  voltage  of  0.05  V.  These 
spectra  in  Fig.  3  present  pure  f*^  noise  without  the  f in 
intermediate  frequency  as  shown  in  Fig.  1 .  The  noise  power 
density  versus  gate  voltage  for  the  drain  voltages  of  0.05  V 
and  0.5  V,  respectively,  at  frequency  of  40  Hz  is  plotted  in 
Fig.  4.  For  negative  gate  voltage,  at  low-field  region 
(Vd=0.05  V),  the  noise  increases  with  the  increase  of  the 
gate  voltages.  It  indicates  that  the  depletion  layer  near  the 
gate  electrode  plays  an  important  role  in  noise  behaviour. 
The  larger  depletion  layer  results  in  the  layer  noise 
generation.  Sah  explained  by  the  random  fluctuations  in  the 
occupancy  of  deep  traps  in  the  depletion  region  [13].  For 
positive  gate  voltage,  the  noise  increases  with  the  increase 
of  gate  voltages  as  a  result  of  trapping  of  carriers  by  oxide 
traps  and  the  interface  surface  states  as  found  for  MOSFET 
[14].  Our  data  of  Sv^(f)  agrees  well  with  the  carrier  density 
fluctuation  model  [15].  At  high  field  region  (Vd=0.5  V)  as 
shown  in  Fig.  4,  however,  there  is  no  obvious  variation  in 
drain  noise  power  density  throughout  the  gate  voltages. 
Because  the  mobility-fluctuation  1/f  noise  in  high  field  field 
predomated  over  the  1/f  noise  attributed  to  trapping  of  the 
depletion  region  and  interface  surface  states. 

4.  CONCLUSION 

In  conclusion,  low-frequency  noise  measurements 
were  performed  in  ohmic  region  of  SiO/InP  MISFET  for 
various  test  conditions.  The  noise  power  density  is 
proportional  to  V/,  as  l/f  resistor  fluctuations.  Hooge's 
constant  a„  of  4x10’  is  obtained.  The  orgins  and 


mechanisms  of  noise  in  MISFET  were  investigated  and 
analyzed. 
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Fig.l  The  noise  spectra  Svd(f)  <^fSi02/InP  MISFET  for  Vg,=0  V  and  Vj  varied  from  0.1  V  to  0  6  V. 


Fig.2  The  noise  spectra  SyjCO  of  SiO./InP  MISFET  versus  drain  voltage  at  a  frequency  of  40  Hz 
calculated  from  Fig.l. 
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Fig.3  The  noise  spectra  Svd(0  of  SiO./lnP  MISFET  versus  frequency  for  (a)  Vp=-0  IV,  -0.3V,  -0.5V,  -0.8V 
and  -2V;  and  (b)  Vp=0V,  0.5V  and  IV,  respectively,  at  Vj=0.05V. 
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Fig.4  The  noise  power  density  of  SiOj/InP  MISFET  versus  gate  voltage  at  the  frequency  of  40  Hz  and 
varied  Vj=0.05  V  and  0.5  V,  respectively 


A  NEW  FORM  OF  THE  FOURIER  TRANSFORM  FOR 
TIME- VARYING  FREQUENCY  ESTIMATION’ 

Vladimir  Katkovnik 
Department  of  Statistics, 

University  of  South  Africa  (UNIS A) , 

RO.  Box  392,  Pretoria  0001,  RSA, 

Tel:  (012)  429-6210,  FAX:  (012)  429-3434, 

E-Mail  :  katkov@risc6.iniisa.ac.za 


Abstract —  The  local  polynomial  Fourier  trans¬ 
form  (LPFT)  and  the  local  polynomial  periodogram 
(LPP)  are  proposed  in  order  to  estimate  the  rapidly 
time- varying  instantaneous  frequency  (IF)  Q{t)  of 
a  harmonic  signal.  The  LPFT  gives  the  time- 
fi^uency  power  distribution  over  the  t  —  {Q{t), 
(r^~^Q{t)/dt^~^)  space,  where  m  is  a  de¬ 
gree  of  the  LPFT-  The  LPFT  enables  one  to  estimate 
both  the  time- varying  frequency  and  its  derivatives. 
The  tecdmique  is  based  on  fitting  the  local  polyno¬ 
mial  approximation  of  the  frequency  which  imple¬ 
ments  a  high-order  nonparametric  regression.  The 
a  priori  information  about  bounds  for  the  frequency 
and  its  derivatives  can  be  incorporated  to  improve 
the  acciiracy  of  the  estimation.  The  asymptotic 
mean  square  errors,  bias  and  covariance,  of  the  es¬ 
timators  of  d^Q(t)/dt"  ,  s  =  0, 1,2,  ...,m  —  1,  are  ob¬ 
tained.  The  considered  estimators  are  high-order 
generalization  of  the  short-time  Fourier  transform. 
The  comparative  study  of  the  asymptotic  varianc:e 
and  bias  of  the  estimates  is  presented. 

I.  Introduction 

By  convention  the  term  “frequency”  is  associated  with 
the  Fourier  transform  defined  as  an  infinite  integral 
over  the  time  domain.  Thus  the  time  is  removed, 
implying  that  the  Fourier  frequency  is  not  a  descrip¬ 
tor  of  a  signal  at  a  specific  time-instant.  For  nonsta¬ 
tionary  signals  the  time  variation  of  frequency  infor¬ 
mation  in  y{t)  will  be  obscured  since  the  spectrum  is 
/(cj)  —  \Y{(j)f .  An  alternative  representation  is  the 
short  time  Fourier  transform  (STFT)  which  is  evalu¬ 
ated  by  applying  a  suitable  windowing  function  to  the 
original  signal  and  evaluating  the  conventional  Fourier 
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transform  of  the  resulting  length  sequence.  The  STFT 
of  the  signal  y{t)  can  be  given  in  particular  by 

/oo 

Ph(u)y{t  +  u)  exp{-ju}u)du,  (1) 

OO 

where  is  a  windowing  function  and  h  is  a  param¬ 
eter  which  controls  a  length  of  the  window.  Using 
this  technique,  an  approximation  to  the  spectral  con¬ 
tent  at  the  point  t  can  be  achieved  by  computing 

In  recent  years,  time-frequency  representations  alter¬ 
native  to  the  STFT  and  based  on  time- frequency  distri¬ 
butions  have  been  developed.  The  time-frequency  dis¬ 
tributions,  which  indicate  the  energy  content  of  a  signal 
as  a  function  of  both  time  and  the  IF  n(i),  are  power¬ 
ful  tools  for  time- varying  signal  analyses.  They  have 
been  utilized  to  study  a  wide  range  of  signals,  including 
speech,  music,  and  other  acoustic  signals,  biological  sig¬ 
nals,  radar  and  sonar  signals,  and  geophysical  signals. 
Recent  reviews  of  the  field  are  given  in  [1],  [4]. 

Most  time-frequency  distributions  of  current  interest 
are  members  of  Cohen’s  bilinear  (quadratic)  class  [3], 
which  includes  as  a  particular  case  the  Wigner-Ville  dis¬ 
tribution  (WVD). 

The  polynomial  WVD  was  proposed  [2]  in  order  to 
improve  the  tracking  ability  of  time-frequency  distribu¬ 
tions  and  has  the  similar  optimal  concentration  in  the 
t—cj  plane  for  a  polynomial  IF  of  the  complex  harmonic 
signal. 

Another  approach  to  the  problem  and  a  new  class 
of  time- frequency  distributions  were  studied  in  [5]- [9]. 
The  local  polynomial  approximation  of  the  time- varying 
phase  was  used  in  order  to  derive  and  to  justify  the  local 
polynomial  Fourier  transform  (LPFT).  This  transform 
is  as  follows 

y^(d;,0=  (2) 
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I  /!)fc(u)y(t  +  «)exp(-j0(«,a;))dti, 

J  —  oo 

Cj  =  {u}i,u;2,...,cJm)'  e  R"", 

where 

6{u^Q)  =  (ji .  ti  +  a;2  •  u^/2  + ...  -|-  Um  •  (3) 

The  local  polynomial  periodogram  (LPP)  is  then  de¬ 
fined  as  follows 

4(w,t)  =  |yh(u>,oi"-  (4) 

The  quadratic  and  higher  degree  polynomials  of  the 
signal  are  used  in  the  Cohen’s  and  polynomial  classes  of 
time-frequency  distributions  with  the  exponential  func¬ 
tion  exp(— jcju)  by  universal  convention  for  the  Fourier 
transform.  The  argument  of  this  exponent  is  linear  with 
respect  to  variable  n.  The  LPFT  on  the  other  hand 
is  linear  with  respect  to  the  signal  and  the  polyno¬ 
mial  function  of  appears  in  the  argument 

of  the  complex  exponent.  This  is  a  principal  differ¬ 
ence  between  the  LPFT  and  the  Cohen’s  and  polyno¬ 
mial  classes  of  time-frequency  distributions. 

The  representation  similar  to  (2),  with  a  polynomial 
in  the  power  of  the  complex  exponent,  appeared  in  the 
literature  as  a  problem  of  estimating  time-invariant  co¬ 
efficients  of  the  polynomial  phase  of  complex  signals, 
which  originated  in  radar  applications  [15].  The  vari¬ 
ance  and  the  Cramer-Rao  upper  bounds  of  these  esti¬ 
mates  were  studied  in  [14]  and  [15].  In  contrast  to  the 
problem,  considered  in  [14]  and  [15],  the  LPFT  and  the 
LPP  in  the  form  (2)- (4)  have  been  developed  with  quite 
a  different  motivation. 

In  [10],  [12]  the  LPFT  and  LPP  have  been  derived 
within  the  local  polynomial  approximation  (LPA)  ap¬ 
proach  and  interpreted  as  a  generalization  of  the  STFT 
and  as  a  form  of  the  time-frequency  distribution.  The 
accuracy  of  the  LPA  of  the  time- varying  frequency  and 
amplitude  is  studied  in  [9].  The  accuracy  of  the  LPP 
estimator  of  the  time-varying  frequency  and  its  deriva¬ 
tives  is  considered  in  [11],  [13]. 


IL  Estimation  of  the  IF 

The  LPP  estimator  is  introduced  as  a  solution 

of  the  following  constrained  optimization  problem  [10]: 

Q){t,  h)  =  arg  max^eQcfi"*  t),  .  . 

Q  =  {u; :  lo^sl  <  La  ,  s  =  l,2,...,m},  ^  ^ 

where  Ls  contain  the  a  priori  information  about  the 
IF,  <  La,  5=  1,2,... ,m. 

It  was  shown  in  [10]-[13]  that  the  components  of 
fr)  yield  the  following  estimates:  Cj\  (t,  h)  for  the  fre¬ 
quency  Q{t)  and  cja+i(^fr)  for  the  derivatives 
s  =  1,2,  ...m—  1,  Both  the  convergence  result,  Cj{t,  h)  — > 
Q^{t)  as  h  — ►  oo, 

and  the  asymptotic  accuracy  of  the  estimator  proves 
that  the  distribution  as  a  function  of 

Q  concentrates  in  the  point  uP{t). 

III.  Asymptotic  accuracy 

The  accuracy  results  are  given  here  for  the  discrete-time 
version  of  the  LPFT.  The  vector-error  is  as  follows 

Aw(fcT,  h)  =  Q°{kT)  -  Q{kT,  h),  Au;(kT,  h)  €  RT. 

(6) 

It  is  obvious  that  only  asymptotic  analytical  results 
for  the  AQ{kT,  /i),  where  the  estimator  a>(fcT,  /i)  is  a  so¬ 
lution  of  the  awkward  multimode  optimization  problem 
(5),  can  be  obtained. 

First  of  all  let  us  make  clear  what  kind  asymptotic 
is  studied.  We  will  assume  that  the  error  AQ{kT^h), 
the  noise  e  and  the  derivatives  s  >  m,  are 

small.  The  last  means  that  Lg  are  assumed  to  be  small 
and  therefore  the  frequency  Q{t)  is  close  to  a  polyno¬ 
mial  function  of  the  power  (m  —  1).  These  assumptions 
are  the  basis  for  linearization  of  exponential  functions 
which  is  used  in  the  analysis. 

We  assume  also  that  the  window  length  is  large 
{h  — >  oo)  so  that  a  large  number  of  harmonic  cir¬ 
cles  is  imbedded  into  the  window.  This  is  a  long-time 
type  asymptotics  usual  for  most  analytical  results  in 
the  spectral  analysis.  The  sampling  interval  T  is  not 
assumed  to  be  small. 

The  first  two  moments  of  the  vector-error  A(j{kT,h) 
are  used  in  the  paper  in  order  to  characterize  the  accu¬ 
racy  of  the  estimation: 

a)  the  covariance  matrix 

cov{Au;{kT^h))  —  (7) 

E{{Au{kZ  h)  -  EA(b{kT,  h))  x 
{AQj^kT,  h)  -  EAu^ikT,  /i))'), 
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where  the  prime  ( ' )  stays  for  transpose  and  the  aster¬ 
isk  means  a  complex  conjugate  value. 

b)  the  bias,  which  is  presented  in  the  form  of  two 
expansions: 

oo 

EAw{kT,  h)  =  Y^  Bs{h)Q^^\kT),  Bs{h)  €  RT,  (8) 

«=0 

and 

\EAcj{kT, h)\  <  Bi^\h)Lm,  Bi^Hh)  e  ii”* ,  (9) 

where  are  determined  in  (5).  The  inequality  |x|  < 
|y|,  x,y  e  in  (9)  means  that  \xj\  <  \yj\,  j  = 

1,2,..., 771. 

For  the  considered  estimators  Bs{h)  —  0  for  s  = 
0, 1, m  “  1,  because  the  reproduction  properties  with 
respect  to  polynomial  components  of  Q{t)  are  insured 
by  the  local  polynomial  approximation  estimates  [5] -[8], 
thus  (8)  is  as  follows: 

oo 

EAu{kT,  h)=Y^  B,{h)n^^HkT).  (10) 

s—m 

As  a  matter  of  fact  (9)  is  a  minimax  upper  bound. 

The  expansions  (8)-(10)  present  quite  a  clear  pic¬ 
ture  of  the  bias  at  the  instant  kT  through  the  val¬ 
ues  of  the  derivatives  of  fl{kT),  Both  the  coefficients 
Bs{h)^  and  cov{Au;{kT,  h))  appear  to  be  a  very 

convenient  tool  for  a  detailed  comparison  of  the  accu¬ 
racy  and  the  tracking  ability  of  the  different  kinds  of 
algorithms.  The  orders  of  the  bias  and  the  covariance 
matrix  with  respect  to  h  and  T  can  be  given  in  clear 
analytical  form. 

The  following  results  are  given  for  the  rectangular 
windows: 

a)  Symmetric  window  with  ph{nT)  =  T//i,  if  \nT\  < 
h/2, 

b)  Nonsymmetric  window  with  phi'^T)  =  T/h,  if 
—h  <  nT  <  0,  so  that  /i  is  a  window  length,  and  h/T  is 
a  number  of  observation  in  the  window 

Let  us  present  the  asymtotic  formulas  derived  for  the 
covariance  and  bias  of  the  estimates.  These  formulas 
enable  us  to  compare  the  accuracy  of  the  different  al¬ 
gorithms  in  clear  analytical  form. 

The  following  notation  is  used  in  what  follows: 

D{h)  =  diag{h,h'^,...,h^), 

=  limh_oo  Ph{nT)U{nT)U{nT)'  - 

Er=-oc  PH{nT)U{nT)  ^^=-00  PH{nT)U{nT)']D-\h), 


^  ^{h)x 

(Er=-oo  Ph{nT)U{nT){nTr+^- 
E~=-oo  Ph{nT)U{nT)  p,.(nT)(nr)-+‘), 

(Er=-oo  P/.(nT)l[7(nT)||nT|'+i+ 

I EZ-00  Ph{nT)U{nT)\ ^Z-oo  PhinT)\nT\^+^), 

where 

U{u)  =  (w,tt^/2,...,w"^/m!)', 

and  \U\  and  \W-^\  are  a  vector  and  a  matrix  of  abso¬ 
lute  values  of  U  and  respectively. 

Proposition  1  Let  u>{kT,h)  be  a  solution  of  (5), 
and  ,  s  =  m,  m  -f  1, such  that  as  h  oo 

Ls+i-h^^O,  (11) 

then 

coviD{h)A^ikT,  h))  =  (12) 

and  (S)  is  of  the  form: 

oo 

ED{h)Au>(kT,h)  =  ^  B«(ft)nW(A:r),  (13) 

s=m 

Bs{h)  -  .  6(s). 

Let  in  f  (11)  hold  for  s  =  m,  where 

supt  IcT^Q/df^l  <  Lm-\-i,  then  (9)  is  of  the  form: 

\ED(h)ACj{t,h)\  <  B^ZHh)L„,+i,  mZHh)  €  B(t4>) 

■ 

Comment  to  the  Proposition. 

The  formulas  (12)-(14)  determine  the  covariance  ma¬ 
trix  and  the  bias  of  the  estimates.  It  follows  that  the 
asymptotic  mean  square  errors  are  of  the  form: 

E{J‘-^\kT)  -  Us(kT,  h)f  =  (15) 

0{ - -2 - T  +  Ll^,  .  +1))^ 

\A\ 

s  =  l,2,...,m. 
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The  expression  (15)  gives  a  clear  picture  of  the  accuracy. 
The  constants  for  (15),  which  depend  on  m,  can  be 
calculated  from  (12)-(i4).  The  optimal  choice  of  h  in 
(15)  yields  the  bias-variance  trade-off  and  results  in 

-  u>sikT,h)f  I 

s  =  l,2,...m, 

(16) 

where  B  is  assumed  to  be  small. 

Let  us  note  that  the  variance  is  a  linear  function  of 
the  sampling  interval  T,  but  the  bias  does  not  depend 
on  it.  The  accuracy  results  for  the  STFT  follows  from 
(15)  and  (16)  if  one  assumes  m  =  1  in  these  formulas. 

IV.  Conclusion 

A  new  form  of  the  time-frequency  transform  LPFT 
and  a  new  approach  to  the  estimation  of  the  rapidly 
time-varying  frequency  of  a  complex  harmonic  signal 
are  proposed  and  justified.  The  most  important  distinc¬ 
tive  feature  of  the  new  transform  is  that  the  degree  of 
the  complex  exponential  function  generic  for  the  Fourier 
transform  is  a  polynomial  function  in  the  LPFT. 

The  LPFT  is  applied  in  order  to  propose  a  new  form 
of  the  periodogram,  the  LPP,  where  the  argument  is 
a  vector  whose  components  provide  estimates  of  the  IF 
and  its  derivatives.  The  convergence  results  and  asymp¬ 
totic  MSE  for  the  estimator  give  an  insight  into  prop¬ 
erties  of  the  LPP  and  LPFT. 

For  large  ho  the  accuracy  of  the  LPA  estimates,  char¬ 
acterized  by  the  MSE  values,  has  orders  for 

the  estimates  of  k  =  l,2,..,,m. 

The  disadvantage  of  the  developed  approach  is  a  com¬ 
plexity  of  the  optimization  problem  (5).  Nevertheless, 
the  study  and  calculations  show  that  there  are  a  few 
quite  prospective  ways. 

In  particular,  the  following  sequential  algorithm,  with 
a  one-dimensional  optimization  for  the  every  step,  has 
been  successfully  tested  for  the  linear  LPA,  m  =  2. 
The  first  step  assumes  that  a;2  =  0  and  Ih{uJi,U2,t)  is 
maximized  over  u;i,  =  argmax^^j  /;i(a;i,0,t).  The 
second  step  is  maximization  of  over  0J2 

,  =  argmaxa;2  and  so  on.  Experi¬ 

ments  show  the  fast  convergence  of  this  recursive  pro¬ 
cedure. 
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ABSTRACT 

This  paper  presents  stochastic  models  and 
estimation  algorithms  for  speckled  images.  We  treat 
speckle  from  a  novel  point  of  view:  that  of  a  carrier  of 
useful  surface  information  as  opposed  that  of  a  contam¬ 
inating  noise.  The  stochastic  models  for  surface  scat¬ 
tering  are  based  on  a  doubly  stochastic  marked  Poisson 
point  process.  For  each  of  these  surface  scattering  statis¬ 
tical  models,  we  present  estimation  algorithms  to  deter¬ 
mine  the  average  surface  reflectivity  and  scatterer  density 
within  a  resolution  cell  using  intensity  measurements  of 
speckled  images.  We  show  that  the  maximum  likelihood 
estimator  is  optimal  in  the  sense  that  the  variance  of  the 
error  is  the  smallest  possible  using  any  other  conceivable 
estimate  having  the  same  bias  with  the  same  data. 

1.  INTRODUCTION 

In  most  prior  work  on  speckle  development  in  images, 
speckle  has  been  viewed  as  a  multiplicative  noise  that 
contaminates  the  image  [1],  making  interpretation  and 
image  information  extraction  more  difficult.  As  a  result, 
a  great  deal  of  effort  has  been  expended  to  look  at  ways 
of  reducing  speckle  in  images  in  order  to  improve  their 
interpretation. 

In  our  approach,  we  make  use  of  the  fact  that  speckle  is 
a  function  of  the  spatial  distribution  of  scatterers  within 
the  imaged  resolution  cell  on  a  scale  corresponding  to 
the  wavelength  of  the  illuminating  radiation.  As  a  result, 
speckle  can  be  viewed  as  a  carrier  of  information  about 
roughness  characteristics  of  the  imaged  surface. 

Our  goal  is  not  to  reduce  speckle  per  se,  but  to  make 
use  of  it  in  developing  techniques  for  modeling  the  scat¬ 
tering  surface  using  random  point  processes  influenced 
by  an  underlying  information  process  and  estimating  the 
surface  reflectivity  and  density  of  elemental  scatterers 
{average  number  of  scattering  points  per  resolution  cell) 
from  Synthetic  Aperture  Radar  (SAR)  reflectivity  mea¬ 
surements. 

The  estimation  scheme  under  consideration  is  based 
on  the  marked  Poisson  point  process  and  the  resulting 
statistics  of  partially  developed  speckle  which  we  derived 
in  a  separate  paper  [2]. 

Although  the  emphasis  of  this  work  is  on  synthetic 
aperture  radar  systems,  we  use  SAR  systems  in  our  analy¬ 
sis  to  establish  results  that  will  serve  as  a  basis  for  estima¬ 
tion  problems  in  other  coherent  imaging  systems  where 


the  speckle  phenomenon  is  present,  such  as  medical  ul¬ 
trasound  imaging  systems. 

2.  SURFACE  SCATTERING  STOCHASTIC  MODEL 


2.1  Marked  Point  Process  Speckle  Model 


In  the  single-look  model,  a  radar  resolution  cell  Is  as¬ 
sumed  to  contain  a  collection  of  N  elemental  point  scat¬ 
terers  randomly  distributed  throughout  the  resolution 
cell,  with  each  elementary  scatterer  position  distributed 
independently  of  the  positions  of  other  scatterers.  The 
random  spatial  distribution  of  the  scatterers  is  described 
by  a  point  process  with  a  set  of  points  having  associated 
complex  marks  Ejy{j  =  1,  •  •  • ,  N)  corresponding  to  the 
backscattered  electric  field  from  the  j-th  scatterer.  Each 
backscattered  electric  field  component  Ej  has  a  constant 
amplitude  Aj  equal  to  the  size  or  reflectance  strength  of 
the  j-th  scatterer  and  a  random  phase  <f>j  uniformly  dis¬ 
tributed  over  the  interval  [0,27r).  We  assume  that  the 
number  of  scattering  points  within  a  resolution  cell  is 
Poisson  distributed  with  a  rate  AA,  where  A  is  the  area 
of  a  resolution  cell. 

When  electromagnetic  waves  are  scattered  from  such  a 
surface,  the  resultant  scattered  field  is  the  superposition 
of  the  electric  fields  scattered  by  each  of  the  elemental 
scatterers.  The  resulting  process  is  a  marked  Poisson 
point  process  and  A  is  the  intensity  of  this  process  over  a 
resolution  cell  with  area  A.  Wlien  A(-)  is  a  random  pro¬ 
cess,  the  point  process  is  referred  to  as  doubly  stochastic 
marked  Poisson  point  process  [3|. 

SAR  systems  record  intensity  measurements  of  the 
mapped  surface  according  to: 


Sn 


N 


(1) 


Speckle  under  this  model  is  refered  to  as  partially  devel¬ 
oped. 

In  the  multilook  model,  L-independent  diversity  mea¬ 
surements  are  taken  over  the  resolution  cell  by  the  radar. 
This  technique  involves  the  noncoherent  sum  of  L  sta¬ 
tistically  independent  single  realizations  of  the  intensity 
measurements  Sm  (i  =  1,2, ...,L)  in  Eq.  (1)  at  each 
resolution  ceil: 

L 

Tnl  = 
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(2) 


In  a  separate  paper  [2],  we  present  a  complete  deriva^ 
tion  for  the  intensity  probability  density  function  condi¬ 
tioned  on  N  for  the  single  look  and  multilook  models. 

2.2  Reflectance  Amplitude  Models 

Since  the  surfaces  mapped  by  SAR  systems  have  vary¬ 
ing  roughness  characteristics  with  respect  to  the  wave¬ 
length  of  the  illuminating  radiation,  we  expect  the  am¬ 
plitude  (strength)  of  the  scatterers  to  be  governed  by  dif¬ 
ferent  models  for  diff'erent  types  of  surfaces.  We  consider 
two  models:  (1)  the  reflectances  of  the  elementary  scat¬ 
terers  are  constant  (model  discussed  in  the  previous  sec¬ 
tion),  and  (2)  the  reflectances  are  Rayleigh  distributed. 

In  the  case  of  Rayleigh  distributed  amplitudes,  we  con¬ 
sider  the  surface  resolution  cell  as  made  up  of  a  small 
number  of  scattering  centers,  with  each  center  made  up  of 
a  large  number  of  elementary  scattering  points  (Rayleigh 
center). 


We  note  that  £^“^(-)  are  the  generalized  Laguerre  poly¬ 
nomials,  K{  )  is  the  complete  elliptic  function,  and  the 
coefiBcients  Cm  are  tabulated  in  a  separate  paper  [2]. 

The  EM  algorithm  was  numerically  implemented  for 
various  values  of  the  parameter  A  (assumed  known).  The 
results  after  k  iterations  are  shown  in  Figs.  1  and  2  for 
different  numbers  of  looks.  In  all  cases,  the  sequence  of 
estimates  A^^J  converges  towards  the  maximum  likelihood 
estimate.  We  assumed  that  the  area  of  a  resolution  cell 
is  625  m^(25  m  x  25  m),  which  is  a  typical  size  for  the 
resolution  cell  of  Seasat  SAR. 

3.1.1  Performance  of  the  ML  estimator 
The  performance  of  the  estimator  is  characterized  by 
the  bias  6(A)  and  by  comparing  the  variance  of  the  error 
with  the  Cramer- Rao  lower  bound  (CRLB)  defined  as  [3]: 


3.  ESTIMATION  TECHNIQUES 

3.1  Constant  Amplitude  Model 

3.1.1  Maximum  Likelihood  Estimation 

The  objective  is  to  estimate  the  average  reflectivity 
of  SAR  scattering  surfaces,  or  equivalently,  the  rate  XA 
of  the  Poisson  process  from  the  intensity  measurements 
given  by  Eq.  (1).  This  estimation  problem  is  rather  diffi¬ 
cult  since  the  parameter  XA  to  be  estimated  is  implicitly 
imbedded  in  the  reflected  signal  intensity  as  the  rate  XA 
of  the  marked  Poisson  process. 

Due  to  the  mathematical  complexity  of  the  likelihood 
function  for  both  single  and  multilook  cases,  we  formu¬ 
late  an  expectation-maximization  (EM)  algorithm  [3]  to 
produce  a  recursive  ML  estimate  of  A. 

We  show  that  the  EM  algorithm  is  implemented  re¬ 
cursively,  starting  with  an  admissible  initial  estimate 
A^^i  >  0,  according  to: 

Al*+»l=  tur  (3) 


p'(t)  =  exp(— Aj4)(^^^j  .ff  (^n/^  •^io.4)u(4.»i 


+— (i-1 

r(L)' 


(AA)"  {  t\.  , 


nssnt'f  1 


where  Jl(A)  is  the  Fisher  information  matrix. 

The  expected  value  of  the  estimator  E  ^Aj  can  be  esti¬ 
mated  using  Monte  Carlo  simulation  of  the  speckle  inten¬ 
sity  measurements.  The  numerical  results  are  illustrated 
in  Fig.  3  for  1  and  2  looks.  We  note  that  bias  is  reduced 
as  the  number  of  looks  is  increased.  This  is  a  direct  result 
from  the  property  that  the  maximum  likelihood  estima¬ 
tor  is  a  consistent  estimator  [4]. 

We  can  estimate  the  variance  of  the  estimator  and  the 
Cramer-Rao  lower  bound  using  Monte  Carlo  simulation 
of  intensity  measurements.  Figure  4  provides  a  compari¬ 
son  between  the  estimated  variance  of  the  error  and  the 
estimated  CRLB  as  a  function  of  the  rate  A  A,  for  vari¬ 
ous  number  of  looks.  For  a  given  number  of  looks,  the 
variance  of  the  error  and  the  CRLB  increase  as  the  rate 
gets  larger.  This  follows  from  the  fact  that  the  rate  and 
the  variance  of  the  Poisson  process  are  equal.  As  the 
rate  increases,  so  does  the  variance  of  the  Poisson  pro¬ 
cess,  causing  a  lower  performance  of  the  estimator.  As 
the  number  of  looks  is  increased,  the  variance  of  the  error 
and  the  CRLB  decrease,  and  in  addition,  the  variance  of 
the  error  gets  closer  to  the  CRLB.  The  fact  that  the  per¬ 
formance  of  the  estimator  improves  with  a  larger  number 
of  looks  is  also  a  direct  result  of  the  consistency  property 
of  the  ML  estimator. 

Overall,  it  is  evident  from  these  graphs  that  the  per¬ 
formance  of  the  ML  estimator  is  very  good.  For  rates 
lower  then  20,  the  variance  of  the  error  is  very  close  to 
the  CRLB  when  2  or  more  looks  are  used.  For  larger 
rates,  only  4  looks  are  needed  to  bring  the  variance  of 
the  error  much  closer  to  the  CRLB. 


and 


3.2  Rayleigh  Amplitude  Model 


3.2.1  Maximum  Likelihood  Estimation 
We  derive  the  following  recursive  algorithm  for  the  es¬ 
timate: 


n=l 
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The  EM  algorithm  was  numerically  implemented  for 
various  values  of  A  (assumed  known),  starting  with  ad¬ 
missible  initial  estimates  XW  >  o.  The  results  after  k  it¬ 
erations  are  shown  in  Fig.  5  for  4  looks.  We  chose  L  =  4 
because  this  is  the  number  of  looks  that  is  usually  used 
for  SAR  systems  in  real-life  applications. 

3.2.2  Performance  of  the  Maximum  Likelihood  Estimator 

Using  Monte  Carlo  simulation  of  the  intensity  mea¬ 
surements,  we  estimate  the  bias,  error  variance,  and  the 
CRLB.  The  results  are  illustrated  in  Figs.  6  and  7.  Fig¬ 
ure  6  shows  that  the  bias  is  reduced  as  the  number  of 
looks  increases.  This  follows  from  the  fact  that  the  ML 
estimator  is  consistent.  We  also  note  that  as  the  rate  in¬ 
creases,  so  does  the  variance  of  the  Poisson  process.  This 
is  evident  in  Fig.  6  since  the  statistical  fluctuation  in  the 
estimate  E{X)  gets  larger  as  the  rate  XA  is  increased. 

A  comparison  between  the  estimated  error  variance 
and  the  estimated  CRLB  is  provided  in  Figure  7  for  dif¬ 
ferent  numbers  of  looks.  For  a  fixed  number  of  looks, 
the  variance  of  the  error  and  the  CRLB  increase  as  the 
rate  gets  larger.  As  the  number  of  looks  increases,  the 
variance  of  the  error  and  the  CRLB  decrease,  and  the 
variance  of  the  error  gets  closer  to  the  CRLB.  Again, 
this  is  expected  because  the  ML  estimator  is  consistent. 

For  large  number  of  looks  (L  =  4),  the  error  variance 
and  the  CRLB  get  very  close  and  the  ML  estimator  be¬ 
comes  optimal  in  the  sense  that  no  other  conceivable  es¬ 
timator  having  the  same  bias  with  the  same  data  can 
perform  better. 

4.  DISCUSSION 

In  this  section,  we  provide  a  comparison  between  the 
performance  of  the  ML  estimators  in  the  constant  ampli¬ 
tude  and  Rayleigh  models. 

The  performance  of  the  ML  estimator  over  regions 
where  the  rate  XA  is  not  sufficiently  large  (<  20)  is  bet¬ 
ter  in  the  constant  amplitude  model  than  in  the  Rayleigh 
amplitude  model.  This  can  be  seen  by  comparing  how 
close  the  error  variance  is  to  the  CRLB  in  Fig.  4  versus 
Fig.  7,  especially  for  the  cases  when  L  =  2  and  4. 

Hence,  the  difference  between  the  performances  of  the 
ML  estimators  for  both  models  is  notable  when  the  rate 
XA  is  small.  On  the  other  hand,  when  the  rate  is  suffi¬ 
ciently  large  (>  25),  the  performances  of  the  estimators 
are  quite  comparable.  This  is  justified  by  the  fact  that 
for  large  rates,  the  likelihood  function  in  the  constant  am¬ 
plitude  model  is  very  sunilar  to  the  one  in  the  Rayleigh 
model. 

Finally,  we  mention  that  in  the  limit  when  the  number 
of  scatterers  N  gets  very  large  (fully  developed  speckle), 
it  follows  from  the  the  centraJ  limit  theorem  that  the 
likelihood  function  of  the  multilook  intensity  T  obeys  a 
Gamma  law  with  parameters  (L,AA).  The  statistic  t  is 
thus  a  complete  sufficient  statistic  for  the  average  reflec¬ 
tivity  (or  mean  intensity)  A  A,  and  A  =  t/{AL)  is  the 
minimum  variance  unbiased  estimator  (MVUE)  and  the 
maximum  likelihood  estimate  of  the  parameter  A. 

5,  CONCLUSIONS  AND  SIGNIFICANCE 

In  this  paper,  we  presented  estimation  algorithms  to 
determine  the  average  surface  reflectivity  and  scatterer 
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Figure  1:  Sequences  of  EM  estimates  for  a  single  look 
and  different  rates  XA.  (Constant  amplitude  model) 

density  (average  number  of  scatterers  within  a  resolution 
cell)  using  intensity  measurements  of  speckled  images. 
The  estimation  schemes  were  based  on  a  marked  Poisson 
point  process  model  for  partially  developed  speckle  and 
were  applied  to  the  particular  model  of  a  constant  mean 
reflectivity  within  the  resolution  cell. 

The  structure  and  performance  of  these  estimators  dif¬ 
fer  significantly  from  that  of  estimation  in  multiplicative 
and  additive  noise  and  are  also  a  function  of  the  par¬ 
ticular  point  process  peurameters  used  to  model  partially 
developed  speckle. 

We  showed  that  the  performance  of  the  estimators  im¬ 
proved  as  the  number  of  looks  was  increased.  For  2  looks 
and  rates  XA  less  than  20,  the  maximum  likelihood  esti¬ 
mator  was  optimal  in  the  sense  that  the  variance  of  the 
error  is  the  smallest  possible  using  any  other  conceivable 
estimate  having  the  same  bias  with  the  same  data.  The 
maximum  likelihood  becomes  optimal  for  all  rates  XA  as 
the  number  of  looks  is  increased  to  4  or  more  looks. 

The  techniques  we  developed  to  estimate  the  scatterer 
density  serve  as  a  powerful  tool  in  SAR  image  classifi¬ 
cation,  SAR  image  reconstruction,  in  addition  to  biolog¬ 
ical  tissue  classification  and  tomographic  reconstruction 
in  biomedical  ultrasound  imaging  systems. 
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Figure  2:  Sequences  of  EM  estimates  for  a  fixed  rate  XA 
and  different  looks.  (Const2uit  amplitude  model) 


Figure  3:  Estimated  bias  of  the  ML  estimator  as  a  func¬ 
tion  of  the  rate  XA  for  1  and  2  looks.  (Constant  ampli¬ 
tude  model) 


XA 

c 

Figure  4:  Estimated  variance  of  the  error  of  the  ML  es¬ 
timator  compared  to  the  estimated  CRLB  as  a  function 
of  the  rate  XA,  (Constant  amplitude  model) 


k(Itaitiai  number) 


Figure  5:  Sequences  of  EM  estimates  for  4  looks  and 
different  rates  XA.  (Rayleigh  amplitude  model) 


Figure  6:  Estimated  bias  of  the  ML  estimator  as  a  func¬ 
tion  of  the  rate  XA  for  1  and  2  looks,  using  3000  runs. 
(Rayleigh  amplitude  model) 


XA 

Figure  7:  Estimated  error  variance  of  the  ML  estimator 
(using  10000  runs)  compared  to  the  estimated  CRLB  as 
a  function  of  the  rate  XA.  (Rayleigh  amplitude  model) 
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ABSTRACT 

In  this  paper,  a  novel  direct  algorithm  to  estimate 
parameters  of  iiactal  interpolation  hinctions  is 
proposed,  and  test  results  of  its  robustness  and  its 
performance  in  signal  compression  are  reported. 

IFS  fractal  interpolation  function  (FTF)  is 
becoming  an  increasingly  appealing  class  of  models 
of  such  signals  as  the  height  distribution  of  sea  floors, 
seismograph,  and  electrocardiograph  signals,  due  to 
its  inherent  advantages. 

Existing  FBF-based  signal  compression  algorithms 
usually  use  a  UF  parameter  estimation  formula 
proposed  by  Mazel  and  Hayes  ([1]),  which  is  based  on 
least  square-error  fitting  techniques  and  needs  to 
calculate  the  derivatives  of  such  a  error  measure  with 
respect  to  FIF  parameters.  It  is  very  inconvenient  to 
introduce  many  other  useful  error  measures  in  real 
signal  processing  applications,  such  as  Kullback 
entropy  or  HausdorfiF  distance,  for  they  may  endanger 
the  computability  of  the  derivatives. 

In  this  paper,  a  computationally  efficient,  direct 
algorithm  to  solve  the  inverse  problem  of  IFS 
interpolation  signals  is  proposed.  It  can  solve  FIF 
parameters  from  its  samples  without  calculating  error 
function  s  derivatives.  The  algorithm’s  robustness  in 
parameter  estimation  and  usefulness  in  signal 
compression  are  shown  with  experimental  results. 

1.  IFS  Fractal  Interpolation 

Interpolation  in  pure  mathematical  sense  is  the 
problem  of  finding,  from  a  given  initial  data  set 

<jf,,/  =  l,2,— ,,V}  (1-1) 

a  continuous  function  f[x)  that  satisfies, 

/(^,)  =  ^,/  =  0,1.2,-,. V.  (1-2) 


There  are  many  ways  to  find  the  f[x)  in  (1-2), 
for  a  A-th  order  IFS  fractal  interpolation  function 
(UF),  f{x)  is  the  y-direction  projection  of  a  2-D  N- 

map  hyperbolic  IFS’  attractor  G.  The  maps  of  the  IFS 
are  affines  of  the  form  : 


satisfi’ing 


w 


\fi  =  \X—,N 

From  (1-1)  to  (1-4).  one  can  find 


«.  =  - 


X,  -X 


c,  =  • 


X  -  X, 

F  -  F 


F  -  F 
■d,  ^  ^0 


-Xo 

XwX,._|  —  XqX^ 
Xv  -x„ 


(1-4) 


(1-5) 


/,  =  -XqL  _  j 


where  (t/,;/ -  1.2,-  -,/Vj  is  a  set  of  free  parameters 
that  controls  the  bo.x-counting  dimension  of  G.  To 
construct  a  hjperbohc  IFS  and  have  stable  iteration, 
it  is  required  that  ||r/,|  <  1;/  =  1,2, A'j . 


2.  A  Simple  Inverse  Problem  Solver 
In  the  discussions  below,  we  suppose  we  are  given 
a  1-D  signal  /(x).  r  j ,  whose  image 

^  ~  {(^’Ax))-x  e[xo,Xj,,]|  is  the  attractor  of  a 
hyperbolic  IFS  in  the  form  of  (1-.3).  From  (1-4)  and 


’  A  research 


supported  by  the  National  Natural  Science  Foundation  of  China. 


0-7803-2516-8/95  $4.00  ©  1995  IEEE 


187 


the  afiSnity  of  loaps,  we  know  that,  when  x  changes 
from  xo  to  x^ ,  the  y-componcnt  of  w,  maps  /(x), 
xe[x<,,x^]  to  the  section  between  [xj_,,X;],  so  we 
have 

/(fl,X  +  Cj)  =  C‘,x  +  </-/(x)  +  /; , 

Vxe[xo,x^]  (2-1) 

On  the  other  hand,  because  the  position  of  the 
first  interpolation  point  is  known,  we  can  see 
from  (1-5)  that  parameters  depend  only 

on  and  x, .  If  we  can  find  a  relationship  between 
</,  and  X, ,  then  by  using  (1-4),  (1-5)  and  testing  if 
equation  (2-1)  holds,  we  will  be  able  to  solve  Xj  and 
then  a,,c,  ,e, .  Similarly,  once  x^.,  is  estimated, 
a,,c,,e,,/j  depends  only  on  d,  and  x, ,  by  using  a 
relationship  of  d,  and  x,  and  (1-4),  (1-5),  (2-1),  we 
will  also  be  able  to  solve  x,  and  . 

To  derive  a  relationship  between  d,  and  x, , 
consider  the  integration 

[  j\aiX  +  e,)d{a,x  +  e) 

=  r  "  (c,  X  +  d,  f{x)  +  /( )d{a,  X  +  e, ) 

Jx=Xo  ' 


=  P  f{x')dx'  (x'=fl,x  +  e,) 
so  we  have 

P  f(x)dx  =  a.c^  f xdx+a,d,  P'' /(x)^& 

Jxj_i  '  -^^0  ''''0 


-^a 


,xr 


dx 


From  (1-5)  and  {2-3) 

cortl  +  cow4  • /cl  +  cow3  •  A:2 

a,  - 


l+k2-k3-^khk4 


where 
coni  = 

con3  - 

k\  =  - 


P  f{x)dx 
a,rf{x)dx' 


con4  = 


P''  f{x)dx 

dxn 


^2  ^2 


k2  =  -- 


k3  =  - 


•'^0 


2p'' 


A:4  = 


(2-2) 


(2-3) 

(2-4) 


(2-5) 


A  complete  algorithm  to  solve  the  parameters 
from  a  1-D  FIF  signal  is  as  follows; 


1.  Set  a  step  s,  for  searching,  let  i=l; 

2.  Choose  X,  =x,_, +s,  as  the  i-th  interpolation 
point; 

3.  Utilizing  (2-4),  (2-5)  to  compute  the  estimation 
of  d,  associated  with  the  current  x, .  If  |t/,|  s  1 , 
go  back  to  step  2;  else  estimate  a, ,  c, ,  e, ,  /,  with 
(1-5); 

4.  Compute  and  temporarily  store  an  error 
function  £(x, )  =  d{A,  ,  B,  ) ,  where 

A,  =(  f{x)'  ^  e  [xi_, ,  X,  ]| ,  B,  =  w,  (/) ,  and 

£)(  v)  is  a  distance  measure; 

5.  If  X,  <x„  -s,  ,  let  X,  =  X,  +5,  and  go  back  to 
step  3;  otherwise  set  the  estimates  of  x.  and  d, 
{x,,d,)  as  the  pair  that  >ield  the  minimum 
value  of  £(x, )  in  step  4; 

6.  If  Xi  <Xfi  -  S2.  let  /  =  /  + 1,  go  back  to  step  2, 

otherwise,  let  A'  =  /' .  xn  =  x,.  and  temunate 
the  algorithm. 

In  the  above  algorithm,  5,  is  a  searching  step 
which  can  be  set  to  1  for  time  discrete  signals,  is 
used  to  control  the  quality  of  signal  compression,  it 
can  be  determined  from  compression  ratio 
requirements  or  o  pnot  knowledge  of  the  original 
signal  ([3]). 

The  above  algorithm  does  not  need  to  calculate 
error  function  deviates  in  the  estimation  of  any  FIF 
parameters,  as  a  result,  there  is  virtually  no  limitation 
on  the  choice  of  error  function. 

The  computational  complexity  of  our  algorithm 
is  very  small.  As  a  matter  of  fact,  it  took  a 
486DX2/66  only  a  little  over  10  seconds  to  solve  a 
1025  point  FIF  signal  in  our  experiment. 

3.  Simulations  and  Conclusions 

We  present  three  examples  of  the  applications  of 
our  algorithm.  We  use  the  sum  of  signal  sample 
values  to  approximate  integration  and  mean-square 
error  to  judge  approximation. 

In  the  first  scenario,  our  algorithm  is  used  to 
estimate  the  parameters  of  the  4th  order  FIF  in  Fig.l. 
The  algorithm  finds  all  the  correct  interpolation  point 
positions,  the  estimated  ’s  are  also  very  close  to 
the  original  (Table.!).  The  reconstructed  FIF  is 
virtually  indistinguishable  from  the  original. 

In  the  second  scenario,  our  algorithm  is  used  to 
estimate  FIF  components  in  the  signals  of  Fig.2-1 
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and  Fig.3-1.  Fig.2-1  is  constructed  by  adding  a  zero- 
mean  white  additive  noise  with  uniform  amplitude 
distribution  to  the  FIF  in  Fig.l,  while  Fig.3-1  is  the 
sum  of  Fig.  i  and  a  period  25  samples  sine  wave.  The 
SNRs  for  Fig.2-1  and  Fig.3-1  are  0.06dB  and  0.02dB 
respectively.  The  FTFs  our  algorithm  reconstructs  are 
plotted  in  Fig.2-2  and  Fig,3-2,  whose  parameters  are 
listed  in  Tables  2  and  3.  The  SNRs  for  reconstructed 
FIFs  with  respect  to  the  original  are  2.9dB  and  3.5dB 
respectively. 

Finally,  we  present  the  result  of  FIF/DFT  hj'brid 
compression  of  a  short  segment  of  real-world  music 
signal  /(n)  (Fig. 4-1).  We  first  approximate  it  with  a 

16lh  order  FIF  /,{n),  then  the  difference  between 
f{n)  and  f,{n)  is  compressed  with  DFT  transform 

coding.  Preserving  291  DFT  coefficients 
(corresponding  to  the  DC  component  and  145  AC 
components)  that  are  “furthest”  away  from  the  mean 
of  all  DFT  coefficients  and  setting  the  rest  of 
coefficients  to  the  mean,  we  get  signal/,(w). 

Reconstructed  signal  is  the  sum  of  f\n)  and  f\n) . 

Differential  coding  can  compress  DFT  coeffi¬ 
cient  index  and  interpolation  point  position 
information  to  4bits/index  and  6bits/point 
respectively.  Each  d.  is  quantized  to  1  Obits,  and  the 

real  and  imaginative  parts  of  DFT  coefficients, 
sample  values  at  interpolation  points  are  saved  with 
16bits  each.  Because  the  original  signal  is.  a  1025- 
point  16bits/sample  sequence.  the  overall 
compression  ratio  is  a  little  over  3:1.  and  SNR  is 
about  28.7dB.  Using  more  complex  quantization 
schemes  and  entropy  coding  techniques  can  achieve 
about  5:1  compression  with  approximatelv  the  same 
SNR. 

In  conclusion,  we  broaden  the  range  of  FIF- 
based  signal  modeling/compressioii  by  proposing  an 
algorithm  to  solve  the  parameters  of  a  FIF  from  its 
samples  which  relaxes  the  requirement  to  error 
metric  derivability.  Our  algorithm  can  also  be  used  to 
estimate  fractal  components  in  signals.  It  is  able  to 
reconstruct  a  fairly  good  “outlook”  of  the  embedded 
FIF,  even  at  ver>^  low  SNRs.  Though  the  algorithm 
realization  is  simple  and  our  studies  preliminary,  the 
performance  of  the  algorithm  is  encouraging. 
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x.  (x. ) 

d.  /  d. 

0 

0 

-3.000 

/ 

1 

256 

-9.000 

0.800/0.797 

2 

512 

-5.000 

0.800/0.819 

3 

768 

-10.000 

0.800  /  0.784 

nz 

1024 

-24.000 

0.800/0.790  1 

Table.  1  Original  and  Estimated  FIF 
Parameters  of  Fig.l 
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-1.893 

/ 
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256 

-6.296 

0.840 

2 

512 

-1.366 

0.847 

3 

748 

1.7601 

0.774 

4 

1024 

-33.998 

0.785 

Table.2  Estimated  FIF  Parameters  From  Fig.2-1 
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0 

-0.762 

/ 

1 

256 

-0.159 

0.394 

2 

511 

1.276 

0.209 

3 

762 

-0.357 

4 

1024 

0.424  1 

Table.3  Estimated  FIF  Parameters  From  Fig.3-1 
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1.  Introduction 

As  well  known  the  K-distributed  clutter  can  be  described  as 
the  product  model  of  a  texture  variable,  representing  the 
properties  of  the  observed  surface,  times  an  exponential 
component,  usually  named  speckle,  representing  the  effect 
of  the  coherent  radar  sensor.  The  performance  of  usual 
CFAR  schemes  for  the  detection  of  radar  targets  embedded 
in  this  clutter  depends  strictly  on  the  algorithm  used  to 
select  the  detection  threshold.  In  particular  Watts  pointed 
out  in  [1]  that  the  exact  knowledge  of  the  texture  value  at 
each  instant  allows  an  ideal  adaptive  threshold  to  be  set 
only  on  the  speckle  component.  This  selection,  named 
'ideal  CFAR',  is  shown  to  widely  outperform  the  fixed 
threshold  detection  scheme,  especially  for  spiky  clutter. 
The  performance  of  the  ideal  CFAR  defines  an  upper 
bound  to  the  detection  probability  in  K-distributed  clutter, 
but  it  is  purely  theoretical  since  the  knowledge  of  the 
texture  variable  is  not  available  in  practice.  In  the  present 
paper  we  derive  two  estimators  of  the  local  texture  value  on 
the  basis  of  the  echoes  received  from  the  N  range  cells 
closest  to  the  cell  under  test  and  use  the  estimated  texture 
value,  instead  of  the  exact  one,  to  set  the  detection 
threshold.  The  performance  is  clearly  highly  dependent  on 
the  range  correlation,  which  determines  the  texture 
estimation  accuracy,  and  its  analysis  allows  us  to  assess 
how  far  is  the  'ideal  CFAR'  from  the  practical 
implementation  in  the  different  conditions.  The  two 
estimators  distinguish  for  using  or  not  some  prior 
information  about  the  texture  distribution.  An  evaluation  of 
its  importance  can  therefore  be  obtained  from  the 
comparison  of  their  performances. 

2.  Correlated  clutter  model 


the  exponentially  distributed  variable  z  (with  mean  value 
1),  usually  named  'speckle',  times  the  variable  t,  referred  to 
as  'texture'.  The  texture  variable,  which  modulates  the 
variance  of  the  speckle  contribution,  is  a  characteristic  of 
the  observed  scene  and  has  a  gamma  probability  density 
function  (pdf) 


(1) 


Px('C)  = 


r(v) 


-T 


X  e  ^ 


x>0. 


Its  mean  value  \i  is  the  mean  reflected  power  and  the  order 
parameter  v,  which  controls  its  variance,  is  a  measure  of 
the  inhomogeneity  of  the  scattering  from  the  scene. 


To  derive  the  optimum  estimation  structures,  we  describe 
the  model  for  the  vector  x  =  ‘  L=N/2), 

of  the  echoes  n  =  -  L,  •  •  • ,  L  received  from 

N+1  adjacent  range  cells.  With  reference  to  a  maritime 
search  radar  we  can  assume  that  the  texture  contributions 
from  successive  range  cells  show  some  partial  correlation, 
depending  on  the  sea  swell  configuration.  On  the  contrary, 
at  least  as  a  first  approximation,  the  speckle  components  in 
in  adjacent  range  cells  can  be  considered  decorrelated.  In 
the  presence  of  target  the  central  component  has  a  Rice 
pdf  with  gain  factor  1  and  noncentral  parameter  5  equal  to 
the  target  signal  power.  Thus  the  multivariate  pdf  of  X  can 
be  written  as 


(2) 


Px  (2^)  =  J  •  •  •  J /’x/T  /  I)  •  Px  (I) 


0  0 

n=-L; 


^  e-'  /o 

to 

A 

L  V 

0 

1 _ 

dx 


As  well  known  the  K-distribution  [2]-'[3]  is  an  adequate 
statistical  description  for  sea  clutter  echoes  received  from  a 
low  grazing  angle.  According  to  this  description  the 
square-law  detected  intensity  of  the  single  received  echo, 
x  =  X'Z,  can  be  interpreted  as  the  product  model  [2]  of 


where  the  correlation  structure  of  the  texture  component  is 
implicit  in  the  multivariate  pdf  /?^(x)  of  the  vector 

X  =  [x_^,'**,T^]^,  representing  the  N+1  partially 
correlated  texture  variables.  To  simplify  the  analysis  we 
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assume  a  first  order  Markov  structure  for  the  texture 
component,  which  allows  us  to  use  the  factorization 

(3) 

Pt(I)  “  Pz2l'ix  *  *  'Px2lXx  ‘ 

with  conditional  probability 


texture  is  highly  correlated.  According  with  this 
consideration,  a  first  estimator  will  be  obtained  assuming 
it  completely  correlated:  and  deriving  the 

Maximum  Likelihood  (ML)  estimate  of  x.  Namely  we 
maximize  the  conditional  pdf  of  the  vector  x ,  given  x: 

(5)  =  exp  (-1 

L, n^O 


(4) 


V  1  ( 


v-1 

1  2  - 

e 


n  1-p^  J 

^v-1 


where  p  is  the  one-lag  correlation  coefficient.  For  half¬ 
integer  Oder  parameters,  the  gamma  correlated  sequence 
with  conditional  pdf  defined  by  eq.  (4)  can  be  derived  as 
the  sum  of  the  squares  of  2v  gaussian  exponentially 
correlated  variables  with  one-lag  correlation  coefficient 

-y/p  and  is  itself  exponentially  correlated  [4]. 


In  the  detection  it  is  of  high  interest  to  assure  the 
maximum  probability  of  detection  (Pd),  maintaining  the 
constant  false  alarm  rate  (CFAR)  condition.  Assuming  an 
ideal  knowledge  of  the  texture  in  each  cell,  this  is  obtained 
by  the  ideal  CFAR,  by  setting  the  detection  threshold  on 
the  speckle  component  only.  In  practice  this  knowledge  is 
not  available  and  we  replace  the  true  value  with  its 
estimate,  based  on  the  N  closest  range  cell.  The  work  is 
therefore  highly  based  on  the  joint  pdf  model  of  the  N+1 
received  echoes  in  eq.  (2),  which  will  be  exploited  to  derive 
optimum  texture  estimators.  The  performances  obtained 
using  the  different  estimators  will  be  evaluated,  showing 
their  strong  dependence  on  the  correlation  coefficient  p. 


3.  Texture  estimators 


Using  the  pdf  model  in  eq.  (2)-(4)  we  can  tackle  the 
derivation  of  the  texture  estimators.  We  will  derive  two 
different  estimators  operating  respectively  in  the  domain  of 
intensity  and  in  the  domain  of  the  logarithm  of  intensity. 
The  former  is  the  more  natural  domain  and  will  be 
considered  first.  The  latter  domain  shows  a  series  of 
desirable  properties  as  much  lower  estimation  errors  for  the 
parameters  of  the  model  [4]-[5].  Moreover  it  appears  to  be 
especially  suited  for  variables  with  product  model 
distribution,  where  the  logarithmic  transformation  converts 
the  product  into  a  sum  of  variables.  Hie  estimators,  derived 
below,  make  use  of  the  N  samples  around  the 

central  sample  Xq,  which  is  the  cell  under  test. 

Since  the  estimation  of  the  texture  in  the  central  range  cell 
is  obtained  using  the  informations  in  the  neighbor  range 
cells  only,  sensible  results  can  be  achieved  only  if  the 


Nulling  the  derivative  of  the  logarithm  of  eq.  (5)  with 

L 

respect  to  x  we  get  T  =  ™  ^  .  This  is  coincident 

n--L,  n?tO 

with  the  usual  cell-average  (CA)  CFAR  scheme,  which 
shows  to  be  optimum  for  the  estimation  of  the  local  texture 
variable,  assuming  a  complete  correlation.  Even  though  the 
CA  scheme  has  been  derived  for  this  specific  case,  it  can  be 
applied  to  every  correlation  condition.  A  loss  in  detection 
performance  is  expected,  which  grows  as  p  decreases. 

To  obtain  an  accurate  estimator  when  the  texture  is  par¬ 
tially  correlated,  the  prior  pdf  of  the  texture  sequence  must 
be  properly  represented  in  the  model  used  for  the  optimiza¬ 
tion.  This  leads  to  complex  equations  in  the  domain  of 
intensity.  An  approximate  solution  can  be  found  in  the 
logarithmic  domain,  where  the  product  model  is  mapped  in 
a  sum:  =  In  =  In  +  In A  simpli¬ 

fied  texture  estimator  is  obtained  by  approximating  the  pdf 
of  the  two  logarithmic  transformed  variables  by  two 
gaussian  distributions  with  the  same  first  and  second  order 
statistics  of  the  true  distributions.  Namely  for  w  =  In  z  we 
have  a  mean  value  =  — y  (7  is  the  Euler  constant)  and 

an  identity  covariance  matrix  I,  where 

Gy  =71^/6.  f  =  lnx  has  a  mean  value 
m,  =(lnT)=\|r(v)-ln(v/|J,),  where  \|l(v)is  the 
digamma  function  and  the  element  (n,k)  of  its  covariance 
matrix  is  ^In  In  ^  —  ^In  x)^ ,  derived  analytically 
in  [4].  Thus  the  approximate  joint  distribution  of  the 
logarithms  of  texture  £  =  » •  *  •  ?  Lp  ‘ ‘ 

received  data  vector  = 

with  a  zero  replacing  the  echo  from  the  cell  under  test,  can 
be  written  as 

(6)  (W’  0 = Pw/t  /  t)p,  it)  = 


1  T 

I 
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where  =  [?_£,••  s  ?_p  0,  ]  is  obtained  from 

nulling  its  central  component.  The  vector  1  has  all  the 
components  equal  to  1,  except  for  the  central  one,  which  is 
set  to  0,  and  the  vector  w,  has  all  the  components  equal  to 

m, .  A  Maximum  a  Posteriori  (MAP)  texture  estimate, 

called  LMAP,  is  found  by  nulling  the  N+1  derivatives  of 
the  logarithm  of  eq.  (6)  with  respect  to  the  components  of 
I 
(7) 

The  system  in  eq.  (7)  can  be  solved  as 

i  =  U'+  R,  ‘)”'w+(/'+  -m„l)  , 

where  I'  is  obtained  from  the  (N+l)-dimensional  identity 
matrix,  by  nulling  the  central  component.  Thus  in  the 
domain  of  the  logarithm  we  obtain  a  linear  estimator.  The 
estimated  texture  intensity  for  the  central  range  cell  can 

therefore  be  expressed  as  Tq  =  exp(fQ)  . 

It  is  instructive  to  observe  that  CA  doesn’t  require  the 
knowledge  of  the  covariance  matrix  of  texture  and  is  easily 
implemented  in  all  correlation  conditions,  even  though  it  is 
optimum  only  for  completely  correlated  texture.  On  the 
contrary,  LMAP  uses  the  informations  about  the  prior 
distribution  of  the  texture  to  get  a  better  estimation 
accuracy.  It  requires  for  practical  application  a  map  of 
correlation,  or  its  adaptive  estimation.  Next  Section  will  be 
devoted  to  the  performance  analysis  of  the  proposed 
estimators,  with  special  attention  to  the  evaluation  of  the 
improvement  obtained  by  using  the  prior  information. 

4.  Detection  performances 

The  analytical  performance  analysis  can  be  tackled  only  in 
the  completely  correlated  condition  and  even  in  that  condi¬ 
tion  there  are  no  closed  form  expressions  [1].  Therefore  we 
will  resort  to  simulations  to  assess  the  detection  perform¬ 
ance  of  the  different  schemes.  We  generate  the  N+1  clutter 
samples  with  correlated  texture  and  independent  speckle 
components,  according  to  the  model  of  Section  2.  A  set  of 
signals  with  different  amplitudes  are  added  coherently  to 
the  central  range  cell  to  obtain  the  performance  as  a  func¬ 
tion  of  the  signal-to  clutter-ratio  SCR=5/p..  For  each  of  the 

10®  trials  the  echoes  from  the  N=8  external  range  cells  are 
used  to  estimate  the  local  texture  value,  in  accordance  with 
the  proposed  estimation  schemes,  and  the  central  cell  is 
compared  to  the  detection  threshold  X,  =  G  •  T.  The  prob¬ 


ability  of  false  alarm  is  maintained  constant,  =  10"^, 
by  adequate  selection  of  the  threshold  gain  G. 

The  simulated  results  for  the  CA  are  shown  in  Fig.  1  for  v 
=0.5,  as  functions  of  the  signal-to-clutter  ratio  SCR=6  /  |i,. 
The  performance  of  ideal  CFAR  and  fixed  threshold  are 
also  given  for  comparison.  It  is  instructive  to  observe  the 
decrease  of  the  detection  performance  with  decreasing 
texture  correlation  p.  In  fact,  the  neighbor  cells  are  less 
and  less  representative  of  the  texture  value  in  the  central 
range  cell.  In  particular  for  p=l  the  detection  curve  is  not 
far  from  the  curve  obtained  with  the  'ideal  CFAR'.  In  fact 
in  this  condition  we  have  a  good  estimation  of  the  local 
texture  value.  For  lower  p  values  the  detection  performance 
degrades,  crossing  the  fixed  threshold  curve,  because  the 
estimation  variance  is  much  higher.  Specifically  for  p=0 
the  estimation  cells  don't  contain  any  informations  about 
the  texture  in  the  central  range  cell;  this  causes  the 
detection  curve  to  be  always  lower  than  the  fixed  threshold 
curve.  Fig.  2  and  3  show  a  comparison  of  the  detection 
performances  of  CA  and  LMAP  for  v=0.5  and  p=0.7^0.95 
and  for  v=5  and  p=0. 7-50.95  respectively.  LMAP  detection 
curves  are  always  contained  between  'ideal-CFAR'  and 
fixed  threshold  ones  (fi-om  more  extended  simulation  we 
can  assess  that  this  apply  to  every  value  of  p).  Moreover 
the  LMAP  estimator  is  less  dependent  on  the  correlation 
than  the  CA  and  provides  always  the  best  performance. 

The  improvement  obtained  with  respect  to  the  CA  scheme 
depends  on  the  texture  variance,  namely  on  the  inverse  of  v 
Moreover,  it  appears  evident  that  for  v=5  the  spread  of  the 
detection  performances  with  the  correlation  coefficient  p  is 
much  reduced  with  respect  to  the  condition  of  v=0.5.  This 
can  be  explained  in  terms  of  the  normalized  texture 
variance,  which  is  equal  to  1/v  and  is  reduced  as  v 
increases.  For  a  high  variance  an  accurate  estimator  is 
important.  On  the  contrary,  with  a  small  texture  variance, 
even  though  the  detection  scheme  uses  an  estimate  of  the 
texture  mean  value  instead  of  an  estimate  of  the  local 
texture,  the  detection  performance  is  not  highly  affected. 
The  analysis  of  accurate  texture  estimators  is  therefore  of 
interest  mainly  for  very  spiky  clutter  distributions.  In  this 
condition  we  have  shown  that  the  new  LMAP  estimator 
achieves  better  performance  than  the  more  usual  CA.  This 
shows  the  importance  of  the  use  of  prior  information 
especially  for  the  partially  correlated  case.  From  the 
behaviour  of  the  plots  we  can  also  observe  that  for  high 
correlation  and  high  SCR  the  LMAP  is  close  enough  to  the 
ideal  CFAR.  The  validity  of  the  latter  for  practical 
application  is  therefore  assessed  in  this  condition.  It  is 
evident,  though,  its  purely  theoretical  significance  for  low 
texture  correlations. 
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Fig.  1  -  Pd  as  a  function  of  the  SCR  for  the  CA  scheme 
for  K-distributed  clutter  with  v=0.5,  N=8  range 
cells  and  p=l,  0.95,  0.7,  0  (curves  in  decreasing 
order)  compared  to  'ideal  CFAR',  (O),  and  fixed 
threshold,  (*). 


Fig.  2  -  Comparison  of  the  Pd  as  a  function  of  the  SCR  for 
the  CA  and  LMAP  schemes  for  K-distributed 
clutter  with  v=0.5  and  N=8  range  cells.  p=  0.95: 

CA,  LMAP,  ( _ );  p=0.7,  CA,  ( . ), 

LMAP,  ( - ).  For  comparison  'ideal  CFAR',  (O 

),  and  fixed  threshold,  (*). 


Fig.  3  -  Comparison  of  the  Pd  as  a  function  of  the  SCR  for 
the  CA  and  LMAP  schemes  for  K-distributed 
clutter  with  v=5  and  N=8  range  cells.  p=  0.95: 

CA,  LMAP,  ( _ );  p=0.7,  CA,  ( . ), 

LMAP,  ( - ).  For  comparison  'ideal  CFAR',  (O 

),  and  fixed  threshold,  (*). 
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Abstract ; 


Although  originally  analog, 
SIGMA-DELTA  techniques  have  be¬ 
come  a  DSP  tool.  After  a  review 
of the  basic  principles, a  design 
technique  is  presented  for  the 
bandpass  loop  filter  of  a  high 
performance  modulator  and  stabi¬ 
lity  issues  are  discussed. Three 
applications  linked  to  the  area 
of  communications  are  briefly 
discussed,  a  A/D  converter  for 
mobile  radio,  a  direct  digital 
synthesizer  and  a  clock  synchro¬ 
nization  issue. 


I  -  INTRODUCTION 

SIGMA-DELTA  is  a  feedback  tech¬ 
nique  which  was  introduced  for 
the  digitization  of  telephone 
signals  several  decades  ago  [1] . 
Its  most  successful  field  of  ap¬ 
plication  has  probably  been  the 
realization  of  high  accuracy 
integrated  analog/digital  (A/D) 
converters [2] .Now, it  is  used  in 
many  technical  fields,  but  the 
denomination  of  SIGMA-DELTA  mod¬ 
ulator  has  survived,  from  the 
initial  application.  Following 
the  trend  toward  digital  signal 
processing,  it  has  become  a  DSP 
tool . 

The  concept  is  shown  in  fig.l. 


Fig.l  A  SIGMA-DELTA  modulator 


The  system  output  y(n)  is  fed  back 
and  subtracted  from  the  input  x(n) . 
G(Z)  is  the  feedback  filter  and 
H(Z)  is  called  the  loop  filter. The 
box  Q  represents  a  non-linear  ope¬ 
ration,  which  generally  corresponds 
to  a  quantization  or  rounding  func¬ 
tion.  In  a  simplified  analysis,  it 
can  be  modelled  as  a  source  of  an 
additive  white  noise  b(n)  and  the 
system  operation  is  described,  in 
terms  of  Z-transf orms ,  by 

(1) 

H(Z)  1 

Y(Z)=X(Z) -  +B(Z) - 

1+G(Z)H(Z)  1+G(Z)H(Z) 

The  input  X(Z)and  the  quantization 
noise  B(Z)  are  multiplied  by  two 
different  transfer  functions  and, 
thus, the  spectral  distributions  in 
the  output  of  the  input  signal  and 
the  noise  can  be  controlled. 

The  following  particular  case  is 
worth  pointing  out 

1 

H(Z)=  -  (2) 

1  -  G(Z) 


The  diagram  is  shown  in  fig. 2  and 


Fig. 2.  Noise  shaping  circuit 
the  equations  become 

Y(Z) =X (Z) +B (Z)  [l-G(Z)  ] 
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Only  the  quantization  noise  is 
affected  by  the  system  which 
is  a  noise  shaping  circuit. 

Both  approaches  in  figures  1 
and  2  are  used  in  applications. 

The  performance  of  the  system 
is  critically  dependent  on  the 
filters . 


II -FILTER  DESIGN- STABILITY 


inside  the  unit  circle . Then, the  ma¬ 
gnitude  of  the  oscillation  is  H(-l) . 
In  order  to  reject  the  noise  out  of 
the  passband,the  gain  of  the  filter 
H(Z)  must  be  large  and  the  poles 
must  be  located  in  the  passband  and 
close  to  the  unit  circle. An  effici¬ 
ent  approach  for  the  filter  design 
consists  in  using  inverse  notch 
sections  defined  by 

(6) 

l-2(l-a)cose  Z-1  +(l-a)2  z-2 
HI  (z)  - - — 


In  some  major  applications, 
like  A/D  conversion,  the  feed¬ 
back  filter  is  often  just  a 

one  sample  delay, i . e .G (Z) =  Z~^ 
and  the  design  problem  concen¬ 
trates  on  the  loop  filter. Then, 
stability  analysis  has  to  be 
included  in  the  design  process 
and  the  model  shown  in  fig. 3 
is  considered . The  quantizer  is 


Fig. 3.  Model  for  stability 
analysis 


replaced  by  an  amplifier  with 
gain  cK  .Assuming  that  H(Z) 
takes  the  form 

H(Z)  =  N(Z)  /  D(Z)  (4) 

thesignal  transfer  function  is 
U(Z)  N(Z) 

K(Z)=  - (5) 

X(Z)  D(Z)  +0/  Z-1n(Z) 

The  stability  of  the  system 
depends  on  the  poles  of  that 
function.  Their  locations  in 
the  complex  plane  in  turn  de¬ 
pends  on  the  value  oC  and  a 
root  locus  analysis  has  to  be 
carried  out  [3] .The  system  is 
stable,  when  it  oscillates  at 
half  the  sampling  frequency 
fs/2,  i.e.  a  pole  sits  at  -1, 
and  all  the  other  poles  are 


l-2(l-b)cose  Z-1  +(l-b)2  z-2 

a  and  b  being  positive  scalars  such 

that:  0<b<a<<l. 

The  peak  magnitude  of  the  frequency 
response  is  approximately  a/b. 

A  procedure  to  include  the  stabili¬ 
ty  in  the  design  is  as  follows.  The 
poles  and  the  zeros  of  the  filter 
sections  are  chosen  to  comply  with 
the  width  of  the  passband  and  the 
gain  objectives.  Then,  a  pair  of 
zeros  is  moved,  so  that  all  the 
other  poles  of  K(Z)  are  within  the 

unit  circle  when  OC  is  such  that  the 
real  pole  reaches  (-1) . 

As  an  illustration,  the  dotted  area 
in  fig. 4  is  the  authorized  domain 


Poles 

Zeros 

-0.2012±0.9693i 

-0.0935±0.4912i 

-0.1851±0.9725i 

7 

Fig. 4 -a .Authorized  domain  for  the  se 
cond  zero  pair  in  an  order  4  filter, 
when  a  zero  pair  (and  poles) is  fixed 
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Poles 

Zeros 

-0.2012±0.9693i 

0.5354±0.4510i 

-0.1851±0.9725i 

? 

Fig.4-b.  Alternative  authori¬ 
zed  domain. 

for  a  pair  of  zeros,  when  two 
pole  pairs  and  a  zero  pair 
are  fixed  in  a  system  of  or¬ 
der  4. It  is  obtained  by  a  sys¬ 
tematic  search  on  a  grid.  In 
fig.4-b,  a  different  choice  of 
the  fixed  zero  pair  yields  a 
different  domain.  In  an  itera¬ 
tive  procedure,  all  the  poles 
and  zeros  can  be  moved  to 
optimize  the  design. 


Ill-  A/D  CONVERTER  FOR 
MOBILE  RADIO 

This  is  a  typical  field  of 
application [5] .The  method  des¬ 
cribed  above  has  been  applied 
to  the  design  of  an  A/D  conver¬ 
ter  for  mobile  radio  data  sig¬ 
nals,  in  the  vicinity  of  the 
10.7  MHz  intermediate  frequen¬ 
cy.  Considering  a  filter  with 
order  M=4,  the  poles  and  zeros 
obtained  are  given  in  fig. 5, as 
well  as  the  spectrum  of  the 
output  signal,  when  the  input 
is  a  sinewave.  The  passband  of 
the  useful  signal  is  approxima¬ 
tely  50kHz  and  the  locpp  filter 
gain  exceeds  60  dB, which  leads 
to  an  accuracy  of  about  10 
bits  for  the  A/D  converter. 


Poles 

Zeros 

-0.1756±0.8827i 

0.2268±0.4456i 

-0.1970±0.9907i 

0.4388±0.2397i 

Fig . 5 . Spectrum  of  the  output  signal 
for  a  filter  of  order  M=4 . 

The  procedure  can  be  exploited  to 
design  filters  with  higher  orders 
leading  to  A/D  converters  of  grea¬ 
ter  accuracy.  The  practicality  of 
such  designs  relies  on  the  technol¬ 
ogy  employed  for  the  implementation 
and  the  context  in  which  the  equip¬ 
ment  is  operated. 


IV-  DIRECT  DIGITAL  FREQUENCY 
SYNTHESIZER 

The  flexibility  of  digital  circuits 
can  be  exploited  in  frequency  syn¬ 
thesis  [6] .  The  bottleneck  is  gene¬ 
rally  the  need  for  an  accurate  and 
extremely  fast  D/A  converter.  In 
some  applications,  like  microwave 
communications, the  frequency  domain 
in  which  the  sinewaves  have  to  be 
generated  is  significantly  smaller 
than  the  values  of  the  frequencies 
of  the  sinewaves  themselves,  giving 
rise  to  a  sort  of  reduced  band  DDS . 
For  example, it  can  be  50  MHz  around 
200  MHz. 

In  those  conditions,  a  SIGMA-DELTA 
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circuit  can  produce  a  quantiza-  frequency  above  the  loop  filter 
tion  noise  shaping  and,  thus,  bandwidth, through  noise  shaping.  An 
attenuate  the  noise  in  the  cri-  efficient  scheme  is  shown  in  fig. 7, 
tical  band,  which,  for  a  given  for  order  3.  It  is  a  cascade  of  3 
D/A  converter, improves  the  spu-  sections  and  the  noise  produced  by 
rious  performance.  The  concept  the  decision  circuit  D  is  multi- 


is  shown  in  fig. 6.  The  noise  plied  by  (1-Z"i)3  . 


Fig. 6.  Reduced  band  Direct  Digital  Synthesizer  with  noise  shaping 


Fig. 7.  A  cascade  structure  for  noise  shaping 


rejection  capacity  of  the  fil¬ 
ter,  and  its  design,  is  related 
to  the  performance  objectives 
and  the  characteristics  of  the 
D/A  converter. 

V  -  CLOCK  SYNCHRONIZATION 

In  various  applications,  like 
broadcasting,  digital  storage, 
telephony  or  remote  sensing,  a 
clock  produced  by  a  source  has 
to  be  transmitted  over  data 
channels  and  accurately  resto¬ 
red  at  the  receiving  side, gene¬ 
rally  with  the  help  of  a  phase 
lock  loop.  The  quality  of  the 
recovered  clock  depends  on  the 
width  of  the  lowpass  loop  fil¬ 
ter  and  on  the  spectrum  of  the 
digital  information  transmit¬ 
ted,  for  example  under  the  fom 
of  time  stamps,  stuffing  bits, 
or  other  timing  data  [7] . 

In  the  spectrum  of  the  transmi¬ 
tted  signals,  the  undesirable 
components  can  be  shifted  in 


To  conclude,  SIGMA-DELTA  techniques 
can  adjust  to  different  contexts  in 
terms  of  functionalities  and  perfor¬ 
mance  and  have  a  great  potential . 
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Abstract  -  An  adaptive  code  rate  chfinge 
scheme  in  DS-SSMA  systems  is  proposed.  In  the 
proposed  scheme,  the  error  correcting  code  rate 
is  changed  according  to  the  channel  state,  the 
effective  number  of  users.  The  channel  state  is 
estimated  based  on  retransmission  requests.  The 
criterion  for  the  change  of  the  code  rate  is  to  max¬ 
imize  the  throughput  under  given  error  bound. 


1  Introduction 

In  FH-SSMA  systems  using  Reed-Solomon  error 
correcting  codes,  optimum  code  rate  and  optimum 
number  of  users  for  maximum  throughput  can  be 
obtained  when  the  number  of  hopping  frequencies 
and  the  packet  error  probability  are  given  [1]. 

In  DS-SSMA  systems,  however,  neither  optimum 
code  rate  nor  optimum  number  of  users  has  been 
obtained.  In  DS-SSMA  systems,  packet  error  proba¬ 
bility  and  channel  throughput  are  merely  increasing 
functions  of  the  code  rate  and  number  of  users.  In 
this  paper,  a  transmission  scheme  is  proposed  which 
controls  the  error  correcting  code  rate  adaptively  ac¬ 
cording  to  the  channel  states  to  maximize  through¬ 
put  when  the  packet  error  probability  is  fixed  in  DS- 
SSMA  communication  systems. 

In  this  paper,  retransmission  requests  is  used  to 
estimate  the  channel  state,  t.e.,  the  number  of  users, 
which  has  major  effects  on  the  bit  error  performance. 


2  The  System  Model 


The  system  considered  here  is  shown  in  Figure  1. 


In  Figure  1,  ajb(f),  6jb(f),  dk{t),  and  6k  are  the  random 
signature  signal  with  bit  duration  Tc,  encoded  signal 
with  bit  duration  T,  data  signal,  and  phase  of  the 
kth  carrier,  respectively.  The  common  carrier  (an¬ 
gular)  frequency  ojc  is  known,  and  K  is  the  number 
of  users.  We  assume  that  each  signature  sequence 
has  an  integer  period  N  =  T/Tc.  The  transmitted 
signal  for  the  kth.  user  can  be  written  as 


Sk{t)  =  VWRe  [ak{t)bk{t)exp{ju)ct  -f  jOk)]  (1) 

where  P  is  the  signal  power. 

Assuming  that  the  channel  is  a  Rician  fading  chan¬ 
nel,  the  faded  signal  can  be  written  as 


yk{t)  -  Re{uk{t  -  rk)exp[juc{t  -  rk)-\-  jOk]] 

■hnk{t)  (2) 

where  nk{i)  is  the  additive  white  Gaussian  noise 
(AWGN)  term. 


Uk{t)  =  7  j 


Pk{r,t)sk{t  -  r)dT  +  Sk{t)  ,  (3) 


and  7  is  the  transmission  coefficient  (or,  fading  inten¬ 
sity)  of  the  fading  channel,  with  Pk{ii  t)  a  zero-mean 
complex  Gaussian  random  process.  We  assume  that 
the  effective  number  of  users  and  the  fading  environ¬ 
ment  are  varying  slowly. 

Figure  2  shows  the  receiver  model,  where  it  is  as¬ 
sumed  that  the  phase  of  the  despreading  signal  is 
matched  to  the  received  signal,  i.e.  (pk  =  0k  —  ^^c^k- 
It  is  assumed  that  Reed-Solomon  code  is  used  for  er¬ 
ror  control,  and  the  selective  repeat  request  strategy 
is  employed  in  the  ARQ  [2], 
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We  change  the  code  rate  by  changing  the  informa¬ 
tion  bit  length  with  the  code  word  length  fixed  .  We 
assume  that  backloged  packets,  requested  through 
an  ideal  backward  channel,  are  retransmitted  in  the 
next  packet  transmission  slot  with  probability  1.  In 
any  time  slot,  K  packets  are  transmitted:  if  a  user  is 
requested  for  a  retransmission  the  user  is  not  allowed 
to  send  a  new  packet. 


The  probability  Pe  that  the  receiver  commits  an 
error  when  the  ARQ  strategy  is  used  is  given  by 

Pe  = 


i=l 


Pu 


Pc  +  Pu 


(9) 


2.1  Bit  Error  Probability 

Assuming  the  number  of  users  is  large  enough  to 
satisfy  the  central  limit  theorem,  it  has  been  shown 
in  [3]  that  the  approximation  of  multiple  access  in¬ 
terference  (MAI)  to  AWGN  results  in  quite  close  cal¬ 
culation  of  the  bit  error  probability.  Under  the  ap¬ 
proximation,  the  bit  error  probability  is 

Pb  =  -1=  I _ exp{-u^/2)du  (4) 

V  JSNR 

where  SNR  is  the  effective  signal  to  noise  ratio  (S- 
NR)  with  the  approximation  taken  into  account.  The 
effective  signal  to  noise  ratio  at  the  input  of  the  de¬ 
coder  in  Figure  2  has  been  calculated  [4].  The  es¬ 
timation  procedures  in  this  paper  are  based  on  the 
observation  that  SNR  in  (4)  can  be  expressed  as 

SNR  =  (CA"  +  D)-^/^  ,  (5) 

where  is  the  effective  number  of  users  and  C  and 
D  are  determined  by  the  fading  types.  Note  that 
the  parameters  C  and  D  are  functions  of  the  fading 
intensity,  fading  range,  channel  covariance,  random 
signature  sequence  length,  and  SNR  of  AWGN. 

2.2  Packet  Error  Probability 

Let  Pc,  Pd,  and  P„  denote  the  probabilities  that  a 
decoded  packet  contains  no  error,  detectable  errors, 
and  undetectable  errors,  respectively.  Then  we  have 
Pc  H"  Pd  "h  Pu  =  1  • 

Since  Reed-Solomon  code  can  correct  up  to 
L(n  —  i)/2j  errors,  it  is  easy  to  see  that 

L(n-fc)/2j 

p<^=  E  (6) 

where  [xj  is  the  largest  integer  not  greater  than  x,  n 
is  the  codeword  length,  and  k  is  the  information  bit 
length.  We  also  have 

n 

where  Ai  is  the  weight  distribution  defined  as  [5] 


3  Adaptive  Code  Rate 
Change  Algorithms 

In  general,  the  error  performance  of  the  DS-SSMA 
system  is  worse  than  that  would  be  caused  by  K 
(real)  users  counted  by  the  base  station  because  of 
near  cell  interference.  The  added  error  performance 
degradation  effects  result  in  increase  of  the  number  of 
users,  the  effective  number  of  users  K' .  Note  that  Pe 
defined  in  (9)  is  now  a  function  of  the  code  rate  Ar/n, 
the  effective  number  of  users  K' ^  fading  environment, 
and  the  ratio  of  pure  AWGN  power  and  signal  power. 

3.1  Code  Rate  for  Maximum 
Throughput 

In  this  paper,  our  performance  criterion  is  to  max¬ 
imize  the  throughput  when  Pe  is  limited  to  a  fixed 
value,  where  the  throughput  is  defined  as 

E  =  ~{P.  +  Pc)  .  (10) 

Since  Pe  and  the  throughput  are  merely  increasing 
functions  of  the  number  of  effective  users  and  code 
rate  in  DS-SSMA  systems,  the  code  rate  which  sat¬ 
isfies  the  Pe  bound  maximizes  the  channel  through¬ 
put. 

When  the  effective  number  of  users  is  given  and 
the  fading  environment  is  known,  we  calculate  Pe 
for  all  code  rates  {k  =  1,  2,  •  •  • ,  n)  by  making  use  of 
(4),  (6),  (7),  and  (9).  We  then  select  the  largest  code 
rate  which  results  in  less  Pe  than  the  given  bound. 
Some  results  are  given  in  [6],  assuming  that  the  error 
correcting  code  length  n  =  15,  the  random  signature 
sequence  length  N  =  1023,  the  signal  power  S  = 
PT,  and  the  pure  AWGN  power  is  Nq,  It  is  also 
shown  in  [6]  that  the  code  rate  becomes  high  as  the 
effective  number  of  users  decreases,  and  the  effect 
of  the  pure  AWGN  on  Pe  is  more  severe  when  the 
fading  intensity  of  MAI  7  is  smaller. 

3.2  Estimation  of  Effective  Number 
of  Users 


(8) 


Figure  3  shows  our  estimation  scheme,  where  we 
assume  slotted  communication:  that  is,  each  user 
transmits  simultaneously  one  packet  per  slot.  In  Fig¬ 
ure  3,  Ts  is  the  slot  interval,  Tm  =  LTs  is  the  channel 
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monitoring  interval,  L  is  the  channel  monitoring  slot 
number,  Ri  is  the  number  of  backloged  packets  in 
the  tth  slot,  and  K[  is  the  estimate  of  K'  in  the  ith 
transmission  slot.  We  assume  that  the  code  rate  is 
constant  during  the  channel  monitoring  interval  and 
can  be  changed  at  the  start  of  every  channel  moni¬ 
toring  interval.  We  assume  the  number  of  users  K  is 
constant  over  a  channel  monitoring  interval,  which 
can  be  achieved  by  not  accepting  new  users  until  the 
end  of  a  channel  monitoring  interval  and  allowing 
them  to  communicate  at  the  beginning  of  the  nex- 
t  channel  monitoring  interval.  Once  the  estimation 
of  the  effective  number  of  users  is  accomplished,  we 
can  optimize  the  code  rate  based  on  the  estimated 
effective  number  of  users. 

More  specifically,  at  the  ith  transmission  slot,  we 
can  obtain  K-  from 

KPR{Km  =  Ri ,  (11) 

where  k  is  the  information  bit  length  trimmed  to  sat¬ 
isfy  Pe  limit  for  the  previously  estimated  effective 
number  of  users  and  PR{Ki\k)  denotes  the  retrans¬ 
mission  probability  for  !<(  effective  users  when  the 
code  rate  is  r  =  k/n.  This  probability  can  be  calcu¬ 
lated  using  (6)  and  (7),  or 

PR{A\k)  = 

=  ^  ^  Pu\Kt:=A  ^  PcIk'zzA  (12) 

for  Reed-Solomon  error  correcting  codes. 

At  the  end  of  the  channel  monitoring  interval,  we 
take 

:^=max|i^A';  ,  A'l  (13) 

as  the  estimate  of  the  effective  number  of  users.  If 
the  number  of  new  calls  minus  that  of  dismissed  calls 
during  the  channel  monitoring  interval  is  A"„,  we  as¬ 
sume  that  A  ^  -f  K„  (effective)  users  are  on  the  line 
in  the  next  channel  monitoring  interval  and  change 
the  code  rate  using  the  results  in  [6]  based  on  this 
number. 

Figures  4  and  5  show  simulation  results  of  the  esti¬ 
mation  where  kopt  is  the  information  bit  length  which 
results  in  the  maximum  throughput  under  the  Pe 
bound  10  In  Figures  4  and  5,  it  is  assumed  that 
A  =  20  and  the  fading  is  time  selective  with  A  =  1.5, 

7  =  1.0,  and  ^  =  lOdB.  In  Figure  4,  it  is  assumed 
that  A  =  300  and  K'  =  350.  In  Figure  5,  it  is  cis- 
sumed  that  A  =  400  and  A^  =  450.  We  can  see 
that  even  when  the  code  rate  was  incorrectly  select¬ 
ed  in  the  present  channel  monitoring  interval  because 
the  effective  number  of  users  had  been  estimated  in¬ 
correctly  in  the  previous  channel  monitoring  inter¬ 
val,  we  can  estimate  the  effective  number  of  users  to 
some  degree  of  accuracy.  The  estimate  of  the  effec¬ 
tive  number  of  users  can  then  be  used  to  change  the 
code  rate  adaptively  based  on  the  results  of  [6], 


4  Summary 

In  this  paper,  we  proposed  an  adaptive  code  rate 
change  scheme  in  DS-SSMA  systems.  In  the  pro¬ 
posed  scheme,  the  error  correcting  code  rate  was 
changed  according  to  an  estimate  of  the  effective 
number  of  users,  which  has  significant  effects  on  the 
bit  error  probability. 

We  estimated  the  effective  number  of  users  by 
making  use  of  the  retransmission  requests.  The  cri¬ 
terion  for  the  change  of  the  code  rate  based  on  the  es¬ 
timated  channel  states  was  to  maximize  the  through¬ 
put  under  given  error  bound. 
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Figure!  The  System  Model 


Figure  2.  The  Receiver  Model 
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Figure  4.  Simulation  Results  for  K  =  300  and  K'  =  350 

(a)  When  kgpt  is  used  K'^q  =  341  : 

(b)  When  kept  -  1  is  used  ^  =  336  : - 

(c)  When  kopi  +  1  is  used  ii^20  “  • 


Figure  5.  Simulation  Results  for  K  =  400  and  K'  —  450 

(a)  When  fc<,p<  is  used  444  :  ■  “ 

(b)  When  kopt  -  1  is  used  =5.^  =  443: - 

(c)  When  kopt  +  1  is  used  =>  —  449  . 


Figure  3.  Estimation  of  Effective  Users 
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ABSTRACT 

The  delay-locked  loop  (DLL)  is  well  suited  to  guar¬ 
antee  fine  synchronization  for  direct-sequence  spread 
spectrum  systems.  In  fading  channels  the  mean 
time  to  lose  lock  (MTLL)  of  a  DLL  is  very  impor¬ 
tant  because  a  loss  of  lock  during  a  fade  makes  a 
re-acquisition  process  necessary.  Using  a  two  state 
model  for  the  fading  channel  and  a  modified  DLL 
will  improve  the  MTLL  considerably.  The  modified 
DLL  operates  in  two  different  modes:  a  normal  track¬ 
ing  mode  for  good  channel  conditions,  and  a  ”fade” 
mode  for  bad  channel  conditions,  i.e.  during  a  deep 
fade.  The  modified  DLL  is  more  robust  against  bad 
channel  conditions. 

INTRODUCTION 

Spread  spectrum  techniques  have  become  popular  for 
various  commercial  applications,  e.g.  mobile  com¬ 
munications  based  on  Code  Division  Multiple  Ac¬ 
cess  (CDMA)  and  positioning  systems  (GPS).  For 
direct  sequence  spread  spectrum  systems  (DS-SSS) 
the  exact  code  phase  timing  estimate  is  very  impor¬ 
tant.  The  timing  is  influenced  by  clock  offsets  and  by 
Doppler  effects  especially  in  mobile  environments.  In 
DS-systems,  pseudo-noise  (PN)  code  tracking  with 
a  delay-locked  loop  (DLL)  is  commonly  used.  The 
code  phase  estimate  is  produced  by  comparing  the 
received  signal  with  both  early  and  late  replicas  of 
the  locally  generated  PN  reference  sequence.  The 
code  phase  timing  error  drives  the  clock  of  a  PN- 
code  generator  to  adjust  the  code  phase  timing.  This 
is  a  closed-loop  tracking  control  system.  The  con¬ 
ventional  DLL  has  been  studied  in  the  literature  in 
detail  [1],  [2], [3].  For  the  DLL  there  exist  two  com- 


Fig.  1.  Coherent  conventional  DLL 


monly  used  performance  criteria:  the  tracking  jit¬ 
ter  and  the  mean  time  to  lose  lock  (MTLL).  During 
normal  operation  the  tracking  jitter  should  be  small 
enough  to  guarantee  low  error  rates  for  spread  spec¬ 
trum  data  transmission  and  small  delay  errors  for 
ranging  systems.  The  tracking  jitter  is  of  main  in¬ 
terest  for  sufficient  high  signal  to  noise  ratios  (SNR). 
The  MTLL  is  very  important  for  low  SNR  values. 
The  MTLL  characterizes  the  mean  time  that  a  DLL 
stays  in  its  tracking  range.  When  the  DLL  falls  out 
of  lock  a  re-acquisition  process  has  to  be  started  dur¬ 
ing  which  no  data  transmission  or  ranging  is  possi¬ 
ble.  The  MTLL  should  be  larger  than  the  normal 
transmission  time.  The  calculation  of  the  MTLL  is 
based  on  Fokker-Planck  techniques  for  solving  non¬ 
linear  stochastic  differential  equations  [1],  [3],  [4],  [5]. 
In  this  paper  a  modified  DLL  for  fading  channels  is 
proposed. 

DLL  DESCRIPTION 

The  conventional  coherent  DLL  structure  is  shown 
in  Figure  1.  The  received  signal  is  multiplied  by  the 
early  and  late  replicas  of  the  local  PN-code.  The 
spacing  between  the  early  and  late  replicas  is  2(5Tc, 
with  Tc  I’hc  chip  duration  and  S  the  normalized  time 
offset  of  the  early-late  correlators.  The  error  signal 
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8  =  0.5 
—  8=1.0 


Fig.  2,  S-curve  for  different  coherent  DLLs 


e{t)  is  the  result  of  subtracting  the  despread  early 
and  late  signals.  The  loop  filter  generates  the  in¬ 
put  signal  for  the  voltage  controlled  oscillator  (VCO) 
steering  the  local  PN-code  generator.  The  loop  fil¬ 
ter  is  designed  to  keep  the  tracking  error  low.  Due 
to  the  low  bandwidth  in  the  DLL  an  autocorrelation 
(ACF)  over  the  PN-code  is  performed.  The  S-curve 
S{e)  for  the  conventional  coherent  DLL  is  shown  in 
Figure  2  as  a  function  of  the  timing  error  5.  The  S- 
curves  in  Figure  2  are  based  on  an  ideal  ACF  which 
is  the  limit  for  the  ACF  of  m-sequences  as  N,  the 
sequence  length  tends  to  infinity. 

The  linear  tracking  range  TRun  is  defined  by  the 
region  around  the  origin  where  the  relation  between 
the  delay  discriminator  output  S{€)  depends  linearly 
on  the  delay  error  e.  The  linear  tracking  range  is  re¬ 
duced  when  S  is  reduced.  The  overall  tracking  range 
TRov  is  defined  as  the  region  where  S{e)  produces  a 
useful  output  signal  to  drive  the  VCO  through  the 
loop  filter.  The  spreading  waveform  is  denoted  by 
c(t  —  Td).  Trf(^)  is  the  timing  offset  between  trans¬ 
mitter  and  receiver.  Trf(t)  is  the  receiver  estimate  of 
Tci{t),  The  received  signal  in  baseband  is  given  by 

Sr{t)  =  VPc{t  -  Td)  -h  n{t)  (1) 

where  P  is  the  signal  power  and  n{t)  is  additive  white 
Gaussian  noise  with  two  sided  spectral  power  density 
No/2.  The  received  signal  Sr{t)  is  correlated  with  the 
early  and  late  reference  spreading  waveforms  c{t  — 
Td±S  Tc).  The  correlator  outputs  are  given  by 

Ue/l  (t)  =  \/Pc{t  -  Td)c(t  -fd±STc)  + 

n{t)c{t  -fd  +  STc)  .  (2) 

The  phase  error  is  defined  by 

e={Td-fd)/Tc  .  (3) 


Fig.  3.  Baseband  equivalent  DLL  model 


Assuming  that  the  code  self-noise  [2]  can  be  ne¬ 
glected  and  calculating  the  long  term  average  ye//(f) 
yields 

Ve/lit)  =  VPRc{Td  -fd±STc)  +  ne/i(t) 

=  y/PRc{e±STc)  +  ne/i(t)  (4) 


with  the  idealized  spreading  code  ACF  defined  by 


Rcix) 


l-|a:|,  1*1  <1 

0,  else. 


(5) 


Now  the  early  and  late  branches  are  subtracted  to 
give  the  error  signal 

e(f)  =  yi-ye-  \/PS{€)  -h  n\t)  (6) 

which  consists  of  the  deterministic  signal  VpS{€) 
and  the  white  noise  n'(f).  The  noise  characteristic  of 
n*{t)  can  be  calculated  via  the  autocorrelation  func¬ 
tion  Rn'{T)  of  as  shown  in  [2]  for  the  conven¬ 
tional  DLL.  The  equivalent  noise  in  the  loop 
has  two-sided  power  spectral  density 


Sn'{f)^No  .  (7) 


The  equivalent  baseband  model  for  a  first-order  DLL 
is  given  in  Figure  3.  The  nonlinearity  S{e)  describes 
the  discriminator  characteristic  (S-curve). 

S{e)  =  Rc{e^S)-Rc{e  +  S)  (8) 

Td{t)  has  the  form 

Td{t)  ==  To  +  ait  with  ai  =  AR/R-^- vq/c  (9) 

where  To  represents  some  constant  time  delay,  AR 
the  code  clock  mismatch,  is  a  constant  velocity 
relative  to  transmitter  and  receiver  and  c  is  the  veloc¬ 
ity  of  light.  ai  is  the  Doppler  shift.  The  differential 
equation  of  the  loop  is  given  by 

Kt)  =  Y^-KVPS{e)-Kn'it)  .  (10) 
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Some  additional  parameters  are  defined  by 


K  =  K^fP, 


u  = 


ai 


tJ<' 


(11) 


For  the  noiseless  case  {p  —  0)  there  exists  one  stable 
equilibrium  point  (ea,0)  with  S[ea)  =  u  and  Sa  in 
the  linear  range  of  S(e).  Thus  values  u  7^  0  lead  to 
a  static  error.  The  case  u  =  0  is  only  of  theoretical 
interest  because  ai  will  never  be  exactly  zero  due  to 
the  code  clock  mismatch  A/?. 

The  closed  loop  transfer  function  H{s)  for  the  lin¬ 
earized  loop  is  defined  by 


with 


H{s) 


T,(s) 


Kl 

s  +  Kl 


Kl  =  ^^PKS'[€a)  =  KS'{€^)  , 


(12) 

(13) 


5'(ea)  =  2  being  the  slope  of  the  S-curve  at  ea  for 
the  =  0.5  DLL  and  the  loop  bandwidth  is  defined 
by 

Wl  =  I^(i27r/)|2  =  ^  .  (14) 


MTLL  AND  TRACKING  JITTER 


Exact  explicit  solutions  for  the  MTLL  exist  for  the 
first  order  DLL.  For  the  second  order  DLL  with  pas¬ 
sive  loop  filter  an  approximation  based  on  Kramers 
results  is  possible  [5].  This  approximation  leads  to 
a  compact  form  for  the  MTLL.  This  is  the  reason 
why  a  slightly  modified  approximation  based  on  the 
results  by  Kramers  is  used.  The  MTLL  for  the  first 
order  DLL  is  approximated  by 


iL  = 


(15) 


The  minimal  escape  energy  Ec  is  given  by  the  area 
under  the  S-curve  (see  Figure  2) 


Ec 


f  {S{e) 

Jen 


i)d€ 


(16) 


The  tracking  jitter  around  the  stable  tracking  point 
Ea  for  the  coherent  DLL  is  given  by  [2] 


,2  ^  NqWl  ^  NqKl 
p  “IP 


(17) 


This  shows  that  the  parameter  K  leads  to  a  tradeoff 
between  small  tracking  jitter,  large  MTLL  and  small 
loop  offset. 


CHANNEL  MODEL 

To  simplify  the  investigation,  a  two  state  channel 
model  is  used.  The  good  channel  condition  corre¬ 
sponds  to  the  normal  tracking  mode  of  the  DLL,  i.e. 
high  SNR.  The  parameter  K  will  be  chosen  to  guar¬ 
antee  low  tracking  jitter  in  this  mode.  The  bad  chan¬ 
nel  condition  corresponds  to  a  deep  fade,  i.e.  very 
low  SNR.  For  this  case  the  MTLL  of  the  DLL  is  cru¬ 
cial.  A  real  fading  channel  could  be  projected  onto 
this  model  by  introducing  a  threshold  value.  This  is 
shown  in  Figure  4.  If  the  received  SNR  is  above  the 
threshold  value,  then  the  channel  condition  is  good, 
otherwise  the  channel  condition  is  bad.  In  the  simpli¬ 
fied  model  SNRa  =  6. MB  is  assumed  for  the  good 
state  which  lasts  For  the  bad  channel  we  have 
SNRb  =  —1MB  and  is  =  62sec.  The  fade  duration 
is  much  smaller  than  the  good  state  duration  Ia 
and  SNRa  »  SNRb-  The  threshold  in  the  exam¬ 
ple  is  OdB.  The  mean  SNR  for  both  the  real  fading 
channel  and  for  the  two  state  channel  model  is  MB. 

MODIFIED  DLL 

The  problem  for  a  conventional  DLL  is  that  the 
MTLL  is  reduced  considerably  during  a  fade.  This 
increases  the  probability  that  it  will  lose  lock  during 
the  fade.  The  assumption  for  proposing  the  modified 
DLL  is  that  during  the  fade  the  timing  offset  between 
receiver  and  transmitter,  i.e.  the  Doppler  shift  varies 
slowly.  On  the  other  hand  the  noise  level  is  very 
high.  To  keep  the  DLL  in  lock  with  strong  noise  can 
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be  achieved  by  a  narrow  loop  bandwidth  Wl^  This 
is  achieved  by  a  small  loop  gain  K,  However,  a  small 
loop  bandwidth  is  not  able  to  track  large  Doppler 
shifts  for  a  first  order  DLL.  To  bring  together  the 
two  arguments  the  loop  is  opened,  i.e.  =  0  dur¬ 
ing  the  fade.  This  corresponds  to  a  loop  bandwidth 
Wl  =  0  which  is  optimal  if  there  is  no  loop  detun¬ 
ing.  During  the  fade  the  timing  offset  increases  due 
to  the  Doppler  shift.  As  long  as  the  fade  duration  Ib 
is  not  to  long,  the  tracking  error  will  still  be  in  the 
overall  tracking  range  TRov>  When  the  fade  is  over, 
the  loop  is  closed  again  and  the  DLL  will  drive  the 
loop  offset  to  zero. 

Typical  tracking  error  trajectories  for  a  conventional 
DLL  and  for  a  modified  DLL  are  shown  in  Figure 
5.  The  two  state  channel  model  is  used  and  Figure 
5  shows  details  near  the  fade.  The  solid  line  shows 
the  input  dynamics,  i.e.  the  constant  Doppler  shift. 
In  this  example  it  corresponds  to  a  velocity  of  3m/s. 
The  parameter  K  =  0.08  has  been  chosen  as  a  trade¬ 
off  between  good  Doppler  tracking  capabilities  and 
low  tracking  jitter. 

It  is  clear  that  during  good  channel  conditions  both 
DLLs  perform  equally  good.  During  a  fade,  i.e. 
SNRb  =  —  15dH  the  conventional  DLL  is  very  likely 
to  loose  lock  because  the  MTLL  is  reduced  drasti¬ 
cally.  This  can  be  seen  in  Figure  5  as  the  dotted  line 
oscillating  strongly  during  the  fade.  After  the  fade 
the  conventional  DLL  remains  unlocked. 

The  modified  DLL  operates  with  K  =  0,  i.e.  zero 
loop  bandwidth  during  the  fade.  Thus  it  can  not 
follow  the  Doppler  shift  al.  This  leads  to  a  loop  de¬ 
tuning  Sf  =  al*tB-  As  far  as  the  fade  duration  fjg  is 
short  enough,  as  in  this  example,  the  loop  detuning 
will  be  inside  the  tracking  range  of  the  DLL  at  the 
end  of  the  fade.  The  dashed  line  shows  the  trajectory 
of  the  modified  DLL.  It  can  be  clearly  seen,  that  the 
estimated  timing  offset  Td  remains  constant  during 
the  fade,  and  how  it  approaches  very  fast  the  actual 
timing  offset  Td  after  the  loop  has  been  closed  again. 


der  DLL.  The  modification  can  also  be  applied  to 
the  noncoherent  DLL  and  to  higher  order  DLLs.  For 
higher  order  DLLs  the  opening  of  the  loop  means 
that  timing  estimates  are  extrapolated  by  the  loop 
filter.  The  threshold  based  switching  will  make  the 
loop  robust  against  loss  of  lock  in  a  fading  chan¬ 
nel.  Further  investigations  for  different  fading  chan¬ 
nel  models  will  be  carried  out. 
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CONCLUSION 

The  modified  DLL  improves  the  lock  performance  in 
the  two  state  channel  model.  This  is  achieved  by 
adapting  the  DLL  to  the  channel  conditions.  The 
investigation  has  been  done  for  the  coherent  first  or- 
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ABSTRACT 

Track  circuits  are  the  traditional  and  perhaps  the 
most  important  way  to  provide  railway  traffic  control 
and  train  safety.  However,  electromagnetic 
interferences  from  solid  state  traction  power 
applications  and  increased  transmission  capacity 
requirements  are  calling  for  new  architecture  and 
design.  In  this  paper,  spread  spectrum  modulations 
are  considered,  in  order  to  provide  interference 
rejection  and  multiple  access  capabilities.  In 
particular,  the  possibility  of  provide  track  circuits 
suitable  to  work  indifferently  on  networks  equipped 
with  different  traction  power  systems  is  investigated. 

1.  INTRODUCTION 

Cab  Signalling  (CS),  Automatic  Train  Protection 
(ATP),  and  Automatic  train  Control  (ATC)  systems 
via  track  circuits  have  been  recently  faced  with  severe 
electromagnetic  compatibility  (EMC)  problems,  both 
in  rapid  transit  systems  and  in  main  line  equipment 
cases.  Problems  arise  both  from  solid-state  power 
electronics,  increasingly  utilised  in  traction  power  and 
vehicle  propulsion  control  applications,  and  from 
novel  requirements  of  information  bandwidths  larger 
than  already  used. 

Rapid  transit  equipment  problems  were  considered, 
for  instance,  in  [1]  and  [2].  New  high  speed  main 
lines,  requiring  both  refined  train  control  and  cab 
signalling  systems  and  very  large  traction  powers,  are 
causing  deep  changes  in  technical  requirement  and 
design  practices.  As  an  example,  the  CS/ATC  system 
adopted  for  the  TGV  lines  in  France  and  for  the 
Channel  tunnel  was  described  in  [3].  More  generally, 
data  transmission  systems  via  track  circuits  allowing 
relatively  large  information  bandwidths  have  been 
described,  as,  for  instance,  in  [4]  and  [5]. 


In  addition,  new  requirements  are  emerging  in 
Europe,  calling  for  easy  interoperability  of  rolling 
stocks  across  boundaries  of  regions  equipped  with 
different  electric  traction  and  signalling  systems. 
Traffic  control  interoperability  was  already  allowed  by 
parallel  installation  of  dedicated  equipment  in  a 
number  of  vehicles.  However,  more  efficient  solutions 
are  needed  today,  in  order  to  secure  at  a  time  both 
reduced  equipment  weights  for  high  speed  vehicles, 
and  larger  interoperability,  extended  to  several 
different  electric  traction  systems  [6]. 

Radio  transmission  systems  have  been  recommended 
to  this  purpose  in  various  European  studies, 
considering  very  demanding  technical  requirements: 
see  for  instance  [6],  and  the  ERWIN  and  TWIN 
projects,  developed  in  the  framework  of  the  ESPRIT 
program.  Radio  transmissions  are  likely  to  be  the  only 
solution  to  the  considered  ground-to-train  (and  train- 
to-ground)  communications  scenarios. 

However,  it  seems  interesting  to  deserve  some 
attention  to  a  possible  updated  version  of  coded  track 
circuit  transmissions,  which  apparently  was  not 
considered  in  the  previous  technical  literature. 

In  principle,  spread  spectrum  (SS)  modulations  [7] 
are  likely  to  offer  effective  solutions  to  CS,  ATP  and 
ATC  track  circuit  design,  as  robustness  against  hostile 
EMC  environments  was  already  demonstrated  in 
applications  leading  to  similar  problems,  i.e.,  mobile 
radio  and  guided  propagation  trough  power  line 
wiring  and  intrabuilding  power  distribution  circuits 
[8]  [9].  However,  both  the  bandwidths  allowed  by  the 
track  circuit  electrical  characteristics  (see,  for 
instance,  [10])  and  the  various  interference  signals 
make  not  straightforward  any  application  of  the 
customary  processing  gain  and  anti  jamming  margin 
evaluation  procedures. 

A  first  introduction  to  these  problem  was  presented 
in  [11],  were  different  SS  modulations  were 
considered,  anticipating  that  in  this  application  too,  as 
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usual,  the  relative  merits  of  the  Direct  Sequence 
modulation  (DS)  and  of  the  frequency  hopping  (FH) 
schemes  cannot  be  assessed  without  concrete  reference 
to  the  actual  electromagnetic  environment.  Moreover, 
it  was  emphasised  in  [11]  that  the  main  benefit  gained 
from  SS  modulations  will  be  perhaps  some 
insensitivity  to  the  traction  power  supply  techniques, 
more  than  any  performance  improvement  with  respect 
to  traditional  narrow  band  track  circuits. 

In  this  paper,  attention  is  focused  on  low  information 
rate  signalling  via  track  circuits  for  CS  and  ATP 
systems,  mainly  by  means  of  DS  modulations.  Notable 
difficulties  are  not  anticipated  in  considering  various 
similar  transmissions  sharing  the  same  circuits,  and 
unconstrained  multiple  access  schemes  similar  to 
those  described  in  [12]  can  be  easily  conceived,  in 
order  to  provide  service  to  different  mobiles  on  the 
same  track  section. 


2.  INTERFERENCES  SPECTRA  AND  SPREAD 
SPECTRUM  MODULATIONS  ON  TRACK 
CIRCUITS 


Some  examples  of  conducted  emission  spectra  on 
tracks,  to  be  faced  by  track  circuit  receivers,  were 
shown  in  [11],  Fig.  1,  and  some  detailed  spectra  for 
various  a.c.  and  d.c.  traction  power  supply  cases  are 
provided  in  [13],  [14],  and  in  [1].  In  particular, 
reference  is  made  to  Fig.  21/2  and  25/2  (a.c.  15  kV,  16 
2/3  Hz  tyristor  locomotive,  without  and  with  filter), 
22/2  and  23/2  (a.c.,  25  kV,  50  Hz  locomotives).  Fig. 
29/2  and  30/2  (calculated  spectra  for  2  four-quadrant 
controllers  and  4  four-quadrant  controllers,  a.c.  25  kV, 
50  Hz  asynchronous  motor  locomotives)  and  Fig.  36/2 
(d.c.,  chopper  drive  locomotive)  in  [13],  to  Fig.  2  and 
Table  1  in  [14]  and  to  Fig.  6  in  [1]  (chopper  drive 
locomotives).  Maximum  envelopes  for  conducted 
emissions  for  a  main  line  d.c.,  3  kV  system  and  for  a 
rapid  transit  system  are  shown  in  [14],  Fig.  1,  and  in 
[1],  Fig.  8. 

As  seen,  attention  is  normally  restricted  to  harmonic 
spectra,  as  needed  when  considering  traction  currents 
which  are  not  rapidly  changing  due  to  relatively  slow 
train  accelerations,  and  when  chopper  traction  units 
are  restricted  to  fixed-frequency  operation. 

Narrow  band  circuits  secure  immunity  by  rejecting 
spectral  lines  at  harmonic  frequencies.  However,  line 
frequencies  are  typical  of  the  various  electric  traction 
systems,  and  so  are  the  particular  frequencies  carrying 
the  major  interference  contributions. 

Considering  the  bandwidths  available  for  tracks 
circuits,  which  are  of  the  order  of  some  10  kHz,  SS 


modulations  will  offer  in  principle  processing  gains  of 
the  order  of  some  20  or  30  dB,  depending  on  the 
information  rate.  The  related  interference  margins 
will  be  largely  independent  from  spectral  details,  in 
particular,  from  the  interference  fundamental  and 
more  dangerous  frequencies.  However,  effective 
interference  margins  have  to  be  carefully  investigated, 
as  no  further  protection  is  available  to  counteract 
major  interference  spectral  lines  if  notch  filter  are  not 
utilised  (note  that  any  notch  filter  is  likely  to  limit 
standardisation  and  interoperability  of  equipment). 
From  the  above  documentation,  it  appears  that  some 
low  frequency  regions,  up  to  about  500  or  700  Hz, 
carry  the  larger  interference  lines  both  in  the  a.c.  and 
in  the  d.c.  supply  cases.  Higher  frequency  regions 
show  some  unconstrained  harmonic  levels  in  the 
Italian  maximum  envelope  for  conducted  emission, 
and  are  used  in  various  networks  to  carry  ancillary 
signals. 

Although  considering  the  maximum  envelope  for 
d.c.  traction  systems  is  not  realistic,  as  actual 
interferences  show  very  sparse  harmonic  spectra,  it  is 
evident  that  the  broad  band  nature  of  the  DS 
signalling  procedures  is  not  likely  to  provide 
comfortable  processing  gains,  and  that  a  more  realistic 
evaluation  of  the  receiver  performances  will  be  based 
on  white  noise  interference.  This  is  suggested,  in 
particular,  by  the  near  constant  envelope  of  the  a.c. 
locomotive  spectra:  when  larger  bandwidths  are 
occupied  by  the  modulation  spectra,  larger  interference 
powers  are  collected  by  the  receiver,  and  the 
performances  of  the  optimum  receiver  will  be  near 
insensitive  to  the  actual  signal  bandwidth. 

With  respect  to  narrow  band  systems,  the  protection 
of  dedicated  frequency  channels  is  lost,  and  worst  case 
interference  conditions  are  probably  worse.  However, 
some  insensitivity  to  particular  condition  is  certainly 
gained,  as  insensitivity  to  the  traction  current 
fundamental  frequency,  which  prevents 
interoperability  of  narrow  band  track  circuit  receivers. 
Moreover,  continuous  spectra  and  variable  line 
frequency  spectra  are  not  in  principle  more  harmful 
against  a  direct  sequence  receiver  than  fixed-frequency 
spectra,  and  probably  the  fixed-frequency  operation 
constraint,  already  introduced  in  many  networks  to 
protect  traditional  track  circuits,  can  be  removed, 
allowing  sweep  frequency  chopper  drive  operations. 

To  provide  quantitative  evaluation  of  the  effects  of 
the  various  interference  spectra,  a  simulation  program 
has  been  implemented.  The  first  results  on  a.c.  16  2/3 
Hz  locomotive  spectra,  considering  DS/PSK 
modulations  at  5  or  10  bit/s  information  rates,  show 
that  proper  operation  conditions  are  provided  at  the 
same  signalling  current  utilised  in  narrow  band  track 
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circuits.  A  chip  rate  of  some  hundred  of  Hz  has  been 
selected,  with  a  carrier  frequency  of  about  1  kHz,  and 
a  correlation  receiver  has  been  utilised.  Out  of  band 
spectral  lines  have  been  filtered,  as  pre-filtering  of 
any  out  of  band  (possibly  strong)  interference  is 
clearly  to  be  recommended  in  the  system  design. 

As  expected,  the  noise  probability-distribution 
functions  for  levels  comparable  to  2  or  3  times  the  rms 
current  values  are  similar  to  those  of  a  gaussian  noise, 
and  the  receiver  performances,  at  least  for  low  signal 
to  noise  power  ratios  and  large  bit  error  rates,  are 
rather  similar  to  those  of  a  matched-filter/correlator 
receiver  in  the  presence  of  white  gaussian  noise. 

However,  the  periodic  nature  of  the  interference, 
which  is  likely  to  exhibit  repetition  periods  of  the  same 
order  of  the  bit  duration,  has  to  be  taken  into  account: 
to  provide  comparison  with  the  gaussian  white  noise 
case,  the  actual  phases  of  the  interference  spectral 
lines  have  been  periodically  set  to  new  pseudo-random 
values. 

Further  work  is  needed  to  complete  the  evaluation  of 
the  DS  receiver  with  various  interference  spectra 
(processing  gains  of  about  15  or  20  dB  are  expected 
against  continuous  wave  interferences),  and  with 
various  design  parameters,  and  to  provide  comparison 
among  different  SS  schemes  and  with  narrow  band 
systems. 

In  any  case,  it  seems  that  signal  acceptance  checks 
similar  to  those  utilised  in  narrow  band  circuits  (as 
coding,  detection  hysteresis,  on-off  keying  at  very  low 
rates  followed  by  duty  cycle  evaluation,  detection  of 
unbalanced  traction  currents  on  tracks  )  have  to  be 
retained  in  the  design  of  spread  spectrum  systems, 
which  will  be  exposed  to  more  unpredictable 
interference  conditions.  - 

To  counteract  unexpected  strong  interferences, 
which  are  the  weakness  of  DS  systems,  frequency 
hopping  modulation  systems  can  provide  effective 
solutions,  because  signal  check  and  rejection 
mechanisms  similar  to  those  implemented  in  narrow 
band  tracks  circuits  filtering  can  be  utilised,  and  the 
low  information  rates  allow  to  retain  only  signal 
received  on  clear  frequency  channels.  Moreover, 
compatibility  with  current  narrow  band  systems  can  be 
easily  provided  by  tuneable  fixed-frequency  operation 
and  adjustable  detection  procedures,  easily  provided, 
at  low  information  rates,  by  means  of  digital  signal 
processing  techniques. 

Similarly,  diversity  transmissions  can  be  usefully 
investigated:  in  particular,  multicarrier  MSK 
modulation  systems,  already  proposed  for  data 
transmission  on  track  circuits  [5],  can  provide 
diversity  if  the  different  carriers  are  modulated  by  the 
same  low  rate  data  sequence,  as  anticipated  in  [11]. 


A  further  possibility  deserves  some  investigation. 
Considering  a  SS  track  circuit  sharing  the  low 
frequency  band  of  the  current  narrow  band  systems, 
compatible  operations  of  the  two  systems  calls  for 
negligible  mutual  interference.  This  is  likely  to  be 
feasible,  but  the  accurate  assessment  of  both  system 
sensitivities  and  of  the  current  levels  needed  for  the 
SS  system  is  needed.  A  frequency  hopping  system  will 
be  probably  ignored  by  many  narrow  band  traditional 
circuits;  similarly,  due  to  low  power  spectral  densities, 
direct  sequences  systems  will  be  not  harmful,  even 
considering  the  increased  power  levels  needed  to 
counteract  the  interference  caused  by  the  narrow  band 
circuit  itself. 


3.  CONCLUSIONS 

A  first  assessment  of  the  capabilities  of  spread 
spectrum  modulated  track  circuit  has  been  attempted, 
looking  both  for  transmission  performances 
comparable  to  those  secured  by  means  of  the  current 
narrow  band  track  circuits,  and  for  providing  suitable 
operation  by  means  of  a  sole  standard  equipment  in 
different  EMC  conditions,  as  experienced  in  railway 
networks  equipped  with  different  electric  traction 
systems.  Although  track  circuit  spread  spectrum 
techniques  seem  not  to  be  likely  to  provide 
performances  similar  to  those  expected  from  radio 
transmission  systems,  some  interesting  possibilities 
have  been  shown,  and  some  future  work  outlined, 
which  is  needed  to  evaluate  effective  feasibility. 
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ABSrreACT 

Developments  in  techniques  for  modelling  of  photodiodes 
used  in  optical  parametric  receivers  are  examined  in  this 
paper.  As  parametric  operation  is  non-linear  and  quasi- 
peri  odic  in  nature,  special  modelling  techniques  arc 
required  Tl^e  effect  of  diode  parameters  (eg  capacitance) 
on  gain  and  bandwidth  have  been  examined.  Time  domain 
techniques  have  been  used  for  simulation.  Different  non¬ 
linear  models  for  depicting  the  photodiode  have  been 
evolved  during  the  course  of  analysis.  A  summarized 
analytical  derivation  is  also  presented  Finally,  the  results 
obtained  from  an  experimental  down  converter  are 
included 

1.  INTRODUCTION 

The  concept  of  using  parametric  receivers  for  microwave 
amplification  has  been  established  for  several  years.  This 
technique  is  now  sought  to  be  extended  to  receivers  for 
optical  carriers  with  microwave  modulatioa  Since 
parametric  operation  is  non-linear,  the  analysis  of  these 
circuits  presents  sonrs  difficulty.  Various  configurations 
such  as  i^-convcrtcrs,  down  converters  and  baseband 
anplifiers  are  possible  in  this  system  One  of  the  factors 
vdiich  is  of  paramount  importance  in  governing  receiver 
performance  is  the  photodiode  capacitance.  This  has  some 
influence  on  the  gain  as  well  as  the  bandwidth.  Ihis 
receiver  can  principally  be  used  fibre  based  microwave 
sub-carrier  systems. 

2.  SYSTEM  CONFIGURATION 

The  system  has  interaction  of  optical  and  microwave 
signals.  The  transmitter  consists  of  a  laser  source  which  is 
arrplitude  modulated  by  RF  or  microwave  signals. 
Specifically  in  a  microwave  sub-carrier  system  several 
microwave  subcarriers  are  combined  and  modulated  on  the 
laser  beam.  The  receiver  consists  of  a  photodiode  which  is 
operated  in  a  parametric  mode  to  perform  anplification 
and  to  recover  the  baseband  Parametric  operation  is 


effected  by  periodic  variation  of  the  diode  capacitance, 
This  is  accomplished  applying  a  microwave  signal  (level 
of  10-15  dBm)  called  pump  across  the  diode.  Manley  and 
Rowe  have  developed  the  basic  theory  of  electric 
parametric  anplifiers  and  this  can  serve  as  a  basis  for  this 

caseV  It  is  assumed  that  the  level  of  the  detected  signal  is 
much  smaller  than  the  pump  and  that  the  modulation  of  the 
capacitance  is  due  to  the  pump.  The  profile  of  the  diode’s 
Capacitance  Voltage  characteristic  and  the  ratio  of  the 
maximum  to  minimum  capacitance  is  of  significance  in 
governing  the  performance  of  the  amplifier. 


3.  ANALYSIS 

A  summarized  analytical  derivation  is  now  presented  for 
the  down-converter.  In  the  analysis,  the  periodic  variation 
of  the  diode  capacitance  by  electrical  means  is  represented 
as  a  sinusoidally  varying  capacitance.  Therefore, 
capacitance  of  a  pumped  photodiode  can  be  approximately 
represented  in  Vie  following  manner  [2], 

C=  Co  [  1  -  Vp  Sin  cop  t]  (1*) 

2(Vo  +  VlO 

where  V|)  is  the  pump  voltage,  Vo  is  the  barrier  potential, 
Vb  is  the  bias  voltage,  Co  is  the  zero  bias  diode 
capacitance  and  oop  the  pump  frequency. 

The  photocunrent  can  be  expressed  as  : 

Is  *=  Iso  Sin  oos  t  (2) 

An  expression  can  be  derived  for  the  total  current  in  the 
photodiode  as: 

1=  dCC.Vg)  =  VsdC  +  CdVs  (3) 

dt  dt  dt 

Where  Vs  is  the  signal  voltage  across  the  diode  due  to  the 
photocurrent  Is.  Substituting  form  equations  (1)  and  (2) 
into  equation  (3)  ,  the  expression  for  the  component  of 
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the  diode  cunrcnt  (peak  value)  at  the  down  -converted 
frequency  (Dp  -  CDs  can  be  derived  as: 

Id  *“  _ lao  Yp  [  tap  -  <ita]  (4) 

4  (  Vb  +  Vo)  CDS 

If  the  load  resistance  at  this  frequency  is  Ro  (normally  50 
ohms),  then  the  output  voltage  is  given  by  Id  Rq*  ^ 

accordance  with  normal  microwave  practice,  the  input 
voltage  or  power  is  measured  across  a  50  ohm  load. 
Hence  the  input  voltage  can  be  written  as  I^^  where  R^ 

**  50  ohms.  Therefore  the  gain  of  the  Down  -converter  can 
be  shown  to  be: 

CffiP.  _ Yp _ L  (5) 

CDS  4  (  Vb  +  Vo) 

Equation  (5  )  shows  that  the  gain  is  dependant  on  the 
ratio  of  the  down-converted  frequency  to  the  signal 

fircqucncy. 

4.  NON-LINEARSI^fULA'^ON 

Simulation  can  be  effectively  used  to  supplement 
analytical  methods  in  investigating  parametric  systems. 
Because  of  the  nature  of  the  system,  the  simulation 
technique  used  should  be  able  to  tackle  large  signal  non¬ 
linearity.  For  non-linear  simulation  time  both  domain  and 
frequency  domain  methods  are  available.  Time  domain 
methods  like  transient  analysis  are  quite  versatile  in 
tackling  non-linearity  and  are  used  in  packages  like 
Spice3.  As  the  anr^liUide  of  the  harmonics  are  required  at 
the  end,  spectrum  analysis  has  to  be  performed,  The  circuit 
to  be  simulated  is  ^own  in  fig.  1  ,  where  the  photo-current 
is  depicted  as  a  current  source. 


Fig  1  Simulated  circuit  of  receiver 

Two  parameters  vdiich  are  inportant  in  transient  analysis 
are  the  step  size  and  the  duration  of  analysis.  These  are  to 
be  fixed  depending  on  the  frequencies  of  interactioa  In  a 
typical  case  of  a  down-converter,  the  pump  frequency  may 


be  1  GHz  and  the  signal  frequency  990  MHz  vdiich  gives  a 
down-converted  frequency  of  1 0  MHz.  It  would  therefore 
necessary  to  resolve  10  MHz  around  a  central  frequency 
of  IGHz-  To  accomplish  this,  the  duration  of  analysis 
should  include  several  cycles  of  the  smallest  frecjucncy  to 
be  resolved  [3].  In  addition  transient  analysis  is  performed 
at  discrete  points  on  the  input  waveform  with  interpolation 
in  betweea  In  order  to  minimize  interpolation  errora,  step 
size  should  be  chosen  as  a  fraction  of  the  pump  period 
Subsequently,  during  Fourier  analysis  on  the  transient 
analysis  data,  it  should  be  remembered  that  only  a  finite 
data  length  is  used  which  can  result  in  amplitude  errors  [3]. 
This  can  be  minimized  by  the  choice  of  a  proper  window,  A 
Hamming  window  offers  a  compromise  between  frequency 
resolution  and  leakage.  On  the  other  hand,  a  raised  cosine 
window  which  has  a  small  central  lobe  leading  to  better 
resolution  and  roll  off  characteristics  has  been  used  in  the 
package. 


5.  MODEL  DEVELOPMENT 

Different  models  have  been  evolved  for  representing  the 
non-linear  diode  behaviour.  Initially  the  standard  diode 
model  in  the  package  was  used  [4].  However  for  exactriess, 
dedicated  models  have  been  evolved  two  of  which  are 
discussed  here. 

5.1  Non-linaar  capacitor  model 

As  the  name  suggests  the  photodiode  is  represented  as  a 
non-linear  capacitor.  As  a  non-linear  capacitor  is  not 
available  as  a  standard  model  in  Spice3  a  model  was 
synthesized  as  shown  in  fig  2 .  The  non-linear  capacitance 
is  realized  by  having  a  non-linear  source  drive  a  current 
throu^  a  linear  capacitor. 


Fig  2  Non-Linear  capacitor  model. 

The  non-linear  source  is  dependent  on  the  voltage  between 
dl  and  d2.  The  current  through  the  c^acitor  is  reflected 
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between  dl  and  d2  and  this  represents  the  non-linear 
capacitor  The  series  resistance  of  the  diode  is  represented 
by  Rg  .  If  required  a  shunt  resistance  and  series 

inductance  can  also  be  included  A  Down  -converting 
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Fig.  3  Output  Spectrum  Gain=10.45dB 

configuration  was  simulated  using  this  model  using  the 
circuit  shown  in  fig.  2.  The  C-V  characteristic  was 
measured  using  a  parameter  analyzer.  The  diode  had  a 
zero  bias  capacitance  of  9.9  pF.  The  polynomial  used  for 
representing  the  non-linear  capacitor  was  C  ~  9,95  - 

4.18V  +  1.2  V^.  The  simulation  was  performed  for  a 
punrp  frequency  of  1  GHz  and  a  input  frequency  of  980 
MHz  (Level  =  -52  dBV).  The  output  spectrum  is  shown  in 
fig.  4  ^ch  indicates  a  gain  of  10.45  dB  at  the  down- 
converted  frequency  of  20  MHz. 

5.2  Enhanced  Model 

The  non-linear  capacitor  model  works  well  in  modelling 
normal  junction  diodes.  However,  photodiodes  usually 
have  a  PIN  structure.  The  enhanced  model  was  developed 
with  the  intent  of  forming  an  exact  representation  of  the 
diode  structure.  This  is  acconplished  in  the  model  shown 
in  fig.  4.  At  zero  bias,  a  portion  of  the  intrinsic  region  is 
undeplcted  which  is  atterrpted  to  be  represented  in  the 
enhanced  model.  Here,  the  diode  structure  is  split  into  two 
parts,  one  representing  the  junction  region  and  the  other 
the  undeplcted  part  of  the  intrinsic  regioa  At  zero  bias  the 
capacitance  of  the  junction  region  dominates  by  ip  to  a 
factor  of  ten  over  the  intrinsic  regioa  With  the  application 
of  bias,  the  capacitances  of  each  region  behave  differently. 
The  junction  region  can  be  modelled  as  before 


with  a  polynomial.  The  undepleted  intrinsic 


Non  linear 
Capacitor  2 
(1  -Region) 


Fig  4  Enhanced  model  of  photodiode 

region  is  represented  as  an  exponentially  increasing 
capacitance.  The  simulation  was  repeated  on  a  down¬ 
converting  configuratioa  At  an  iiput  frequency  of  980 
MHz,  the  simulation  predicted  a  gain  of  6.0  dB  at  the 
output  frequency  of  20  MHz  (fig.  5).  This  model  can  be 
used  to  balance  intrinsic  region  properties  of  the 
photodiode  which  affects  photodetection  with  those  of  the 
junction  region  which  affects  parametric  performance. 


Fio.  5  Output  Spectrum.  Gain  “  6.0  dB. 

InpjJt  =  -50  dBV 

The  simulation  was  also  repeated  to  study  the  performance 
of  ip-converters.  One  of  the  factors  of  importance  is 
amplifier  bandwidth.  Though  a  smaller  diode  capacitance 
(<  1  pF)  would  enable  higher  speed  of  operation  ,  it  was 


213 


observed  that  for  improved  parametric  properties  resulting 
in  higher  gain  arid  bandwidth,  a  higher  capacitance  of 
around  1 0  pF  is  preferable, 

6.  PRACTICAL  RESULTS 

Practical  down-converters  and  if)-convertcrs  have  been 
tested  and  these  have  performed  well.  A  practical  down- 
converter  was  built  using  a  broad  band  laser  transmitter 
which  could  accept  modulation  up  to  1  GHz.  The 
photodiode  was  built  mounted  in  a  micro-strip  box  In  a 
down-converting  arrangement,  using  a  1  GHz  pump  and  a 
900  IvIHz  signal  the  receiver  gave  a  gain  of  7.9  dB  at  the 
down-converted  frequency  of  100  IvlHz(fig.  6) 
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Fig  C.  Output  of  Down-converter 
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The  performance  can  be  improved  by  shorting  the  idlers 
which  would  otherwise  dissipate  some  of  the  energy  in  the 
load.  Up-converters  earlier  built  which  bandwidths  of  up 
to  1 00  MHz  at  a  pumf*  frequency  of  1 .5  GHz  [4 ]  A  gain  of 
12.5  dB  could  be  obtained  at  a  signal  frequency  of  100 
MHz 


7.  CONCLUSIONS 

The  development  of  simulation  and  modelling  techniques 
for  parametric  receiver  design  is  presented.  The  non-linear 
capacitor  model  helps  in  incorporating  the  measured 
Capacitance -Voltage  characteristic  of  the  photodiode  in 
the  simulatioix  while  the  erhanced  model  helps  in 
modelling  a  more  exact  photodiode  structure.  Tlie 
modelling  of  the  down- converter  has  been  a  guide  in  the 
practical  constrijction  in  predicting  the  down-conversion 
gain  available  for  a  range  of  input  frequencies. 

The  effect  of  diode  capacitance  in  influencing  parametric 
gain  and  bandwidth  as  indicated  by  simulation  and  practical 
experiments  suggests  that  a  value  of  10  pF  is  desirable. 
The  enfianced  model  will  help  in  the  evolution  of  a  new 
diode  structure  which  will  balance  pholodetection  and 
parametric  properties 
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ABSTRACT 

We  will  report  on  design  considerations  and  general 
discussion  of  the  applicability  of  the  transimpedance 
ampMer  concept  for  optoelectronic  receivers  at 
extremely  high  bit  rates  up  to  40Gb/s.  The  investigated 
receiver  design  is  based  on  a  low  gate-leakage 
InAlAs/InGaAs/InP  heterostructure  fieldeffect  tran¬ 
sistor  (HFET)  [1].  The  very  important  noise  modeling  of 
these  devices  is  done  using  an  extended  temperature 
noise  model  [2]  in  order  to  produce  a  reliable 
extrapolation  far  beyond  frequency  limits  of  common 
measurement  setups.  The  fitted  transistor  model  shows 
excellent  agreement  with  measured  data  concerning  rf  as 
well  as  noise  performance.  The  evidence  of  inductive 
peaking  near  the  comer  fi'equency  of  the  transimpedance 
Zy  is  correlated  to  a  phase  difference  between  the 
voltage  gain  and  Zy  itself  Furthermore,  the  distinct 
influence  of  the  length  of  the  feedback  line  on  receiver 
performance  is  discussed.  Based  on  0.7^m  gate  HFETs 
produced  by  optical  lithography  and  offering  a  current 
gain  cut  off  frequency  of  7^=40GHz  the  following 
receiver  features  can  be  predicted:  low  frequency 
transimpedance  Z  jq  =39.4dBn,  comer  frequency 
73^=22GHz,  mean  equivalent  input  noise  current 
density  i„g  «43pA/>/Hz.  Thus  the  receiver  shows  an 
excellent  calculated  sensitivity  of  JjPmm  -  -13.2dBm  at 
40Gb/s. 

1.  INTRODUCTION 

The  transimpedance  amplifier  concept  for  low  noise 
preamplifiers  in  optoelectronic  receivers  is  most 
common  up  to  bit  rates  of  lOGb/s  [3].  Recently,  even 
20Gb/s  demonstrators  can  be  found  in  literature.  But  this 
concept  becomes  more  doubtfiil  with  increasing  bit  rate 
due  to  wave  propagation  effects  within  the  circuit.  Other 
approaches  using  impedance  matched  amplifier  concepts 
like  traveling  wave  amplifiers  specially  modified  for 


optoelectronic  receiver  applications  [4]  suffer  fi'om  the 
bad  power  matching  conditions  at  the  input.  From  a  pure 
physical  point  of  view  a  photo  detector  (PD)  is  a 
capacitively  shunted  current  soince.  All  efforts  to  match 
this  device  to  a  low  impedance  wave  guide  result  in  a 
significant  drop  of  responsivity  of  the  PD  [5].  Therefore, 
up  to  now  the  transimpedance  amplifier  seems  to  be  the 
best  concept  for  low  noise  preamplification  in 
optoelectronic  receivers. 

Aim  of  this  paper  is  to  point  out  some  very  important 
design  considerations  for  the  development  of  such  high 
bit  rate  receivers  based  on  the  transimpedance  amplifier 
concept.  Nevertheless,  because  all  basic  model  data  are 
derived  from  measured  devices,  the  demonstrated 
amplifier  design  is  suitable  for  technological  realization. 
All  simulations  have  been  carried  out  using  the 
Microwave  Design  System  (MDS)  by  Hewlett  Packard. 
Due  to  the  significant  influence  of  the  noise  modeling  on 
sensitivity  estimations,  we  have  implemented  a 
temperature  noise  model  (TNM)  [2]  for  the  HFET 
devices.  The  calculation  of  receiver  sensitivity  is  done  in 
a  measurement  compatible  way  and  hence  direct 
correlation  of  simulated  and  measured  data  is  possible. 

2.  HFET  MODEL 

The  dc  and  rf  model  of  the  FET  is  based  on  the 
EEHEMTl  large  signal  model  in  MDS.  Fig.  1  and  2 
demonstrate  the  excellent  agreement  in  both,  dc  and  rf 
characteristics  between  measured  and  simulated  data. 
Furthermore,  that  model  shows  very  good  agreement  for 
a  wide  range  of  bias  conditions.  For  the  noise  .simulation 
the  FETs  were  modeled  using  the  TNM  [2],  which  has 
the  advantage  of  describing  significant  noise  sources  and 
small  signal  behavior  at  the  same  time.  The  comparison 
of  measured  and  calculated  noise  data  is  shown  in  fig.  3. 
The  TNM  allows  extrapolation  far  beyond  measurement 
frequency  limits  without  producing  non  physical  results. 
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This  is  most  significant  for  a  valid  prediction  of  receiver 
sensitivity.  Low  frequency  1/f-noise  is  not  considered  by 
the  TNM.  Nevertheless,  sensitivity  estimations  will  not 
be  influenced  significantly  due  to  the  very  small 
bandwidth  of  1/f-noise  compared  to  the  bit  rate. 

3.  AMPLIFIER  LAYOUT 

The  investigated  transimpedance  amplifier  layout  is 
straight  forward  as  can  be  seen  from  fig.  4.  It  is  based  on 
a  three  stage  concept  with  a  first  voltage  amplifier  stage 
(Tj)  followed  by  a  current  amplifier  and  level  shift  (Dj- 
D3)  driving  stage  (Tj)  for  bias  adjustment  To  achieve  a 
suitable  voltage  gain  bandwidth  and  to  minimize 
additional  noise  contributions  the  first  load  should 
be  realized  by  a  chip  resistor  rather  than  an  active  load. 
Within  the  second  stage  an  active  load  (T3)  acting  as 
current  source  results  in  more  stable  bias  conditions 
without  significantly  increasing  noise.  For  a  circuit 
simulation  according  to  a  real  chip  design  some  physical 
line  models  (TLj,  TL2)  have  been  added  within  the  level 
shift  and  feedback  path  which  obviously  have  the 
maximum  length  compared  to  all  other  interconnections. 
The  third  stage  (T4,  Tj)  drives  and  matches  a  coimected 
50  Q.  line  termination. 


and  TL2  is  clearly  demonstrated.  With  increasing  line 
length  inductive  peaking  occures  which  significantly 
deteriorates  impulse  response  as  demonstrated  in  fig.  6. 
Scaling  of  these  line  lengths  directly  correlates  with  chip 
size.  Therefore  the  total  chip  size  should  be  as  small  as 
possible  for  high  bit  rate  receivers.  Inductive  peaking  of 
the  transimpedance  corresponds  to  a  distinct  phase 
difference  between  voltage  gain  and  transimpedance 
function,  hence  the  bandwidth  of  should  be  larger 
than  that  of  transimpedance  Zy  itself  (fig.  5).  A  flat 
characteristic  of  transimpedance  is  achieved  with  a 
comer  frequency  of  fjjg  «22GHz  corresponding  to  a 
bit  rate  of  S=40Gb/s.  Variations  of  group  delay  time  are 
of  the  order  of  gr|  <10ps.  The  equivalent  input  noise 
current  density  is  shown  in  fig.  7.  The  mean  equivalent 
input  noise  current  over  bandwidth  A/  =B  is 
*na  «43pA/''/Hz.  Thus  receiver  sensitivity  can  be 
calculated  to  ///min”  -13.2dBm.  The  receiver  output 
signal  shows  a  clearly  open  eye  at  that  bit  rate  and  an 
effective  transimpedance  corresponding  to  the  eye 
opening  of  (fig.  8).  Concluding  it  should 

be  pointed  out,  that  especially  included  lines  within  the 
feedback  path  deteriorate  impuls  behavior  of  the 
amplifier  significantly. 


4.  CIRCUIT  SIMULATION 
Four  types  of  simulations  have  been  carried  out.  A  dc 
simulation  is  used  to  examine  bias  conditions  of  all 
FETs.  S-parameter  simulation  is  used  to  derive  all 
relevant  small  signal  data  like  voltage  gain 
transimpedance  Zj  (magnitude  and  phase)  and  power 
gain  G.  A  noise  voltage  and  current  simulation  in 
combination  with  the  S-Parameter  data  allows  direct 
calculation  of  the  equivalent  square  input  noise  current 
(iJui)  hence  the  prediction  of  receiver  sensitivity 
after  [6]  as: 


where  Q  is  the  noise  factor  corresponding  to  the  bit  error 
rate  (gw6  for  BER=10"^,  B  the  actual  bit  rate  and  Zq 
the  characteristic  impedance  and  load  impedance,  resp. 
^N.out  the  spectral  noise  power  density  at  the 
terminated  amplifier  output,  h  Planck's  constant  and  v 
frequency  of  the  optical  input  signal. 

5,  RESULTS  AND  CONCLUSIONS 
In  fig.  5  the  transimpedance  vs  fi-equency  is  depicted. 
The  influence  of  the  length  of  transmission  lines  TL^ 
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Fig.  1:  Comparison  of  measured  and  modeled  HFET 
output  characteristics  using  the  large  signal 
EEHEMTl  model 


frequency  f/QHz  — ► 


Fig.  3:  Measured  and  calculated  noise  parameters 
based  on  the  temperature  noise  model  [2] 
(F(J5=-0.2V, 


Fig.  2:  Measured  and  modeled  S-parameters  using  the 
EEHEMTl  model 
(F(j^-0.2V,  F^5-1.2V) 
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Fig.  4:  Circuit  diagram  of  the  investigated 

transimpedance  amplifier  including 

transmission  lines  TL^  and  TLj  with  length 
and  I2,  respectively. 


frequency  f/GHz  — ► 

Fig.  5:  Dependence  of  transimpedance  and  voltage 
gain  on  transmission  line  length  0r^>  300pm, 
600pm,  900pm  2caAlf=li,  /2=3/2  Ij) 
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Fig.  6;  Impulse  response  in  dependence  on  the 
transmission  line  lengths  (/,=0,  300pm, 

600pm,  900pm  and  /;=/,,  /2=3/2  /;) 
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Fig.  8:  Eye  diagrams  for  40Gb/s  of  the  investigated 
transimpedance  amplifier 
(above;  without  transmission  lines, 
below:  including  transmission  lines) 
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ABSTRACT 

We  discuss  the  similarity  between  the 
plasma  waves  in  two  dimensional  systems  and 
sound  waves.  We  show  that  the  behavior  of 
both  types  of  waves  is  governed  by  the  same 
equations.  Therefore,  we  may  create  resonant 
structures  for  plasma  waves,  similar  to  those  in 
musical  instruments,  and  plasma  waves  can  be 
excited  by  a  dc  current,  just  like  wind  musical 
instruments  are  excited  by  air  jets.  However,  the 
plasma  wave  velocity  (on  the  order  of  10^  to 
10^  cm/s)  is  much  higher  than  the  sound 
velocity,  and  the  sizes  of  devices  with  the  two 
dimensional  electron  gas  are  very  small  (on  the 
order  of  a  micrometer  or  less).  Thus,  the  plasma 
wave  frequencies  are  in  the  terahertz  range,  and 
the  plasma  waves  can  be  coupled  to  far  infrared 
electromagnetic  radiation. 

1.  INTRODUCTION. 

When  the  electron  mean  free  path  for 
collisions  with  impurities  and  phonons  is  much 
greater  than  the  mean  free  path  for  electron- 
electron  collisions,  electrons  behave  like  a  fluid, 
which  may  be  described  by  hydrodynamic 

equations.  In  our  recent  papers  ^2  ^we  showed 
that  this  condition  can  be  met  for  two 
dimensional  (2D)  electrons  in  a  Field  Effect 
Transistor  (FET)  and  that  the  hydrodynamic 
equations  describing  this  electron  fluid  coincide 
with  those  for  shallow  water.  Furthermore, 
plasma  waves  in  the  FET  channel  are  similar  to 
shallow  water  waves.  We  also  showed  that  in  a 
short  enough  device,  an  instability  should  occur 
at  a  relatively  small  direct  current  because  of 
spontaneous  plasma  wave  generation.  This 
provides  a  new  mechanism  for  the  emission  of 
tunable  far  infrared  electromagnetic  radiation. 

In  this  work,  we  discuss  the  similarity 
between  the  plasma  waves  in  2D  systems  and 


sound  waves.  We  show  that  the  behavior  of  the 
plasma  waves  in  2D  systems  is  governed  by  the 
same  equations  as  for  sound  waves.  Therefore, 
resonant  structures,  similar  to  those  in  musical 
instruments,  may  be  realized  for  the  plasma 
waves,  and  these  waves  can  be  excited  by  a 
direct  current  just  like  wind  musical  instruments 
are  excited  by  air  jets.  However,  the  plasma 
wave  velocity  is  much  higher  than  the  sound 
velocity  and  the  FETs  are  very  small.  As  a 
consequence,  the  plasma  wave  frequencies  are  in 
the  terahertz  range.  These  plasma  waves  are 
accompanied  by  a  variation  of  a  dipole  moment 
created  by  charges  in  the  FET  channel  and  mirror 
image  charges  in  the  gate.  This  dipole  moment 
variation  should  cause  the  emission  of  far 
infrared  (terahertz  range)  electromagnetic 
radiation. 


2.  PLASMA  WAVES  AND  SOUND 
WAVES 

As  was  shown  in  1  and  mentioned  above, 
the  2D  electron  fluid  in  a  FET  is  described  by  the 
hydrodynamic  equations,  which  coincide  with 
those  for  shallow  water,  with  the  gate-to-channel 
voltage,  U,  playing  the  role  of  the  water  level. 
In  a  FET,  the  plasma  wave  dispersion  law  is 
given  by 

(£>=sk 

the  plasma  wave  velocity,  U  is  the  gate-to- 
channel  voltage  swing,  m  is  the  effective  mass, 
and  e  is  the  electronic  charge.  Plasma  waves  are 
similar  to  shallow  water  waves  or  to  sound 
waves  since  they  have  a  linear  dispersion  law. 

Shallow  water  behavior  is  also  similar  to 
the  dynamics  of  a  gas  with  pressure  proportional 
to  the  square  of  the  gas  density,  (see,  for 

example,  Thus,  the  nonlinear  hydrodynamic 
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equations  for  the  2D  electron  fluid  are  similar  to 
(but  not  identical  with)  the  equations  for  a  real 
gas,  such  as  air.  However,  the  linearized 
equations  describing  small-amplitude  plasma 
waves  in  a  FET  and  sound  waves  in  a  gas  are 
identical.  Since  the  linearized  equations 
determine  the  instability  threshold  for  a  steady 
flow  (i.  e.  the  wave  generation  threshold),  the 
instability  conditions  for  a  real  gas  and  for  a  2D 
electron  fluid  should  be  similar,  provided  that  the 
Reynolds  numbers  and  quality  factors  of 
resonance  cavities  are  the  same.  Below,  we  will 
show  that  these  parameters  for  our  "electronic 
flute"  may  be  of  the  same  order  of  magnitude  as 
for  a  conventional  flute. 


Air  Jet 


(a) 


Drain 


(b) 

Fig.  1.  Simplified  diagram  of  a  jet  driven  pipe 
musical  instrument  of  a  flute  family  (a.)  and 
electronic  flute  (b).  White  areas  in  Fig.  1  b 
show  the  gated  region  of  the  device  with  the 
2D  electron  fluid  in  the  channel.  ^ 


The  part  of  the  device  in  Fig.  1  b  shown  below 
the  dashed  line  is  the  electronic  flute. 

Fig.  la  (from  Ref.  4)  shows  the 
schematics  of  a  jet  driven  pipe  musical  instrument 
of  a  flute  family.  In  this  instrument,  an  air  jet 
excites  a  resonant  cavity  formed  by  the  pipe 
closed  at  both  ends.  A  similar  structure  can  be 
realized  using  a  modulation  doped  gated  device 
(see  Fig.  lb). 

The  top  gated  portion  of  the  device 
connected  to  the  drain  is  similar  to  the  outside  air 
space  for  a  jet  driven  musical  instrument.  We 
call  the  part  of  this  device  shown  below  the 
dashed  line  in  Fig.  lb  an  electronic  flute.  The 
remaining  gated  portion  of  the  device  connected 
to  the  drain  is  similar  to  the  outside  air  space  for  a 
jet  driven  musical  instrument.  In  the  electronic 
flute,  a  direct  current  flow  excites  plasma  waves 
in  the  resonance  cavity  in  the  same  way  as  an  air 
jet  excites  sound  waves  in  an  acoustic  cavity. 

In  Table  1,  we  compare  the  relevant 
parameters,  Reynolds  numbers  and  quality 
factors  for  a  conventional  flute  and  for  an 
electronic  flute.  One  can  see  that  the 
dimensionless  parameters  determining  the  wave 
generation,  i.  e.  the  Reynolds  numbers  and 
quality  factors,  for  the  electronic  flute  are  not  that 
different  from  those  for  a  conventional  flute, 
even  though  their  dimensional  parameters  (size, 
flow  and  wave  velocities,  and  frequency)  ^ffer 
by  many  orders  of  magnitude. 


Parameter 

Musical 
Instrument  of 
Flute  Family 

Electronic  Flute 

relevant 

dimension 

1  cm 

10-4  cm 

10  cm/s 

10^  cm/s 

ESESSSHlIi 

3x1 04  cm/s 

10^  cm/s 

viscosity 

0.15  cm2/s 

15  cm^/s 

frequency 

100- 104  Hz 

loll .  1013  Hz 

Reynolds 

number 

60 

60 

Quahty 

factor 

3-100 

Table.  1.  Parameter  comparison  for 
conventional  and  electronic  flutes. 
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Therefore,  we  presume  that  the  electronic  flute 
will  operate  in  a  manner  similar  to  a  conventional 
flute  but  in  a  terahertz  range  of  frequencies  (see  ^ 
for  more  details). 


1 ,  2,  3,  4  -  resonance 
cavities 


Electron  Flow 


Fig.  2.  Array  of  electronic  flutes  with  a  more 
efficient  coupling  of  plasma  waves  to 
electromagnetic  radiation.  The  plasma  wave 
oscillations  are  excited  in  the  peripheral 
resonance  cavities  by  2D  electrons  flowing 
from  the  source  to  the  drain.^ 

The  basic  structure  shown  in  Fig.  1  b  can 
be  modified  and  repeated  many  times,  and  this 
device  may  be  made  as  an  array  with  dimensions 
equal  to  a  quarter  of  the  wave  length  of  the 
electromagnetic  radiation  with  the  frequency 


coinciding  with  the  plasma  waves  for  the 
optimum  coupling  to  the  infrared  radiation.  For 
example,  in  the  device  shown  in  Fig.  2  the 
plasma  wave  oscillations  are  excited  in  the 
resonance  cavities  by  the  current  flowing 
between  the  source  and  drain.  Even  though  only 
a  few  resonance  cavities  are  shown  in  Fig.  2  (for 
simplicity),  for  the  optimum  coupling  to  the 
infrared  radiation,  the  basic  structure  can  be 
repeated  many  times  and  this  device  may  be  made 
as  an  array  with  dimensions  equal  to  a  quarter  of 
the  wave  length  of  the  electromagnetic  radiation 
with  the  same  frequency.  (Of  course,  just  as  for 
musical  instruments,  the  optimum  design  of  the 
electronic  flute  may  be  both  a  matter  of  skill  and 
art.) 

Observations  of  infra-red  absorption  ^ 
and  infra-red  emission  ^  in  a  silicon  FET  with  a 
grating  metal  structure  on  the  gate  indirectly 
confirm  the  possibility  of  exciting  the  plasma 
waves  in  FETs  and  their  coupling  to 
electromagnetic  radiation.  The  frequency  of  the 
observed  electromagnetic  radiation  was  equal  to 
the  frequency  of  the  plasma  waves  which  were 
excited  by  the  drain-to-source  current  and  were 
tuned  by  the  gate  bias. 

In  an  ungated  modulation  doped  structure 


CO  = 


9 

Aize  n 


em 


and  the  plasma  waves  are  equivalent  to  deep 
water  waves.  Even  though  for  ungated 
structures,  there  is  no  such  a  strong  similarity  to 
sound  waves  as  for  a  FET,  we  could  speculate 
that  similar  effects  may  occur  nevertheless. 


6.  SUMMARY 


In  conclusion,  a  complete  similarity 
between  the  plasma  waves  in  a  FET  and  sound 
waves  led  us  to  believe  in  the  possibility  of 
realizing  an  electronic  flute  based  on  the 
excitation  of  plasma  waves  by  a  direct  current  in 
gated  modulation  doped  structures  and  operating 
in  a  terahertz  range  of  frequencies.  Other  similar 
devices,  such  as  arrays,  can  be  designed  using 
the  resonant  structures  for  the  plasma  waves. 
These  resonance  properties  should  also  allow  us 
to  use  a  HEMT  as  a  detector  or  mixer  ^  operating 
in  a  terahertz  range  with  sensitivities,  which  may 
exceed  the  sensitivity  of  Schottky  barrier  diodes 
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ABSTRACT 

An  algorithm  for  conversion  of  analog  signals  into 
quaternary  numbers  is  described.  Sinuilalion  results  of  tlie 
algoritlim  are  reported.  A  flash  type  implementation  of  tlvis 
algoritlim  is  described. 

K  INTRODUCTION 

Systems  based  on  nuilliievcl  logic  and  higher  radix 
number  systems,  have  the  advantage  of  being  fasterby  a  factor 
of  log2R  over  binary'  systems,  where  R  is  tlie  radix  of  the 
number  system 

The  development  of  quatcrnaiy’  ADCs  is  taken  up  as  a 
first  step  towards  qnatenian*  systems  for  data  storage  and 
tnuismission.  The  algorillnn  developed  is  quite  parallel  to 
the  window  algorithm  for  (crnar\  systems  reported  in  (I). 

2.  QDATICRNARY  NUMBERS  AND 
CONVERSION  ALGORITHM. 

2.1  Quaternary  numbers 

Some  of  tltc  4  digit  quatenuuy’  numbers  mid  their  decimal 
equivalents  are  shown  in  Table  2  1.  it  is  observ  cd  that  while 
0,  is  fixed.  Q,  ciiaiigcs  as  1.2  and  1  and  the  decimal 
equivalent  changes  i’s  t),  to  ,12  etc.  Similar  pattern  can  be 
obserx  ed  for  and  (J,  if  we  consider  inlcrmediale  numbers, 
not  shown  in  the  mblc. 

It  is  clear  tliat,  to  resolve  Ave  must  divide  the  input 
range  into4  regions  (0,63.5).  (63.5.127.5).  (127.5,191.5)  and 
(191.5.255.5)  and  assign  values  0.  I,  2  and  3  to  Q^. 
respectively,  depending  upon  the  region  in  whicii  Uic  input 
lies,  lliis  can  be  done  by  compiiring  the  input  with  a  'window’ 
centered  at  127  5.  and  liaving  lower  threshold  at 
(127.5  -  64)  -  63.5  ajui  upper  threshold  at  ( 1 27.5  +  64)-  19 1 .5, 
The  comparison  can  be  done  in  two  ways.  The  input  is  directly 
compiircd  willi  127. .5  ±  64  or  Uie  offset  127.5  is  subtracted 
from  tlie  input  and  the  resulting  value  is  compared  wiUt  ±64. 
We  adapt  tlie  second  strategy.  It  has  the  advantage  of  being 
simpler  for  flash  type  implementation. 


We  note  from  Table  2. 1  that,  given  Q,  =  O.then  Q,  =  0 
for  decimal  equivalent  <  16  and  Q.^  =  3  for  decimal  equivalent 
>  48.  The  next  window^  is  thus  given  by  W^  31.5  ±  16.  For 
Q,=  l,2  and  3.  tlic  windows  for  resolving  Q,  arc  63.5±16, 
95.5±16  mid  I27.5±16  respectively.  It  is  clear  tliat  the  value 
of  Q,  decides  the  offsetio  be  subyacted  from  tlie  input  before 
we  compare  it  witli  ±16. 


2.2  Symbol  for  window  comparison. 

We  will  now  define  a  symbol  for  window  comparison  so 
that  we  identify  two  comparisons  simultaneously.  The  symbol 
is  “Z"  which  is  to  be  interpreted  as  combination  of  the 
mid  signs.  The  statement  ”Y  Z  ±X  T  will  mean  Y  >  +X 
?  or  Y  <  -X  T  and  will  be  read  as  *’  Y  compared  with  ±X’\ 
Similarly  the  statement "  Y  Z  X,,  X,  ?*’  means  " Y  >  X,  7  or  Y 
<  X,  T\  Note  the  order  of  >  and  <  signs  as  related  to  +  and  - 
sign  of  X  or  tlie  values  X^  mid  X,. 

2.3  Quaternary  conversion  algorithm. 


We  now  define  tlie  window  s  and  offsets 
system. 

INITIAL  OFFSBI' 


OFFSETS 
WINDOWS 
ANALOG  INPUT 
AT  VARIOUS 
STAGES 
SIGN  BIT 


;  -  (4-^)/2 


for  mi  n  digit 
k  0 

k-  1,2.3.. .11 


;  N  ,  ,  k  =  0  to  n 

nk'l 

:  B  -  0  for  positive  input, 

1  for  negative  input 


The  bivalued  variable  B  is  used  to  keep  track  of  tlie  sign 
of  the  input  at  various  stages. 

The  algoritlim  for  n  digit  quaternary'  numbers  is  as  below. 

1)  Initialize  k  =  0 

2)  Initial  input  N^^  N. 

3)  IfN,^>0,  B  =  0eIseB=  1 

■0  =  . 2.1 

5)  •>  Q„.,,  =  3orQ„,.,  =  () 
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else 

N„..,ZO  ?  Q„.,.,  =  2orQ„,,=  l 

fa(Qn....)=(Q..-.)W„.,..  -3)/2 

7)  N.,..  =  (N„.,.,-W.,,(-l)«)(f.(Q„.,.,))  + 

(N.*.,)(f,(Q.*,))  . 2,2 

8)  Increment  k  by  1. 

9)  If  k  *  n  then  N ,  “  N,  -  F.,  (*1)®  aitd  STOP, 
else  go  to  step  3. 

Eqn  2. 1  takes  care  of  positive  as  well  as  negative  inputs. 

Whenever  a  digit  at  position  (n-k-1)  is  0  we  have  to  add 
4“  ^-'  to  the  input  before  it  is  passed  on  to  the  next  stage.  If  the 
digit  is  3,  then  we  liave  to  subtract  4"  '“'  from  the  input.  No 
such  operation  is  requti  ed  when  tlic  digit  is  1  or  2.  Eqn.  2.2  is 
used  for  tliis. 

Tlic  value  N.,  is  the  quantization  error  defined  as 

If  the  input  to  it  stage  is  exactly  equal  to  a  threshold,  it 
can  be  assumed  to  be  liigher  or  lower  than  tlie  Uneshold.  The 
same  convention  should  be  used  throughout  the  conversion 
process. 

The  algorithm  is  illustrated  for  a  4  digit  scheme  as 
follows. 

Offsets:  F,  =  127.5,  F^  =  32,  F,  =  8,  F„  =  2,  F ,  =  0.5 
Windows:  ±W,  ~±64,  ±W,  =  ±16, 

±W,  =±4,  ±W„  =  ±1 

k  =  0 

Let  input  N  = +59.3  ThusB  =  0. 

N,  =  N  -  F,  (-1)®  =  59.3  -  127.5  =  -68.2 
N,Z±W,  ?  -68.2  Z±  64? 

Hence  Q,  =  0  and  B  ==  1 
f,(Q,)=randf3(Q,)=0 
Nj  =  -68.2  +  64  =  -4  2  , 
k=  1,B  =  1. 

Nj  =  N,  -  Fj  (-1)®  =  -4.2  -  32  (-1)'  =  27.8 
NjZ±Wj?  27.8  Z  ±16  ? 

Hence  Qj  =  3 

Nj  =  27.8-  16=  11.8, 

k  =  2,  B  =  0. 

N,  =Nj-F,  (-1)®  =  11.8-8  =  3.8 
N,  Z  ±W,  ?  3.8  Z  ±4  ? 

Hence  Q,  =  2 
f,(Q,)=Oandf,(Q,)=l 
N,  remains  mtchanged. 
k  =  3,B  =  0 


No  =  N,-Fo(-l)®  =  3.8-2=  1.8 

NoZ±Wo?  1.8  Z±l? 

Hence  Qg  =  3  and  tlic  conversion  is  complete. 

f,(Qo)=l  and  f, (0,^=0 

No  =  No-Wo=  1.8  -  1  =0.8 

k  =  4 

N ,  =  0.8  -  0.5  =  0.3 

The  quantization  error  is: 

59.3-(3xl6  +  2x4  +  3)  =  0.3 

It  is  verified  that  N ,  is  the  quantization  error.  The 
algorithm  produces  complementary  code  for  negative  inputs. 

3.  SIMULATION  RESULTS  AND 
IMPLEMENTATION 

The  algorithm  is  simulated  by  means  of  a  computer 
program  .The  error  is  computed  according  to  step  9.  It  is  found 
that  the  error  is  limited  to  ±1/2  of  Uie  least  significant  digit 
(LSD).  Fig.  3. 1  is  the  graph  of  quantization  error  vs.  input. 

The  circuit  to  resolve  the  most  significant  digit  is  shown 
in  Fig.  3.2.  When  two  more  such  cells  are  cascaded,  the 
complete  circuit  is  a  three  digit  quaternary  ADC.The  input 
range  is  ±  12.6  V  and  tlte  LSD  is  0.2  V.  .  The  quaternary 
digits  3, 2, 1  and  0  are  represented  by  levels  0,  -1,  -2  and  -3  V 
.respectively.  The  analog  output  of  the  last  stage  is  the 
quantization  error.  Fig.  3.3  shows  this  output  for  a  triangular 
wave  input  of  10  V  amplitude.  We  see  that  the  error  is  within 
±  0.1  V.  Tire  circuit  settles  witliin  1.5  ps  for  a  step  input. 

There  is  no  need  of  clocked  operation  and  the  circuit  is 
essentially  a  flash  ADC.  The  conversion  time  depends  on 
the  propagation  speed  of  signals  through  the  chain  of  digit 
cells.The  complexity  grows  linearly  with  the  number  of 
quaternary  digits.  Hence  the  advantages  are  similar  to  tliose 
reported  in  [2]  for  the  binaiy'  ADCs. 

4.  CONCLUSIONS. 

It  is  possible  to  obtain  a  flash  quaternary  ADC  with 
linearly  increasing  complexity.  The  simulation  indicates  that 
the  error  is  within  ±1/2  LSD. The  error  output  of  the  circuit  is 
also  found  to  be  mostly  within  ±0.1  V. 
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Abstract 

A  new  method  of  non-linear  system  design,  and  non-linear 
subsystem  modelling  is  presented.  The  method  is  based  on 
identifying  a  non-linear  model  for  each  subsystem  and 
combining  the  individual  models  to  solve  the  entire  system. 
Time  domain  modelling  and  simulation  is  used  throughout 
the  procedure  but  frequency  domain  characteristics  are  also 
available.  Modelling  of  non-linear  subsystem  is  achieved  by 
time  domain  measurement  which  offers  considerable  saving 
in  measurement  time.  The  identified  models  are  not  based 
on  equivalent  circuits  and  this  offers  flexibility  and  saving  in 
the  amount  of  data  stored  for  each  model. 

1.  New  Approach  to  System  Design 

This  paper  describes  a  powerful  technique  of  modelling 
linear  and  non-linear  systems  that  leads  to  very  efficient 
methods  of  measurement  and  simulation.  The  measurement 
are  carried  out  in  the  time  domain  and  this  results  in  a  much 
reduced  effort  and  volume  of  the  measured  data  compared 
to  frequency  domain  measurements.  In  the  frequency 
domain  a  large  number  of  measurements  have  to  be  carried 
out  to  determine  all  four  scattering  parameters,  in  amplitude 
and  phase,  as  functions  of  frequency  and  at  a  number  of 
power  levels.  In  the  time  domain  the  response  is  a  real 
function  (no  phases)  and  usually  much  fewer  power  levels 
and  time  points  are  required  to  model  non-linear  systems. 

The  identification  does  not  use  equivalent  electrical  circuits. 
The  basic  elements  of  the  identified  systems  are  : 

Multipliers  which  can  be  constant  or  non-linear 
functions  of  any  signal  in  the  system. 

Delays  which  can  also  be  non-linear. 

Differentiators  and  Integrators. 

Adders. 

The  identification  is  performed  entirely  in  the  time  domain 
and  produces  a  very  efficient  process  that  can  identify  any 
non-linear  system  with  any  non-linearity  and  with  fewer 
elements  than  the  usual  equivalent  circuit  approach.  If  a 
transversal  structure  is  used,  then  the  stability  of  the 


identified  model  is  guaranteed. 

In  the  developed  procedure,  subsystems  such  as  amplifiers, 
mixers,  filters  ...  etc  are  identified  separately  and  then  the 
subsystems  are  connected  together  to  make  a  complete 
system.  Measurements  on  the  complete  system  are  then 
performed  and  compared  to  the  results  of  the  simulations. 

It  should  be  also  mentioned  here  that  this  technique  will 
require  separate  identification  of  the  transfer,  feedback  and 
reflection  characteristics  of  each  subsystem  in  order  to 
simulate  the  interaction  between  the  subsystems  when 
connected  together. 

The  advantages  of  this  approach  are  summarized  below: 

A  large  system  can  be  divided  into  smaller  subsystems 
and  the  models  for  each  subnetwork  are  derived 
separately. 

A  library  of  subsystems  models  can  be  developed  and 
stored  for  future  use  without  the  need  of  an  equivalent 
circuit.  This  includes  amplifiers,  mixers,  oscillators, 
filters  and  couplers. 

The  model  characterising  a  non-linear  subsystem  can  be 
derived  to  match  experimental  data  without  the  need  to 
develop  a  physically  realizable  equivalent  circuit.  This 
gives  a  greater  flexibility  in  modelling  active  devices. 

n.  Time  Domain  IdentiHcation  of  Non-Linear 
Subsystems 

An  example  of  time  domain  nonlinear  model  structure  is 
shown  in  Fig.  1.  This  general  non-linear  model  structure 
can  simulate  linear,  non-linear,  lumped  and  distributed 
system.  The  modelling  process  is  to  extract  the  parameters 
and  the  non-linear  functions.  This  modelling  method  treats 
the  subsystem  as  a  black  box.  No  internal  information  of  the 
subsystem  has  to  be  known,  but  only  the  external  measurable 
characteristics 

As  an  example  a  receiver  is  to  be  constructed  using  an 
amplifier,  a  double  balanced  mixer  and  a  band-pass  filter. 
The  system  is  to  mix  two  signals  at  0.277  Ghz  and  0.347 
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Ghz  and  produce  an  IF  signal  at  0.07  Ghz. 

First  each  of  the  subsystems  is  identified  separately,  then  the 
whole  system  is  constructed  and  the  measured  and  simulated 
response  of  the  overall  system  are  compared. 

a.  Amplifier 

An  amplifier  whose  measurement  and  modelled  pulse 
responses  are  shown  in  Fig.  2  and  3.  The  extracted  non¬ 
linear  model  can  simulate  the  saturation  characteristics  of 
the  amplifier  as  shown  in  Fig.  4.  The  alternative  frequency 
domain  method  requires  a  large  number  of  frequency 
domain  measurements  at  different  input  frequencies  and 
power  levels.  Impulse  response  methods  can  be  used  to 
characterise  linear  circuits  or  systems.  It  is  more 
complicated  in  non-linear  system  as  the  waveform  of  the 
identifying  input  are  important  and  each  system  will  have  to 
be  identified  by  a  time  domain  function  appropriate  to  its 
characteristics  and  degree  of  non-linearity. 

b.  Double  Balanced  Mixer 

A  double  balanced  diode  mixer  is  also  modelled  in  time 
domain  and  the  measurement  and  modelled  output  spectrum 
are  shown  in  Fig.  5.  The  nonlinear  model  can  predict  the 
conversion  loss  of  the  mixer  as  shown  in  Fig.  6. 

c.  Band-Pass  Filter 

A  linear  model  of  a  band  pass  filter  is  extracted  using  time 
domain  method  and  the  measured  and  modelled  frequency 
responses  are  shown  in  Fig.  7. 


The  whole  system  is  then  simulated  using  the  identified 
subsystem  models  and  the  characteristics  of  the  system 
is  obtained  in  either  the  time  or  frequency  domains. 
Finally  measurements  on  the  whole  system  should 
confirm  the  simulated  results. 

IV.  Conclusions 

A  new  method  of  subsystem  modelling  and  system 
simulation  has  been  developed.  Non-linear  modelling  and 
simulation  are  performed  in  time  domain.  Black  box  model 
is  used  instead  of  equivalent  circuit  model.  The  system 
simulation  can  deal  with  any  number  of  subsytems  and  any 
arbitrary  connection.  A  receiver  system  is  modelled  and  the 
simulation  results  agree  with  measurements. 
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in.  System  Simulation 

The  characteristics  of  the  complete  system  are  obtained 
from  simulation  after  the  modelling  of  each  subsystem  is 
achieved.  The  overall  gain  of  a  receiver  system  composed 
of  the  above  mixer,  band  pass  filter  and  amplifier  as  shown 
in  Fig.  8  is  measured.  The  measurement  and  the  simulation 
results  are  shown  in  Fig.  9  and  10.  Simulated  and  measured 
waveforms  at  diffeent  check  points  of  the  system  are  shown 
in  Fig.  11. 

The  procedure  for  system  design  can  be  summarised  as 
follows: 

.  The  charactCTistics  of  each  subsystem  is  measured  using 
time  domain  measurements  with  an  appropriate  time 
function. 

A  linear  or  non-linear  model  is  identified  to  fit  the  time 
domain  measurements. 
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Fig.  5  Output  Spectrum  of  The  Mixer 


Fig.  3  Negative  Pulse  Response  of  Model  and  Measurement 
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ABSTRACT 

A  novel  computer-aided  impedance  charac¬ 
terization  has  been  done  on  the  basis  of  annlyticjil 
model  equations  and  suitable  computer  programs 
for  the  microwave  resonant-c^  circuits  used  in 
IMP  ATT  sources.  This  provides  the  d^)endence 
of  the  load  impedance  (terminating  the  cap  cav¬ 
ity)  and  also  of  the  real  and  imaginary  parts  of  the 
cap  circuit  in^^edance  (at  the  device  plane)  on  the 
cavity  parameters  and  frequency.  The  results  are 
in  good  qualitative  agreement  with  the  previously 
reported  experimental  observations  and  can  give  a 
better  physical  insight  regarding  the  important 
characteristics  of  the  cap  circuit  and  optimum  p^- 
formance  of  the  cap-type  IMP  ATT  sources. 

1.  INTRCMJUCTION 

The  microwave  and  millimeter-wave  IMP  ATT 
sources  can  be  easily  and  efficiently  designed  ly 
embedding  the  device  in  a  resonant-c^  cavity. 
Many  researchers  have  carried  out  experimental 
studies  on  the  various  aspects  of  the  c^type 
IMPATT  and  also  Ouim  sources  [1-4].  But  a  de¬ 
tailed  and  coherent  analytical  investigation  and 
computed  results  for  the  in^rtant  properties  of 
the  cap  circuit  and  optimum  performance  of  the 
cap-type  sources  are  not  readily  available  in  pub¬ 
lished  Uteratures. 

The  author  has  thus  carried  out  the  analytical  mod¬ 
eling  and  computer-aided  solution  of  the  model 
equations  to  characterize  the  impedance  properties 


of  the  resonant-cap  circuits.  At  first  the  analytical 
expressions  have  been  derived  for  the  real  and 
im^inary  parts  of  the  cap  circuit  impedance  at  the 
device  plane  by  ai^>roxiinating  the  resonant-cap 
cavity  as  a  radial  transmission  line.  The  load  im¬ 
pedance  terminating  the  cap  cavity  has  been  ob¬ 
tained  from  resonance  condition  of  the  c^-tyj)e 
IMPATT  oscillator.  Computations  were  carried 
out  to  obtain  the  variation  of  the  real  j)art  of  the 
resonant-cap  inqjedance  with  the  cavity  parame¬ 
ters  i.e.  cap  hei^  (h)  and  c^  diameter  (D).  Also 
the  frequency  dependence  of  the  imaginary  part  of 
the  c^  circuit  impedance  was  computed  for  dif¬ 
ferent  p>arametric  values  of  h  &  D.  Finally,  the 
normalized  cap  circuit  impedance  was  plotted  in 
an  indigenously  develop)ed  PC  generated  Smith 
Chart.  On  the  basis  of  these  anatytically  derived 
con^>uted  results  mary  of  the  in^x>rtant  projjer- 
ties  of  the  resonant-cap  circuit  and  of  the  c^tyjje 
sources  including  the  experimental  results  of  the 
earlier  researchers  are  now  understood  more 
clearly. 

2.  THE  MODEL  EQUATIONS 

In  a  resonant-ciq}  tyjje  source  the  device  is  embed¬ 
ded  in  a  high-Q  cqp  cavity  ar>d  the  two  together 
mounted  in  a  rectangular  waveguide  through 
which  the  source  is  connected  to  the  load  as  shown 
in  Fig.l.  The  disc  of  the  resonant-cap  and  the 
bottom  broad  wall  of  the  rectangular  waveguide 
forms  the  cap  cavity  which  is  basically  a  radial 
transmission  line  causing  efficiwit  power  transfer 
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from  the  device  to  the  load.  The  analytical  solution 
of  the  e.m.  fields  witliin  tlie  cap  cavity  leads  to  tlie 
following  expressions  for  tlie  and  imaginary 
parts  of  the  cap  circuit  impedance  at  the  device 
plane  (i.e.  at  r  =  n) : 


where  Cd  is  the  total  diode  capacitance  and  tV  is 
tlie  resonant  frequency  of  the  oscillator 

3.CX)MPUTED  RESULTS  AND  DISCUSSIONS 

The  analytically  derived  model  equations  have 
been  evaluated  through  suitable  computer  pro¬ 
grams.  The  values  of  Zl  for  different  cap  diameter 
(D)  and  cap  height  (h)  is  computed  from  eqn.(4) 
utilizing  the  experimentally  measured  fr  for 
resonant-cap  IMP  ATT  oscillators  and  is  shown  in 
the  following  table  : 


^  =  T->  is  the  resonant  wavelength. 

Kr 

h,  ri,  rLand  ZLare  indicated  in  Fig.  1.  While 
Jm  (kr)  and  Nm  (kr)  are  the  m*  order  Bessel  and 
Neumann  functions  respectively. 

The  load  impedance  Zl  terminating  the  cip  cavity 
can  be  obtained  from  the  oscillation  condition  for 
an  IMP  ATT  device  embedded  in  a  resonant-cap 
cavity  and  is  given  by  : 


(4) 


D 

Zl 

h 

Zl 

425 

397 

80 

324 

475 

385 

90 

341 

500 

360 

too 

360 

525 

354 

120 

371 

550 

329 

140 

369 

(D  «fe  h  in  mils,  Zl  in  ohms.) 


It  may  be  seen  that  maximum  power  output  is 
expected  with  a  cap  circuit  having  D  =  500  mils 
and  h  =  100  mils  for  which  Zl  -  360  Q ,  because 
of  optimum  impedance  matching  with  the  charac¬ 
teristic  impedance  (Z©  ~  360  Q)  of  rectangular 
waveguide  (at  f  ~  10  GHz)  which  is  supported  by 
the  experimental  observation  by  the  author. 

The  plot  of  RejZi]  vs  D  and  h  computed  from 
e(pi.(l)  (wherein  Zl  values  are  used  from  the 
above  table)isshowninFig.2.  The  value  of  Re[Zi] 
is  found  to  change  significantly  (  0. 105  /  mil ) 

with  cap  height  (h)  while  its  change  is 
comparatively  insignificant  (  0.018  Q  /  mil  ) 
with  c^  diameter(D)  which  agrees  quahtatively 
with  the  experimental  observations  of  Doling  and 
Seebald  [5]  and  can  also  account  for  the  experi¬ 
mentally  observed  predominant  change  of  gain 
with  c^  height  for  the  cq>-type  IMPATT  amph- 
fier  as  reported  by  Kar  &  Roy  [6].  It  may 
further  be  observed  that  the  cap  circuit  resistance 
Rc  =  5.288  Q  (  for  D  =  500  mils  and  h  =  100  mils, 
vide  Fig.  2)  nearly  equals  the  negative  resistance 
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of  the  IMPATT  device  Rd  ~  5  Q  (  as  specified  by 
the  diode  manufacturers  )  thereby  giving  optimum 
oscillator  performance. 

The  frequency  dependence  of  the  Im[ZiJ  with  D  or 
h  as  variable  parameters  are  computed  from 
eqn.(2)  and  plotted  in  Figs.  3  &  4  respectively,  It 
is  seen  from  Fig.  3  that  each  cap  diameter  corre¬ 
sponds  to  a  specific  resonant  frequency  ( i.e.  the 
frequency  at  which  hn  [Zi]  =  0)  above  which  the 
cap  circuit  impedance  is  inductive  and  below 
which  it  is  capacitive.  These  anal5rticalfy  com¬ 
puted  results  are  in  agreement  with  the  experimen¬ 
tal  observations  of  Bates  [3].  The  nature  of  the 
c^es  in  Figs.  3  &  4  can  be  understood  with 
single  physical  reasoning.  The  curves  of  Fig.  3 
can  be  fitted  to  an  empirical  formula :  Dfc  =  const. 

(  where  f;  is  the  circuit  det^mined  resonance 
frequency)  which  is  in  agreement  with  the  experi¬ 
mental  observations  by  Karetal.  [2].  However,  the 
clusterii^  of  the  curves  around  a  typical  frequency 
of  10.55  GHz  for  different  cap  heights  is  due  to  the 
fact  that  in  the  theoretical  analysis  the  post  struc¬ 
ture  P2  under  the  disc  (vide  Fig.  1)  was  not  consid¬ 
ered.  When  the  effect  of  P2  is  considered  an  extra 
inductance  (  which  increases  with  cap  height  ) 
comes  into  the  circuit  and  hence  :fca\/h&s  was 
observed  experimentally. 

A  package  for  the  Smith  Chart  has  been  developed 
on  u4uch  the  nature  of  the  cap  circuit  ifnp«»Hanofr 
can  be  displayed  with  color  graphics  for  different 
D  and  h  values.  The  photographs  showing  the  plot 
of  the  resonant-cap  impedance  (normalized  w.  r.  t 
the  characteristic  impedance  of  the  radial  line  at 
the  device  plane  )  for  two  different  D  and  h  values 
are  shown  in  Figs.  5  (a)  &  (b)  respectively.  It  is 
s»en  that  the  real  part  of  the  cap  circuit  impedance 
is  predominantly  determined  by  the  cap  height 
(vide  F ig.  5(b))  while  its  change  with  cap  diameter 
is  very  small  ( vide  Fig.  5(a)).  Also  the  cap  circuit 
impedance  is  clearly  seen  to  be  capacitive  below 
the  resonant  frequency  (i.e.  below  X  =  0  line  in 
the  Smith  Chart )  and  it  is  inductive  above  the 
resonant  frequency  for  different  cap  height  or 
cap  diameter 


4.  CONCLUSION 

The  analytic  model  and  the  computed  results  rep¬ 
resenting  the  impedance  characteristics  of  the 
cap-type  microwave  circmt  presented  in  this  paper 
is  expected  to  be  a  sound  Ixisis  for  understanding 
the  in^rtant  properties  of  the  cap  circuit  and  the 
optimum  performance  of  the  cap-type  IMPATT 
sources. 
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ABSTRACT 

In  this  paper,  we  present  a  newly  developed  bandpass 
filter  (BPF)  using  aXgo/2  coplanar  waveguide  (CPW) 
resonator  with  the  short  ends  and  the  input/output  ports, 
which  are  constructed  by  the  coupled  shorted-short-line 
(C-SSL).  The  frequency  characteristics  of  the  input 
impedance  of  the  filters  are  also  specified.  Next,  a 
method  for  suppressing  spurious  resonance  responses  of 
the  filters  is  discussed.  Using  the  nodal  analysis,  the  circuit 
simulation  and  the  electromagnetic  fields  solver 
computation,  the  transmission  characteristics  of  the 
spurious  suppression  over  the  wide  frequency  range  of 
the  CPW-BPF  is  specified.  The  spurious  resonance 
responses  of  the  CPW-BPF  are  suppressed  more  than 
-45.0  dB  between  3.0  and  20.0  GHz  . 

1.  INTRODUCTION 

Recently,  the  mobile  communication  systems  are 
changing  from  analog  systems  to  digital  systems.  Die 
microwave  filters  are  required  very  severer  specifications 
such  as  the  suppressing  harmonic  resonance  responses  over 
the  wide  frequency  range. 

In  this  paper,  we  described  the  properties  of  the  Xgo/2 
Coplanar  Waveguide  (CPW)  resonator  with  the  short  ends 
and  the  input/output  ports,  which  are  constructed  by  the 
Coupled  Shorted-Short-Line  (C-SSL),  and  the 
transmission  characteristics  of  the  bandpass  filters  (BPFs) 
using  the  resonator.  Using  the  nodal  analysis,  the  circuit 
simulation  and  the  electromagnetic  fields  solver  com¬ 


putation,  the  transmission  characteristics  of  the  spurious 
suppression  over  the  wide  frequency  range  of  the  CPW- 
BPF  is  specified.  In  the  results  of  the  theoretical  analysis 
and  calculation,  the  new  type  CPW-BPFs  have  the 
excellent  spurious  suppression  characteristics. 

2.  BANDPASS  FILTER  USING  Xgo/2  CPW- 
RESONATOR  WITH  THE  SHORT  ENDS 

As  you  well  know  the  spurious  harmonic  resonance 

responses  of  tlie  BPF  using  the  X«o/2  resonators  with  the 
short  ends  arise  at  the  frequencies  of  the  integral  multiples 
of  the  center  frequency  fo. 

In  the  past,  we  used  the  transforming  sections  using 
the  Xgo/4  coupled  transmission-lines  or  the  tapping  feed 
for  input/output  feeding  in  the  conventional  type  Xgo/2 
CPW-resonator  BPF.  On  the  newly  developed  CPW- 
BPFs,  we  have  been  used  C-SSL  input/output  instead  of 
the  above  feeding  structures.  Using  the  C-SSL  input/output 
configurations  and  the  loaded  capacitors,  these  CPW- 
BPFs  can  be  achieved  the  wide  suppression  spurious 
harmonic  resonance  responses. 

First,  the  circuit  model  #1  of  the  BPF  using  a  Xgo/2 
CPW-resonator  with  the  short  ends  and  input/output  ports, 
which  are  constructed  by  the  C-SSL  is  shown  in  Figure  1. 
The  ABCD-matrix  Ka  of  the  circuit  model  #1  is  following 
equation  (1). 

Secondly,  the  circuit  model  #2  of  the  CPW-BPF  using 
the  very  short  resonator  with  the  short  ends  attached  shunt 
capacitor  at  the  center  of  the  one  and  the  C-SSL  input/ 
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output  structures  is  shown  in  Figure  2.  The  ABCD-matrix 
Kb  of  the  circuit  model  #2  is  following  equation  (2). 

Finally,  the  circuit  model  #3  of  the  CPW-BPF  using 
the  very  short  resonator  with  the  short  ends  attached  shunt 
capacitor  at  the  center  of  the  one  and  furthermore  the 

C-SSL  attached  the  shunt  capacitors  at  the  input/output 
ports  is  shown  in  Figure  3.  The  ABCD-matrix  Kc  of  the 
circuit  model  #3  is  following  equation  (3). 

The  equivalent  circuits  of  these  CPW-BPFs  are  shown 


in  Figs.  1, 2  and  3,  respectively.  By  the  theoretical  analysis 
and  calculation  for  these  configurations  in  the  Figs.  1,  2 
and  3,  the  input  impedance  characteristics  of  these  BPF 
and  the  transmission  characteristics  of  these  ones  are 
shown  in  Figs.  4  (a)  and  (b),  5(a)  and  5(b)  and  6(a)  and 
6(b),  respectively.  Fig  .7  shows  the  transmission 
characteristics  of  the  CPW-BPF  with  the  C-SSL  in  Fig.l 
by  the  electromagnetic  fields  solver  computation. 
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Fig.l  The  circuit  model  #1  and  its  equivalent  circuit  of  the  BPF 
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Fig.2  The  circuit  model  #2  and  its  equivalent  circuit  of  the  BPF 
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Fig.3  The  circuit  model  #3  and  its  equivalent  circuit  of  the  BPF 
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Fig.  4  (a)  The  normalized  input  impedance  and  (b)  the  transmission  characteristics 
of  the  circuit  model  #1 


Fig.5  (a)  The  normalized  input  impedance  and  (b)  the  transmission  characteristics 
of  the  circuit  model  #2 
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ABSTRACT 

The  application  of  joint  time-frequency  techniques  to  extract 
the  scattering  mechanisms  contained  in  electromagnetic 
backscattered  data  is  reviewed.  Research  activities  on 
interpreting  the  scattering  phenomenology  in  the  time- 
frequency  plane,  methods  to  improve  the  resolution  of  joint 
time-frequency  processing,  and  imaging  algorithms 
combining  joint  time-frequency  techniques  with 
conventional  inverse  synthetic  aperture  radar  processing  are 
discussed. 


1.  INTRODUCTION 

In  radar  signature  applications,  the  electromagnetic 
backscattered  returns  from  a  complex  target  are  commonly 
processed  in  either  the  time  or  the  frequency  domain  in  order 
to  extract  useful  target  features  for  diagnostic  and 
identification  purposes.  For  example,  scattering  centers  are 
manifested  in  the  time  domain  as  distinct  pulses  and  can  be 
related  to  the  local  features  on  the  target.  Similarly,  the 
natural  resonances  of  a  target  are  manifested  in  the  frequency 
domain  as  sharp,  discrete  events  and  can  be  attributed  to  the 
unique  global  features  of  the  target.  For  target 
characteristics  which  are  not  immediately  apparent  in  either 
the  time  or  the  frequency  domain,  the  joint  time-frequency 
representation  of  the  electromagnetic  backscattered  data  can 
sometimes  provide  more  insight  into  the  echo  interpretation 
process.  The  usefulness  of  the  time-frequency 
representation  of  signals,  of  course,  has  long  been 
recognized  in  the  signal  processing  arenas.  In  the 
electromagnetic  scattering  community,  the  joint  time- 
frequency  analysis  was  introduced  by  Moghaddar  and  Walton 
[1]  to  explain  the  measurement  data  from  an  open-ended 
waveguide  cavity.  The  goal  of  this  paper  is  to  provide  an 
overview  of  the  progress  made  in  the  past  several  years  in 


the  application  of  joint  time-frequency  techniques  to 
electromagnetic  signature  data. 

2.  SCATTERING  PHENOMENOLOGY  IN  THE 
JOINT  TIME-FREQUENCY  PLANE 

The  standard  tool  in  generating  the  time-frequency 
representation  of  a  signal  is  the  short-time  Fourier 
transform  (STFT).  It  is  basically  a  sliding  window  Fourier 
transform  in  time.  By  taking  the  Fourier  transform  of  the 
windowed  time  data  as  the  window  is  moved  in  time,  a  two- 
dimensional  time-frequency  image,  or  the  spectrogram,  is 
generated.  Spectrogram  provides  information  on  the 
frequency  content  of  the  signal  at  different  time  instances. 
Alternatively,  the  STFT  can  also  be  interpreted  as  a  sliding 
window  Fourier  transform  in  frequency  where  the  fiequency 
window  function  is  given  by  the  Fourier  transform  of  the 
time  window.  In  electromagnetic  applications,  the  joint 
time-frequency  image  can  be  used  to  simultaneously  display 
the  different  scattering  phenomenology  due  to  scattering 
centers,  target  resonances  and  dispersive  mechanisms. 
Shown  in  Fig.  1  are  the  time-frequency  features  of  the 
various  mechanisms.  A  scattering  center,  which  is  a 
discrete  event  of  time,  shows  up  as  a  vertical  line  (Fig. 
1(a))  as  it  occurs  at  a  particular  time  instance  but  over  all 
frequencies.  A  resonance,  which  is  a  discrete  event  of 
frequency,  shows  up  as  a  horizontal  line  (Fig.  1(b)). 
Dispersive  phenomena,  on  the  other  hand,  are  characterized 
by  slanted  curves  in  the  time-frequency  image.  For 
example,  surface  wave  mechanisms  due  to  material  coatings 
are  characterized  by  curves  with  a  positive  slope  in  the 
time-frequency  image  (Fig.  1(c)).  This  is  because  as  the 
frequency  is  increased,  the  surface  wave  becomes  more 
tightly  bound  to  the  material  layer,  and  consequently, 
propagates  at  a  slower  velocity  and  results  in  a  longer  time 
delay.  Another  type  of  dispersion  is  due  to  waveguide 
structures,  such  as  those  found  in  inlet  ducts  and  slotted 
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(a)  Scattering  Center 


(b)  Resonance 


(c)  Material  Dispersion 


time 


(d)  Structural  Dispersion 


Fig.  1 .  Scattering  mechanisms  are  manifested  in  the  joint  time-frequency  image  as  distinct  features. 


waveguide  antennas.  These  "structural  dispersion" 
mechanisms  are  characterized  by  curves  with  a  negative 
slope  in  the  time-frequency  image  (Fig.  1(d)).  This  can  be 
explained  by  the  fact  that  as  the  frequency  is  lowered,  one 
approaches  the  cutoff  frequency  of  a  waveguide  mode  and 
the  propagation  delay  associated  with  this  mechanism 
approaches  infinity.  All  of  the  above  mentioned 
phenomena  have  been  observed  in  a  wide  variety  of  targets, 
from  simulation  data  on  canonical  structures  [1],[2],[4]  to 
measurement  data  on  complex  platforms  [1],[3].  In  general, 
the  time-frequency  representation  is  particularly  effective  for 
identifying  scattering  mechanisms  in  targets  containing 
"sub-skinline"  structures  such  as  inlet  ducts,  antenna 
windows  and  material  coatings.  For  full-scale  targets,  these 
effects  are  most  dominant  in  the  UHF  frequency  ranges. 


3.  IMPROVEMENTS  TO  THE  TIME- 
FREQUENCY  PROCESSING  ENGINE 

The  insights  gained  in  the  time-frequency  plane  come  at  the 
price  of  resolution.  The  time-frequency  image  generated  by 
the  STFT  is  limited  in  resolution  by  the  extent  of  the 
sliding  window  function.  Smaller  time  window  results  in 
better  time  resolution,  but  leads  to  worse  frequency 
resolution.  This  is  a  well-known  tradeoff  of  the  time- 
frequency  display.  To  overcome  the  resolution  limit  of  the 
STFT,  a  number  of  alternative  time-frequency 
representations  developed  in  the  signal  processing 
community  have  been  utilized  to  analyze  electromagnetic 
scattering  data.  They  include  the  Wigner-Ville  distribution 
[1],  the  continuous  wavelet  transform  [2],  superresolution 
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algorithms  [4]  and  adaptive  techniques  [5].  The  well-known 
Wigner-Ville  distribution  provides  good  localization  of 
scattering  mechanisms  at  the  expense  of  additional  cross 
terms  which  lead  to  "ghosts"  in  the  time-frequency  image. 
The  wavelet  transform  utilizes  multi-scale  basis  functions 
to  provide  multiresolution  capability.  For  frequency- 
domain  radar  echo  which  consists  of  both  small-scale 
natural  resonances  and  large-scale  scattering  center 
information,  the  wavelet  transform  defined  in  the  frequency 
domain  is  better  suited  for  resolving  these  multi-scale 
events  of  frequency  than  the  STFT. 

In  either  the  STFT  or  the  wavelet  transform,  the  resolution 
in  the  time-frequency  display  is  limited  by  the  extent  of  the 
sliding  window  function  as  opposed  to  the  full  bandwidth  of 
the  signal.  To  further  improve  the  Fourier-limited 
resolution  in  the  time-frequency  plane,  the  processing  of  the 
data  within  each  time  or  frequency  window  using  modem 
superresolution  techniques  such  as  ESPRIT,  MUSIC  or 
matrix -pencil  algorithms  have  been  investigated.  Super- 
resolved  parameterization  retains  the  advantage  of 
simultaneous  time-frequency  display  while  overcoming  the 
resolution  issue.  However,  additional  processing  is  needed 
to  fully  parameterize  the  data,  especially  when  dispersive 
mechanisms  are  present.  Furthermore,  the  robustness  of 
the  algorithms  to  noise  needs  to  be  carefully  considered. 

The  use  of  wavelet  is  a  step  towards  variable  resolution  in 
the  time-frequency  plane.  It  is,  however,  rather  rigid  in  its 
particular  form  of  the  time-frequency  grid.  Flexible 
resolution  in  the  time-frequency  plane  to  accommodate 
components  of  the  signal  with  different  resolution  is  highly 
desirable.  Recently,  a  signal  representation  scheme  that 
uses  adaptive  normalized  Gaussian  functions  was  introduced 
[5].  Unlike  the  STFT  and  wavelet  decomposition,  the  time 
and  frequency  resolution  as  well  as  the  time-frequency 
centers  are  adjusted  to  best  match  the  signal.  The  adaptive 
spectrogram,  a  signal  energy  distribution  in  the  joint  time- 
frequency  domain  based  on  this  adaptive  representation,  can 
be  applied  to  backscattered  signals  containing  scattering 
centers  and  resonances  with  very  high  resolution  in  both 
time  and  frequency.  Issues  that  need  to  be  further  addressed 
include  the  uniqueness  of  the  representation,  computational 
complexity  and  efficient  treatment  of  dispersive 
mechanisms. 


4,  JOINT  TIME-FREQUENCY  ISAR 
PROCESSING 

While  the  joint  time-frequency  processing  is  applicable  to 
single  aspect  data,  it  can  also  be  combined  with  the 
traditional  ISAR  (inverse  synthetic  aperture  radar)  technique 


to  make  use  of  the  additional  angular  information.  The 
ISAR  imaging  algorithm  is  basically  a  spatial  mapping 
algorithm  for  point  scatterers.  For  targets  which  can  be 
well  described  by  a  collection  of  point  scatterers, 
conventional  ISAR  imagery  provides  an  ideal  display  of 
target  features.  However,  when  the  target  contains  other 
non-point  scattering  mechanisms,  the  resulting  ISAR 
image  will  contain  artifacts  which  do  not  correspond  to  the 
spatial  features  on  the  target.  For  instance,  the  engine  inlet 
return  of  an  airplane  usually  shows  up  as  a  large  cloud 
outside  of  the  airframe  structure  in  the  ISAR  image.  A 
joint  time-frequency  ISAR  can  be  formed  by  viewing  the 
point  scatterers  in  the  conventional  ISAR  plane  while 
placing  other  non-point  scattering  mechanisms  in  the  time- 
frequency  plane.  Instead  of  the  conventional  (range)-(cross 
range)  display,  this  is  accomplished  by  applying  joint  time- 
frequency  processing  to  the  range  (or  time)  axis  to  gain  an 
additional  frequency  dimension.  The  resulting  (range)-(cross 
range)-(frequency)  ISAR  allows  us  to  view  the  frequency 
behavior  of  the  scattering  mechanisms  not  available  in 
conventional  ISAR.  For  aircraft  signatures,  for  example,  it 
is  possible  to  remove  the  highly  resonant  engine  inlet 
contribution  from  the  ISAR  image  and  view  it  separately  in 
the  joint  time-frequency  plane. 

Similar  to  the  above  idea,  the  joint  time-frequency 
processing  can  also  be  applied  to  the  cross  range  (or 
Doppler  frequency)  axis  of  the  ISAR  image.  This  technique 
has  been  used  to  overcome  motion  compensation  errors  in 
stepped- frequency  radar  data  [6].  For  radars  with  long  dwell 
time  on  fast-moving,  close-in  targets,  the  effect  of  motion 
compensation  errors  can  be  quite  severe,  resulting  in  cross 
range  smearing  in  the  ISAR  image.  If  the  joint  time- 
frequency  technique  is  applied  to  the  Doppler  frequency 
axis,  an  additional  aspect  dimension  is  generated.  The 
resulting  (range)-(cross  range)-(aspect)  display  allows  us  to 
view  the  instantaneous  ISAR  images  of  the  target  at 
different  aspect  angles.  Since  motion  compensation  errors 
are  reduced  over  smaller  angular  windows,  the  resulting 
sequence  of  images  has  less  cross  range  blurring  problem 
than  the  original  ISAR  image. 


5.  SUMMARY 

Research  in  the  last  several  years  has  shown  that  the  joint 
time-frequency  space  is  an  attractive  feature  space  for 
identifying  target  characteristics  from  electromagnetic 
backscattered  data.  Ongoing  works  are  focused  on  applying 
multiresolution,  superresolution  and  adaptive  resolution 
techniques  to  enhance  the  resolution  in  the  time-frequency 
image  to  facilitate  feature  identification.  Incorporation  of 
joint  time-frequency  processing  to  traditional  ISAR 
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imaging  also  opens  up  a  new  avenue  where  multi-aspect 
data  can  be  properly  utilized.  Once  more  fully  developed, 
these  processing  techniques  will  be  excellent  tools  for 
understanding  the  scattering  phenomenology  from  computed 
or  measured  data.  In  addition,  they  should  find  applications 
in  ultra-wideband  radar  target  identification. 
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ABSTRACT 


application  to  signal  backscattered  by  a  target  containing 
a  small  open  cavity. 


In  this  paper  we  present  a  joint  time-frequency  ISAR 
algorithm.  This  algorithm  allows  the  separation  of  the 
original  ISAR  image  into  two  different  images:  an  ISAR 
image  containing  only  scattering  center  information,  and 
a  frequency-aspect  image  showing  only  resonance 
features  associated  with  the  target.  We  also  show,  as  an 
example,  the  application  of  this  technique  to  signal 
backscattered  by  a  target  containing  a  small  open  cavity. 

1.  INTRODUCTION 

Inverse  synthetic  aperture  radar  (ISAR)  imaging  [1]  has 
long  been  used  in  the  radar  community  for  signature 
diagnostic  and  target  identification  applications.  It  is  a 
simple  and  very  robust  process  for  mapping  the  point 
scatterers  present  in  the  target.  However,  while  simple 
targets  can  be  modeled  as  a  collection  of  point  scatterers, 
real  world  targets  contain  many  non-point  scattering 
mechanisms  such  as  resonances  and  dispersions.  These 
scattering  mechanisms  appear  in  the  ISAR  image  as 
blurred  clouds  which  extend  down-range,  sometimes 
even  obscuring  other  scattering  centers.  On  the  other 
hand,  these  mechanisms  provide  important  features  that 
can  be  used  in  the  target  classification  process,  if 
properly  inteipreted. 

To  address  these  issues,  we  present  a  new  processing 
technique,  the  joint  time  frequency  ISAR.  This  technique 
allows  the  separation  of  the  original  ISAR  image  into  two 
different  images:  one  ISAR  image  containing  only 
scattering  center  information,  and  another  frequency- 
aspect  image  showing  strong  frequency  dependent 
features  associated  with  the  target.  This  separation  is 
accomplished  by  using  the  adaptive  Gaussian 
representation  [2].  In  this  paper  we  give  a  brief 
introduction  to  the  proposed  algorithm  and  show  its 


2.  ISAR  IMAGES  OF  TARGETS  CONTAINING 
NON-POINT  SCATTERING  FEATURES 

For  targets  that  can  be  modeled  as  a  collection  of  ideal 
point  scatterers,  it  is  well  known  that  by  gathering  data  at 
different  frequencies  and  different  aspect  angles  (and 
therefore  at  different  values  of  kx  and  it  is  possible  to 
obtain  an  image  in  the  x-y  plane  that  will  have  peaks  at 
the  locations  of  these  scattering  centers.  This  image  can 
be  easily  obtained  by  using  the  inverse  Fourier  transform, 
that  will  map  the  kx-ky  plane  into  the  x-y  plane. 

However,  real  targets  contain  many  non-point  scattering 
mechanisms,  such  as  resonances  and  other  dispersive 
phenomena,  that  cannot  be  focused  in  the  x-y  plane.  For 
example  if  the  target  contains  a  vary  small  cavity  at  the 
coordinates  Qco.yo),  that  resonates  at  the  frequency  fc,  it 
can  be  shown  that  the  scattered  field  due  to  this 
resonance  can  be  written  as: 


^  a+;27c(/-/o) 

,  _  1%  f  ^ 


(la) 


Under  a  small  aspect  aperture  approximation  this 
resonance  would  be  mapped  in  the  x-y  plane  as: 


s{,x,y)  =  B8(y-  yg  )exp(j2kox) 
exp(-2a  xlc)u(x  -Xq) 


(lb) 


So  we  see  that  this  kind  of  mechanism  tends  to  spread 
over  the  down-range  of  the  image,  making  the  task  of 
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locating  scattering  centers  much  more  difficult.  For 
example.  Fig.  2  shows  the  ISAR  image  of  a  perfecdy 
conducting  plate  containing  a  small  cavity  with  an 
opening,  as  depicted  in  Fig.  1.  The  scattering  data  were 
simulated  numerically  using  a  method  of  moment 
electromagnetics  code.  We  notice  in  the  image  that,  in 
addition  to  the  three  scattering  centers  corresponding  to 
the  two  edges  of  the  plate  and  the  cavity  exterior,  there  is 
also  a  very  strong  return  outside  of  the  target.  This 
return  corresponds  to  the  energy  that  is  coupled  into  the 
cavity,  and  is  re-radiated  towards  the  radar  at  the 
resonance  fitequencies.  This  non-point  scatterer 
contribution  clearly  complicates  the  image  interpretation 
process  since  its  location  in  the  ISAR  image  does  not 
correspond  to  the  physical  location  of  the  scattering.  In 
addition,  it  may  obscure  any  other  scattering  centers 
located  at  these  positions. 

3.  JOINT  TIME-FREQUENCY  ISAR 

From  the  above  discussion  we  conclude  that  it  will  be 
very  useful  to  obtain  a  representation  that  can  allow  us 
not  only  to  locate  the  frequency-independent  scattering 
centers,  but  also  to  identify  the  frequency-dependent 
events  in  the  signal.  This  can  be  accomplished  by  the  use 
of  the  proposed  joint  time-frequency  ISAR  algorithm  that 
combines  the  idea  of  standard  ISAR  with  the  joint  time- 
frequency  representation.  In  our  recent  woik  we  have 
found  the  joint  time-frequency  display  to  be  an  excellent 
tool  for  visualizing  signals  that  contain  scattering 
centers,  resonances  as  well  as  other  dispersive 
mechanisms  [3], [4].  To  obtain  the  desired  joint  time- 
frequency  ISAR  representation,  one  choice  is  to  apply  the 
standard  short-time  Fourier  transform  (S  IFT)  to  each 
horizontal  line  of  the  ISAR  image.  For  a  fixed  y  (cross 
range),  each  horizontal  line  is  a  function  of  x  (range), 
which  is  in  turn  directly  proportional  to  the  time  (for 
small  angular  aperture).  Therefore,  the  application  of  the 
STFT  to  each  line  of  our  image  will  generate  a  3- 
dimensional  matrix  (x-y-f).  Each  (x-y)  slice  of  this 
matrix  is  an  ISAR  image  at  a  particular  frequency.  This 
display  can  thus  show  how  the  ISAR  image  varies  with 
frequency.  Furthermore,  we  can  distinguish  the 
frequency-independent  part  of  the  signal  from  frequency- 
dependent  events  by  viewing  the  slices. 

This  implementation  however  presents  three  major 
problems.  First,  the  frequency  resolution  is  obtained  at 
the  cost  of  lower  range  resolution.  This  is  a  well-known 
property  of  the  STFT.  Second,  it  is  difficult  to 
automatically  distinguish  the  frequency-dependent  events 
from  the  frequency-independent  ones.  Third,  we  have  to 
generate  the  whole  cube  before  any  feature  identification 


can  be  made,  and  that  requires  too  much  memory.  In 
order  to  solve  these  problems  we  adopt  a  different 
algorithm  based  on  the  adaptive  Gaussian  basis  function 
to  process  our  ISAR  image,  as  described  next 

4.  JTF-ISAR  USING  ADAPTIVE  GAUSSIAN  BASIS 
REPRESENTATION 


The  adaptive  spectrogram  is  a  signal  representation 
scheme,  recently  introduced  by  Qian  and  Chen  [2],  that 
uses  adaptive  normalized  Gaussian  as  basis  functions. 
The  objective  of  this  method  is  to  expand  a  signal  s(t)  in 
terms  of  normalized  Gaussian  functions  h  (t)  with  an 

p 

adjustable  standard  deviation  and  a  time-frequency 
center  (tp.fpy. 


p=l 


where 


(2) 


/ip(r)  =  (7caj)'^’^%xp 


(3) 


The  coefficients  Bp  are  found  one  at  a  time  by  an  iterative 
procedure.  One  begins  at  the  stage  p=l  and  chooses  the 
parameters  Gp,  tp  and  fp  such  that  hp(t)  is  most  “similar” 
to  s(t),  that  is: 


=  ^^Op,tpjp\sp_At)hl{t)dt 


(4) 


where  So(t)  =s(t).  For  p  >1,  Sp(t)  is  the  remainder  after  the 
orthogonal  projection  of  Sp.j(t)  onto  hjt)  has  been 
removed  from  the  signal: 


Sp{t)  =  Sp_iit)-Bp{t)hpit) 


(5) 


We  see  from  expression  (3)  that  scattering  centers,  i.e., 
signals  with  very  narrow  length  in  time,  will  be  well 
represented  by  basis  functions  with  very  small  Cf^. 

Frequency  resonances,  on  the  other  hand,  will  be  better 
depicted  by  large  a  .  Therefore  the  parameters  extracted 

'  P 

should  give  us  the  location  and  strength  of  these 
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scattering  mechanisms  with  very  high  resolution.  If  we 
apply  the  above  algorithm  to  each  horizontal  line  of  the 
ISAR  image,  instead  of  using  the  STFT,  we  will  obtain  a 
summation  of  Gaussians  with  different  variances.  Those 
Gaussian  bases  with  small  variances  should  be  related  to 
the  scattering  centers.  Consequently  if  we  reconstruct 
our  ISAR  image  using  only  the  functions  in  this  group, 
we  will  obtain  a  much  “cleaner”  image  that  will  show 
only  the  scattering  centers.  We  can  also  reconstruct  the 
ISAR  image  using  only  the  large  variance  Gaussians, 
those  related  to  the  resonances  and  other  discrete 
frequency  events.  It  is,  however,  more  meaningful  to 
view  these  frequency-dependent  mechanisms  in  the 
Fourier  transform  domain  of  the  ISAR  image.  By  doing 
so,  we  will  obtain  a  frequency-aspect  display  that  will 
allow  us  to  precisely  locate  those  frequencies. 

5.  EXAMPLES 

Fig.  3  shows  the  enhanced  ISAR  image  of  Fig.  2, 
obtained  by  applying  the  above  algorithm  and  keeping 
only  the  small  variance  Gaussians.  Only  the  scattering 
center  part  of  the  original  signal  remains  in  the  image,  as 
expected.  Fig.  4  shows  the  frequency-aspect  display  of 
the  high  variance  Gaussians.  Several  equispaced  vertical 
lines  are  observed.  They  correspond  to  the  resonance 
frequencies  of  the  cavity,  which  should  occur  at  5.30, 
8.39,  11.86  and  13.52  GHz  based  on  the  dimensions  of 
the  cavity.  Indeed,  we  see  that  they  occur  close  to  these 
frequencies  and  are  almost  aspect  independent.  Using  our 
algorithm,  we  have  also  processed  real  measured  data  of 
a  scaled  model  airplane  taken  inside  a  radar  chamber  and 
results  will  be  presented. 

6.  CONCLUSIONS 

In  this  paper  we  presented  a  new  joint  time-frequency 
ISAR  image  processing  algorithm,  that  allows  the 
analysis  of  multi-aspect  multi-frequency  scattered  signals 
containing  not  only  scattering  centers  but  also  other 
frequency  dependent  mechanisms.  This  technique  allows 
the  separation  of  the  original  ISAR  image  into  two 
different  images:  one  enhanced  ISAR  image  with  only 
scattering  center  features,  and  another  frequency-aspect 
image  showing  only  strong  frequency  dependent  features. 
This  is  accomplished  without  any  loss  in  resolution. 
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ABSTRACT 

Inverse  synthetic  aperture  radar  (ISAR)  uses  target’s 
motion  to  generate  images  on  the  range-Doppler 
plane.  In  the  conventional  ISAR  system,  the 
Doppler  spectra  are  computed  by  the  Fourier 
transform.  Due  to  the  target  irregular  translational 
and  rotational  motion,  the  Doppler  frequencies  change 
over  the  time.  Consequently,  the  Fourier  transform 
based  ISAR  image  may  become  blurred.  To 
overcome  this  problem,  the  joint  time-frequency 
transform  is  introduced  to  estimate  the  Doppler 
spectra.  Because  the  joint  time-frequency  transform  is 
able  to  resolve  the  instantaneous  Doppler  frequencies, 
the  resolution  of  the  resulting  ISAR  images  are 
significantly  improved.  Moreover,  the  proposed 
ISAR  system  is  not  only  effective,  but  also  amenable 
to  hardware  implementation 

1,  INTRODUCTION 

Inverse  synthetic  aperture  radar  (ISAR)  is  an  imaging 
radar,  which  uses  target's  pitch,  roll  and  yaw  motions 
to  generate  image  on  range-Doppler  plane.  Figure  1 
is  a  block  diagram  of  the  conventional  stepped- 
frequency  ISAR.  As  shown,  the  stepped-frequency 
ISAR  first  transmits  a  sequence  of  N  bursts.  Each 
burst  consists  of  M  narrow  frequency  band  pulses. 
The  center  frequencies  of  successive  M  pulses  are 
increased  by  a  constant  frequency  step.  When  the 
target  rotational  and  translational  motion  is  mainly 
corrected,  the  received  frequency  signatures  of  bursts 
can  be  treated  as  the  time  history  of  the  target 
reflectivity  at  discrete  frequencies.  Taking  M-point 
inverse  DFT  for  each  received  frequency  signature  will 
naturally  yield  N  range  profiles,  each  containing  M 
range  cells.  The  samples  taking  at  the  same  range 
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cell  for  the  N  range  profiles  constitute  a  time  history 
series.  The  DFT  of  the  time  history  series  gives  Ap¬ 
point  Doppler  spectrum.  Finally,  by  combining  the 
M  Doppler  spectra  at  M  range  cells,  the  M-by-A^ 
ISAR  image  is  formed.  The  range  resolution  of  the 
stepped-frequency  ISAR  is  determined  by  the  burst 
bandwidth  (M  times  the  constant  frequency  step). 
The  Doppler  frequency  (or  cross-range)  resolution  is 
determined  by  the  observation  time  (number  of  bursts 
N ).  Longer  observation  time  provides  better  Doppler 
resolution  and  better  signal-to-noise  ratio,  but  causes 
more  range  and  phase  errors. 

It  is  generally  believed  that  the  Doppler 
frequencies  change  over  the  time.  The  instantaneous 
Doppler  frequency /x)(0  is  the  function  of  target  radial 
velocity  scattered  rotation  rate  I2(^),  and 

scattered  distance  from  the  rotation  center  rXt)  as 
follows 

/^(O  =  ^6(0  X  (t)  (0  (1) 

c  c 

where  fo  and  c  denote  the  radar  carrier  frequency  and 
wave  propagation  velocity,  respectively.  Using  the 
DFT  to  compute  the  Doppler  spectra,  we  will  obtain 
the  global  spectra  rather  than  the  instantaneous 
frequency  /^(O-  Consequently,  the  image  will  be 
blurred  when  target  radial  velocity  v^t),  scattered 
rotation  rate  12(0 »  or  scattered  distance  from  the 
rotation  center  r^t)  change  fast. 

To  overcome  this  problem,  we  employ  the  joint 
time-frequency  transform  to  compute  the  Doppler 
spectra.  Following  a  brief  discussion  of  the  joint 
time-frequency  transform,  numerical  simulations  are 
presented  to  demonstrate  the  effectiveness  of  the 
technique  introduced  in  this  paper. 
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2.  TIME-FREQUENCY  DISTRIBUTION 
SERIES 


For  signal  s(t),  the  Fourier  power  spectrum  can 
computed  by 


fS(a))p  =  |J  sit)  exp  {-jox}dt 


be 


=  J  jR(t)  exp  f- j(x)T}dT 


(2) 


Doppler  spectra  dramatically  change  over  a  short 
period. 

If  making  the  auto-correlation  function  time- 
dependent,  for  instance, 

Rit,T)  =  sit+^)s*it-j)  (3) 

and  inserting  it  into  eq.(2),  then  we  obtain  a  time- 
dependent  spectrum 


Figure  1.  The  Conventional  Stepped-Frequency  ISAR  Imaging  System 


where  jR(t)  is  called  the  auto-correlation  function 
given  by 

jR!(t)  =  J s(t)s{t-  X)dt 

which  is  time-independent.  Although  the  power 
spectrum  depicts  how  signal’s  energy  distributes  in 
the  frequency  domain,  it  does  not  give  any  indications 
how  signal’s  frequency  contents  change  over  the  time. 
Therefore,  the  image  may  be  blurred  when  the 


WVD{t,0))  = 

j  s(t  +  ^)s*it-^)  exp  {-j(av}dt 

(4) 

which  is  named  the  Wigner-Ville  distribution  (WVD). 
It  is  well  known  that  the  WVD  better  characterizes 
time-dependent  spectra  than  any  other  algorithms 
known  so  far.  Without  loss  of  the  generality,  let’s 
assume  signal  s(t)^A(t)exp{j<l)(t)},  where  A(0  and  0(0 
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denote  the  amplitude  and  phase,  respectively.  Then, 
0X0  is  traditionally  considered  as  the  instantaneous 
frequency.  We  can  show  that  the  mean  frequency  of 
the  WVD  at  any  time  instant  t  is  always  equal  to 
signal’s  instantaneous  frequency,  i.e.. 


V«(0= 

exp  +jnClt} 

2(7 

(7) 

Taking  the  WVD  with  respect  to  eq.(6)  yields 

WD(t,®)  =  5;SC.,.C*.,..  WVD,  At.<0) 

m,n  m',n' 


The  main  problem  of  the  WVD  is  the  so-called 
the  cross-term  interference.  When  the  analyzed  signal 
has  more  than  one  frequency  tone  at  time  instant  t, 
spurious  peaks  will  appear  in  the  middle  of  each  pair 
of  frequency  tones.  However,  it  is  interesting  to  note 
that: 

•  Many  useful  properties  possessed  by  the  WVD, 
such  as  eq.(5),  are  determined  by  averaging  the 
WVD; 

•  The  cross-term  interference  is  highly  oscillated 
and  localized; 

These  observations  suggest  that  if  the  WVD  is 


where 


and 


m,n  m',n’ 

exp{-<7"^(f  a^{(0  -  (o^f) 

t\^{-j{pT(o-qQ.t)} 

(8) 


m  +  m'  n  +  n' 


p=m—m\ q=n—ri 


considered  as  the  sum  of  harmonic  series,  than  we  can 
discard  the  high  harmonics  and  use  the  lower  order 
terms  to  well  delineate  the  time-dependent  spectrum. 
This  is  because  that  the  high  harmonic  terms  has 
smaller  average. 


Eq.(8)  decomposes  the  WVD  as  the  linear 
combination  of  localized  two-dimensional  harmonics 
that  is  named  the  time-frequency  distribution  series 

(TFDS). 


Hence,  they  have 
limited  influence  to 
the  useful 

properties.  On  the 
other  hand,  they  are 
closely  related  to 
the  cross  term 
interference.  The 
resulting 

presentation  is 
named  the  time- 
frequency 

distribution  series 
(TFDS)  (also 
known  as  the  Gabor 


When  all  terms  are 
included,  the  TFDS 
manifestly  converges  to 
the  WVD  and  thereby  it 
holds  all  the  properties 
of  WVD.  If  we  only 
retain  the  terms  for 
m=m’andn=w’,  then  we 
obtain  the  positive 
representation  that  is 
similar  to  the 
spectrogram.  Because 
the  useful  properties  are 
mainly  determined  by 
the  low  harmonics,  one 


spectrogram). 

To  compute 

the  TFDS,  the  analyzed  signal  is  first  decomposed  as 
the  orthogonal-like  Gabor  expansion,  i.e., 


can  selectively  remove 
the  high  order  harmonic 
to  well  characterize  time-dependent  spectrum  with 
limited  cross-term  interference. 


*5(0  =  X  E  (6)  3.  TFDS  BASED  ISAR  SYSTEM 

Figure  2  depicts  the  part  of  the  proposed  ISAR 
where  the  Gabor  basis  function  are  time-  and  imaging  scheme.  Note,  the  only  difference  of  the  new 

frequency-shifted  Gaussian  functions,  traditional  ISAR  systems  is  that  the  Doppler 

frequencies  in  the  proposed  ISAR  are  computed  by 
TFDS  rather  than  by  DFT.  Since  the  Doppler 
spectrum  is  computed  by  the  TFDS,  one  image  frame 


in  the  traditional  ISAR  system  is  now  decomposed 
into  several  instantaneous  frames. 

4.  SIMULATION  AND  CONCLUSION 

Extensive  testing,  on  both  the  simulated  and  the  real 
data,  has  been  conducted  in  Naval  Research 
Laboratory.  All  simulations  indicate  that  the  new 
scheme  significantly  improves  the  traditional  ISAR 
performance.  Figure  3  compares  the  simulated  image 
of  MIG-25  obtained  by  the  traditional  ISAR  and  one 
of  the  multiple  frames  generated  by  the  new  ISAR. 

While  the  DFT  based  ISAR  image  is  blurred  due 
to  the  aircraft  fast  rotating,  a  clear  aircraft  image  is 
observed  by  TFDS  based  ISAR.  As  far  as  the  authors 
are  aware,  such  result  has  not  been  achieved  by  any 
currently  used  methods.  Moreover,  it  is  worth  to 
notice  that  the  only  difference  between  the  new  ISAR 
and  the  traditional  ISAR  is  that  the  TFDS  replaces 
the  DFT.  Therefore,  the  traditional  ISAR  can  be 
easily  up-dated  without  significantly  altering  the 
existent  hardware  structure. 

Because  of  the  effectiveness  and  simplicity  of 
this  new  ISAR  scheme,  we  believe  that  the  technique 
discussed  in  this  paper  will  strongly  impact  to  future 
ISAR  processor  design. 
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Abstract  -  In  personal  communications  systems,  users  con¬ 
tend  for  the  resources  of  frequency  and  time,  with  re-use 
determined  by  spatial  separation,  power  allocation,  antenna 
beam  patterns,  data  rate,  and  the  required  signal  to  interfer¬ 
ence  ratio  for  reliable  operation.  We  describe  a  highly  adap¬ 
tive  transceiver  which  implements  distributed  control  of 
power,  channel  assignments,  and  antenna  weights,  to  maxi¬ 
mize  network  throughput  That  is,  each  link  is  optimized 
independently,  with  coupling  only  via  the  mutual  interfer¬ 
ence. 

I.  Introduction 

A  high-performance  experimental  radio  transceiver  is 
under  development  at  UCLA.  It  will  include  frequency  hop¬ 
ping,  variable  bit  and  power  allocation,  channel  coding, 
adaptive  equalization,  coherent  M-QAM  signaling,  rapid 
channel  probing,  and  adaptive  transmitter  and  receiver 
antenna  arrays.  It  will  achieve  data  rates  between  64  kb/s  to 
64  Mb/s,  operating  in  the  2.4  GHz  ISM  band.  A  block  dia¬ 
gram  of  the  system  architecture  is  given  in  Figure  1.  The  tar¬ 
get  application  is  communication  among  notebook 
computers,  however,  our  principal  objective  is  to  develop  a 
highly  adaptive  radio  that  enables  experimentation  with  a 
wide  variety  of  multimedia  network  control  protocols.  To 
this  end,  a  combination  of  TDMA,  FDMA,  and  frequency 
hopped  CDMA  will  be  supported.  Most  control  functions  for 
the  link  layer  will  be  distributed,  requiring  at  most  feedback 
between  a  transmitter  and  its  intended  receiver,  so  that  dis¬ 
tributed  network  control  protocols  may  also  be  supported. 
Each  link  independently  attempts  to  achieve  the  highest  pos¬ 
sible  signal  to  interference  ratio  (SIR).  One  of  the  challenges 
is  to  design  a  set  of  adaptive  algorithms  that  will  interact  in  a 
stable  fashion,  while  increasing  the  robustness  and  through¬ 
put  of  the  netwoik.  In  the  following  sections,  we  outline  the 
major  adaptive  subsystems, 

n.  Dyanmic  Power  and  Channel  Allocation 

In  a  radio  network,  frequency  and  time  slots  (channels) 
may  be  re-used  at  some  distance  due  to  propagation  losses. 
Conventional  designs  re-use  frequencies  every  four  to  seven 
cells  in  TDMA  and  FDMA  systems.  Such  fixed  assignments 
are  wasteful  of  channel  resources  since  they  must  be 
assigned  based  on  90%  or  99%  worst  case  channel  condi¬ 
tions.  As  channel  conditions  vary  widely  due  to  shadowing, 

1.  Supported  by  ARPA  contract  JFBI94-222/J4C942220 


this  results  in  conservative  re-use  distances.  In  dynamic 
power  and  channel  allocation  (DPCA)  algorithms,  channels 
and  transmitter  powers  are  assigned  to  users  so  that  all  mem¬ 
bers  of  the  network  meet  their  own  SIR  requirement  for  reli¬ 
able  communication  [e.g.  1-3],  Every  channel  is  potentially 
available  in  every  cell,  with  contention  for  the  channels 
resolved  using  information  on  the  mutual  interference. 

The  basic  mathematical  statement  of  the  problem  is  as 
follows.  Let  Gjj  be  the  propagation  gain  between  the  j^  base 
station  and  the  mobile  to  which  the  i^  base  station  is  com¬ 
municating  directly  for  some  time  slot  Let  Pj  be  the  power 
transmitted  by  the  user  belonging  to  base  station  i.  If  PpO, 
there  is  no  mobile  communicating  with  base  station  i  in  the 
slot.  The  SIR  for  basestation  i  is  then  given  by 


For  reliable  operation,  we  require  Rj  >  y,  for  all  basestations. 
This  set  of  conditions  is  equivalent  to  the  following  set  of 
linear  inequalities 

G,7/’,-Y2:c,/y>0 

where  we  must  find  whether  there  exists  a  set  of  positive 
powers  {Pj}  for  which  all  the  inequalities  are  satisfied.  It 
may  be  shown  that  this  problem  is  equivalent  to  solving  the 
matrix  equation  Gn=l,  where  n  is  the  vector  of  power 
assignments,  Gij  =  Gij,  and  Gy  =  -'>Gij,  with  appropriate  re¬ 
numbering  of  entries  if  not  all  basestations  are  active.  Then 
if  every  component  of  the  vector  11  is  positive,  there  is  a  fea¬ 
sible  assignment.  Moreover,  the  vector  of  powers  is  the  best 
one  to  use  apart  firom  a  scaling  factor  determined  by  the 
background  noise,  in  the  sense  that  it  lies  in  the  center  of  the 
region  of  feasible  power  assignments,  and  thus  is  least  sensi¬ 
tive  to  perturbations. 

To  apply  this  single-channel  algorithm  to  a  multi-chan¬ 
nel  environment,  we  must  in  the  worst  case  check  among  all 
possible  combinations  of  users  and  find  the  assignment  that 
leads  to  the  largest  number  of  users  being  admitted.  How¬ 
ever,  we  may  instead  employ  any  algorithm  developed  for 
performing  dynamic  channel  assignment  [e.g.,  4],  replacing 
the  channel  compatibility  calculations  of  these  algorithms 
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with  solution  of  the  system  of  equations  above. 

In  the  above,  it  was  implicitly  assumed  that  propagation 
data  was  reliably  obtained  at  some  central  site.  This  is  diffi¬ 
cult  to  arrange,  but  fortunately  unnecessary.  In  [5]  a  fully 
distributed  form  of  the  algorithm  is  presented,  in  which  the 
only  information  exchange  required  is  that  between  a  mobile 
and  its  own  base  station,  revealing  their  measurements  of  the 
interference  power  and  the  propagation  loss  in  that  single 
link.  In  the  absence  of  dynamic  range  limitations  on  the 
power  control,  the  algorithm  not  only  converges  to  the  opti¬ 
mum  power  vector  if  a  feasible  solution  exists,  but  it  also 
rejects  new  users  in  the  absence  of  a  feasible  solution. 
Dynamic  power  range  limitations  can  be  accommodated  by 
either  limited  centralized  network  control  operations,  or 
through  voluntary  termination  of  admission  attempts  when 
the  resistance  of  other  users  is  apparent  from  rapidly  increas¬ 
ing  interference  levels  [6]. 

m.  Channel  Probing 

In  distributed  DPCA,  each  user  increases  or  decreases 
its  power  by  one  step  depending  on  whether  the  SIR  is 
acceptable.  Other  users  will  react  to  power  increments  by 
increasing  their  power,  which  is  then  observed  in  the  form  of 
increased  interference.  The  incremental  increase  of  interfer¬ 
ence  compared  to  the  incremental  increase  of  transmitted 
power  may  be  viewed  as  the  “resistance”  of  the  channel  to 
the  entry  of  new  user.  The  resistance  of  the  channel  together 
with  the  propagation  loss  on  one’s  own  link  are  powerful 
indicators  of  whether  or  not  a  feasible  power  assignment 
exists.  A  relatively  small  number  of  iterations  of  the  algo¬ 
rithm  at  low  power  levels  will  yield  a  prediction  of  the  maxi¬ 
mum  attainable  SIR  [.];  we  refer  to  this  as  channel  probing. 
We  find  that  in  cellular  systems  there  is  usually  one  domi¬ 
nant  interferer.  This  case  may  be  analyzed  to  yield  a  particu¬ 
larly  simple  rule  for  estimating  the  final  SIR.  Probing  in  this 
fashion  permits  more  rapid  evaluation  of  multiple  channels, 
and  reduces  the  disturbance  of  already  active  users  by  new 
users  attempting  to  gain  entry  into  the  network. 

IV.  DPCA  and  Frequency  Hopped  Systems 

The  proposed  transceiver  is  a  frequency  hopped  system. 
There  are  many  options  for  the  application  of  DPCA,  since 
each  user  accesses  many  channels.  We  may  probe  to  predict 
the  final  SIR  in  each  and  assign  bits  and  power  to  maximize 
the  throughput  for  this  expected  SIR  distribution.  Channel 
coding  with  interleaving  across  the  frequency  slots  could 
then  serve  to  realize  the  frequency  diversity,  provide  coding 
gain,  and  some  smoothing  of  small  SIR  estimation  errors. 
We  may  arrange  the  frequency  hops  to  be  synchronous 
among  cells,  so  that  the  same  set  of  interferers  is  encoun¬ 
tered  on  each  hop.  DPCA  then  reduces  to  the  single  channel 


form.  A  second  option  is  to  randomize  the  hopping  patterns 
among  the  different  cells.  A  combination  of  bit  allocation 
and  coding  then  produces  a  hybrid  mix  of  interference  aver¬ 
aging  and  interference  avoidance,  since  we  may  choose  not 
to  allocate  any  power  to  those  hops  with  large  resistance. 
Simulation  results  for  the  simpler  case  of  choosing  M  out  of 
N  hops  to  allocate  an  equal  number  of  bits  reveals  that  net¬ 
work  throughput  is  very  similar  to  the  first  option.  However, 
this  procedure  is  more  robust  with  respect  to  channel  varia¬ 
tions  since  the  set  of  channels  occupied  can  be  slowly 
changed,  with  the  effect  on  any  other  being  small  since  there 
is  mutual  interference  in  only  one  hop.  Moreover,  since  hop¬ 
ping  patterns  are  randomized  among  cells,  each  pattern 
encounters  essentially  the  same  ensemble  of  interference. 
Thus,  choosing  a  new  pattern  is  unlikely  to  lead  to  signifi¬ 
cantly  better  performance,  and  so  we  avoid  the  need  to  re¬ 
probe  other  patterns.  This  provides  more  tolerance  to  mobil¬ 
ity  for  distributed  algorithms  than  the  first  method. 

For  maximum  flexibility,  we  will  actually  implement 
different  bit  assignments,  i.e.,  different  QAM  constellations 
in  different  hops.  The  digital  portion  of  the  modem  will  sup¬ 
port  up  to  256  QAM.  The  bit  assignment  problem  for  a  fre¬ 
quency  hopped  system  is  very  similar  to  that  of  discrete 
multitone  transmission,  except  that  we  must  probe  to  predict 
the  final  SIR  for  each  hop,  rather  than  use  the  value  under  the 
current  power  assignment.  On  all  hops  which  get  a  non-zero 
number  of  bits,  roughly  the  same  power  is  assigned.  We 
make  a  preliminary  assignment  of  bits  using  the  results  of 
probing  so  that  the  transmitter  power  is  minimized,  and 
adjust  as  channel  conditions  change.  In  this  case,  the  role  of 
channel  coding  over  the  hops  is  not  to  realize  frequency 
diversity  but  rather  to  provide  some  coding  gain  and  smooth 
out  errors  that  have  been  made  in  estimating  the  interference. 

Finally,  certain  geographically  disadvantaged  users  will 
require  high  transmission  power,  and  in  effect  impose  local 
frequency  re-use  restrictions  in  the  channels  they  occupy 
comparable  to  what  would  be  required  in  fixed  channel 
assignments.  These  users  couple  several  cells  together,  slow¬ 
ing  convergence  of  distributed  algorithms.  One  possible 
solution  is  for  users  with  high  transmitter  power  to  drop  to 
fixed  assignment  channels  voluntarily,  re-probing  the  chan¬ 
nels  with  DPCA  periodically  to  determine  if  they  may  be  re¬ 
admitted  with  low  power.  We  are  presently  investigating  the 
dynamics  of  both  this  hybrid  approach  and  a  pure  DPCA 
system. 

V.  Adaptive  Antenna  Array  and  Equalizer 

Antenna  arrays  may  also  be  used  to  suppress  multipath 
and  reduce  interference  [7].  We  propose  to  adapt  both  trans¬ 
mitter  and  receiver  arrays  using  least  squares  techniques. 
Switching  between  sets  of  fixed  beam  patterns  is  not  feasible 
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for  indoor  systems,  since  we  must  gain  some  compromise 
benefit  between  diversity  combining  and  interference  cancel¬ 
lation,  and  the  multipath  has  a  very  wide  angular  spread. 
Additionally,  the  human  body  interacts  with  the  terminal  to 
change  the  beam  pattern. 

The  most  interesting  interaction  is  that  between  different 
pairs  of  communicating  users.  As  the  transmitter  pattern  of 
one  array  changes,  so  do  the  receiver  patterns  for  all  users  in 
the  vicinity.  This  in  turn  affects  their  transmitter  patterns,  as 
the  latter  may  only  be  adapted  based  on  the  received  signals. 
The  antenna  patterns  must  also  react  to  changes  in  the  power 
levels  and/or  channel  assignments  of  the  other  users  in  the 
network.  This  is  similar  to  the  types  of  problems  encoun¬ 
tered  in  devising  distributed  DPCA  schemes.  Here,  the  trans¬ 
mitter  and  receiver  beam  patterns  affect  the  gains  G  for  both 
desired  and  interfering  links.  Due  to  the  interference  cou¬ 
pling,  there  is  the  possibility  that  the  independently  adapting 
antenna  arrays  will  not  converge. 

We  have  investigated  the  dynamics  of  an  adaptive 
antenna  array  with  a  variety  of  equalizer  and  transmitter 
adaptation  options,  with  the  happy  conclusion  that  the  ordi¬ 
nary  LMS  algorithm  should  be  adequate.  The  imposition  of 
orthogonality  among  channels  within  a  cell  together  with  the 
minimum  SIR  requirement  for  links  to  be  declared  feasible 
serve  to  decouple  the  users.  With  this  requirement  satisfied, 
the  beam  pattern  of  the  partner  on  the  desired  link  is  the  most 
important  component  of  the  signal  to  interference  ratio.  On 
the  contrary,  when  the  same  lErequency/time  slots  are  shared 
within  a  cell,  there  often  arise  deadlock  conditions.  In  this 
situation,  it  would  be  wise  to  use  omnidirectional  transmis¬ 
sion  and  adapt  only  the  receiver  weights,  even  though  when 
transmitters  can  be  adapted  significant  extra  gain  is  avail¬ 
able.  The  antenna  arrays  should  be  adapted  on  a  time  scale 
faster  than  power  control,  since  the  antenna  gain  aflfects  the 
perceived  path  gains  between  users.  Following  antenna 
adaptation,  the  radios  can  then  probe  to  determine  the  bit 
allocation  and  final  transmitted  power  levels. 

The  general  problem  of  finding  the  best  allocation  of 
weights  for  transmission  and  reception  in  a  laige  network 
with  multipath  resists  closed  form  solution,  and  would  in  any 
case  require  (unavailable)  reliable  channel  information  and 
considerable  cooperation  among  users.  We  have  instead  cho¬ 
sen  to  adapt  receiver  weights  using  a  least  squares  algorithm, 
and  then  use  these  weights  in  transmission.  A  small  number 
of  iterations  appears  to  give  very  good  results,  for  typical 
indoor  systems  simulations  providing  a  gain  over  omnidirec¬ 
tional  transmisison  of  9  dB  or  more  for  a  seven  element 
array. 

A  variety  of  equalizer  structures  were  investigated  in 
combination  with  the  antenna  arrays.  With  perfect  channel 


knowledge,  the  best  structure  to  use  was  found  to  be  a  multi¬ 
tap  combiner,  which  consists  of  an  (adaptive)  FIR  filter  on 
each  antenna  branch  followed  by  a  summing  junction.  How¬ 
ever,  with  LMS  adaptation,  this  structure  showed  little  per¬ 
formance  improvement  over  using  no  equalizer  at  all,  and 
we  always  found  that  under  a  constraint  on  the  number  of 
adaptive  elements  we  were  better  off  trading  in  the  direction 
of  increased  antenna  elements.  The  array  already  suppresses 
some  undesired  multipath  components  by  placing  nulls  in 
certain  directions,  and  thus  already  provides  some  equaliza¬ 
tion.  Given  the  transmitter  array  adaptation  strategy  outlined 
above,  we  have  decided  to  follow  our  antenna  array  with  a 
DFE  at  baseband,  which  will  be  adapted  on  a  slower  time 
scale  than  the  adaptive  antenna  array.  This  will  serve  the 
function  of  cleaning  up  residual  ISI,  as  may  be  required  for 
using  large  QAM  constellations,  or  occasional  pathological 
channel  conditions. 

VI.  Adaptive  Symbol  Rates 

The  three  orders  of  magnitude  range  of  data  rates  will  be 
achieved  through  a  combination  of  three  different  symbols 
rates,  variable  constellation  size,  and  TDMA  subdivision  of 
frequency  slots.  The  number  of  symbol  rates  must  be  limited 
since  each  implies  the  need  for  a  separate  bandpass  filter  at 
IF.  Thus  we  have  tentatively  decided  upon  symbol  rates  of 
64  kbaud,  1  Mbaud,  and  16  Mbaud.  The  high  symbol  rates 
could  be  used  when  line  of  sight  (LOS)  conditions  exist  in 
relatively  short  range  transmission,  while  the  lower  symbol 
rates  would  be  used  for  longer  range  transmission  and  for 
users  demanding  a  lower  throughput.  Where  possible,  lower 
baud  rate  transmission  is  favored  over  TDMA  division  of  a 
high  baud  rate  stream  due  to  reduced  equalization  require¬ 
ments  and  increased  resistance  to  noise  and  interference  for  a 
given  power  budget.  The  symbol  rate  would  be  chosen  after 
performing  a  channel  probe;  users  will  not  always  be  able  to 
achieve  the  throughput  they  desire,  but  the  system  will 
attempt  to  give  them  the  maximum  throughput  compatible 
with  the  ongoing  traffic.  Bit  rates  may  on  occasion  have  to 
be  reduced  as  propagation  conditions  change. 

The  coexistence  of  multiple  symbol  rate  streams,  with 
potentially  widely  divergent  maximum  power  spectral  densi¬ 
ties,  brings  some  additional  problems.  The  lower  symbol  rate 
systems  may  cause  extreme  interference  to  the  higher  sym¬ 
bol  rate  systems  if  each  operates  under  the  same  peak  power 
contraint,  since  many  low  rate  interferers  may  share  the 
same  band  of  &*equencies  as  the  high  rate  user.  One  obvious 
solution  is  to  separate  the  different  classes  of  users  into  dif¬ 
ferent  channels  (hopping  patterns)  that  are  consistent  across 
cells.  The  synchronization  requirements  are  easily  main¬ 
tained  with  distributed  algorithms.  A  variety  of  options 
which  permit  more  flexibility  in  channel  assignment  are 
under  investigation. 
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Vn.  Conclusion 
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The  links  in  a  multiple  access  radio  system  are  coupled  via 
their  mutual  interference.  This  and  the  signal  for  the  desired 
link  can  be  used  to  control  essentially  all  adaptive  functions 
of  the  radios,  including  symbol  rate,  bit  allocation,  power 
level,  channel  assignment,  antenna  transmitter  and  receiver 
weights,  and  equalizer  settings.  This  presumes  however  that 
consistent  timing  information  is  distributed  around  the  net¬ 
work,  and  further  that  in  each  cell  orthogonal  frequency 
assignments  are  made.  Otherwise,  adaptation  may  be 
extremely  slow.  Both  functions  require  relatively  little  con¬ 
trol  traffic,  operating  at  the  level  of  frames.  Thus,  with  mini¬ 
mal  centralized  control  a  highly  capable  set  of  adaptive 
algorithms  can  be  enabled.  The  alternative  of  attempting 
optimum  centralized  control  over  all  these  functions,  even  if 
it  were  possible  to  gather  ail  necessary  propagation  informa¬ 
tion,  would  be  computationally  intensive  in  the  extreme  and 
demand  very  high  control  traffic  overhead.  A  fully  asyncrho- 
nous  architecture  would  suffer  from  very  slow  adaptation, 
and  would  therefore  be  less  able  to  cope  with  variable  propa¬ 
gation  and  traffic  conditions.  Thus,  some  compromise 
between  these  extremes  is  required.  We  have  opted  for  algo¬ 
rithms  which  we  believe  may  operate  with  close  to  the  mini¬ 
mum  degree  of  centralized  control  required  for  robust 
operation. 
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ABSTRACT 

The  advantages  of  homodyne  receivers  and  the  problems 
necessary  to  overcome  in  order  to  realize  the  lequiied 
performance  have  been  reviewed.  Some  examples  of  our 
recent  results  and  homodyne  technologies  are  introduced  as 
applied  to  pagers,  wireless  remote  control  equipment,  and 
radio  systems  using  reflected  signals. 

1.  INTRODUCTION 

Homodyne  receivers  are  often  called  direct  conversion 
receivers^^^^^^  for  mobile  communications  equipment.  They 
are  inherently  suitable  for  circuit  integration  and  small 
equipment  because  they  have  no  IF  circuits.  A  homodyne 
receiver  can  be  regarded  as  a  super  high-speed 
analogue/digital  converter  because  it  can  obtain  the  signal 
for  digital  processing  from  a  propagating  radio  signal.  This 
is  the  ultimately  simplest  radio  receiver,  but  there  are  many 
barriers  to  get  over  and  it  has  not  yet  been  widely  used. 
There  are  some  examples  of  its  use  at  this  moment  with 
reasonable  receiver  characteristics  in  low-power 
consumption  such  as  our  paging  receivers.  In  radio 
communications  systems  using  reflected  signals  or 
TDD(Time  Division  Duplex)  or  simplex  transmitter/ 
receivers,  homodyne  receivers  enable  one  to  use  radio 
oscillator  as  both  a  transmitter  and  a  local  signal  for 
receiver. 

Usually,  this  configuration  has  some  faults  in 
characteristics,  and  can  be  adopted  when  circuit  design  and 
system  design  technologies  for  homodyne  receivers  are 
available  to  compensate  its  performance.  These  homodyne 
technologies  are  readily  available  for  simple  mobile 
equipment.  Here,  the  merits  and  requirements  of  homodyne 
receivers  and  the  recent  results  of  our  developments  and 
technical  features  are  presented. 


FiQ.1  Block  diagram  of  homodvne  receiver 


2,  MERITS  OF  HOMODYNE  RECEIVERS 

A  homodyne  receiver  is  shown  in  Fig.l.  The  reason  for 
features  contributing  to  its  small  size,  simple  configuration, 
and  suitability  for  circuit  integration  in  low-power 
consumption  are  sununarized  as  follows: 

(1)  Simple  filter  configuration 

Since  there  are  no  IF  circuits,  image  suppression  is 
unnecessary,  and  channels  can  be  selected  using  base 
band  channel  filters  suitable  for  circuit  integration.  Not 
only  a  narrow  band  pass  filter  in  the  IF  stage  but  also  a 
band  pass  filter  in  the  RF  stage  may  be  unnecessary 
when  it  has  adequate  AGC,  frequency  band  width  of 
antenna,  and  3rd  order  IM  of  RF  AMP/MIX. 

(2)  Minimum  number  of  generators 

Since  the  receiver  has  a  simple  configuration  and 
needs  only  radio  and  base  band  frequency  sections, 
which  are  theoretically  necessary,  there  is  little 
generation  of  erroneous  signals.  In  the  case  of  a 
homodyne  receiver,  there  is  no  need  for  a  2nd  local 
oscillator  and  its  crystal  element  as  compared  with  a 
double  super  heterodyne  receiver. 

(3)  Suitability  for  multi-channel  reception 

In  the  case  of  a  single  super  heterodyne  receiver,  a 
local  oscillator  is  usually  at  a  fixed  frequency  for 
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reinforcing  channel  filtering  performance,  although  a 
variable  fiequency  synthesizer  can  be  used  as  a  local 
oscillator  in  a  double  super  heterodyne  receiver  or  a 
homodyne  receive. 

These  merits  result  in  a  high-potential  receiver  with 
small  size  and  low  cost 

3,  REQUIREMENTS  AND  TECHNOLOGIES  OF 
HOMODYNE  RECEIVERS 
The  requirements  and  corresponding  technologies  of 
homodyne  receivers  in  low-power  consumption  are  pointed 
out  as  follows  in  comparison  with  general  hetercxlyne 
receivers: 

(1)  The  characteristic  load  to  RF  circuits  is 
increased. 

A  considerable  gain  of  fkont-end  RF  circuits  is 
required  to  increase  the  RF  signal  level  enough  for  the 
equivalent  input  conversion  noise  of  base  band  circuits. 
Although  the  sensitivity  of  the  receiver  can  be  inaeased, 
the  problem  of  distortion  appears  as  the  gain  of  RF 
circuits  increases.  Moreover,  a  spurious  response  of  even 
cx-dCT  intermodulation  distortion  caused  by  undesired  RF 
2  signals  and  2nd  order  harmonics  of  the  local  oscillator 
will  appear  in  the  demodulation  band  width,  although  it 
will  not  appear  in  the  case  of  the  super  heterodyne 
receiver.  Therefore,  low  distortion  RF  AMPs  with  low 
NF,  and  low  distortion  Mixers  with  low  NF,  are 
necessary.  AGC  circuits  are  effective  for  lowering  the 
distortion  in  a  strong  electric  field. 

(2)  Ensuring  high  isolation  is  important. 

The  leakage  of  the  LO  signal  in  the  receiver  obstructs 
nearby  sensitivity  points  in  the  receiver.  Radiation  from 
the  LO  signal  directly  obstructs  sensitivity  suppression 
in  other  receivers  with  almost  the  same  frequency. 
Moreover  large  RF  signal  in  a  strong  electric  field 
obstructs  a  VCO  of  the  LO  just  like  as  an  injection 
lock.  The  required  technologies  include  an  IC  circuit 
structure  for  high  isolation  at  first,  minimization  of 
local  oscillator  power  in  cxder  to  reduce  the  LO  signal 
leakage,  layout  of  circuits,  shield  and  housing  design  of 
equipment 

Since  the  output  signal  of  the  quadrature  mixer  is  a 
base  band  signal,  countermeasures  such  as  low-frequency 
noise  suppression  by  the  voltage  regulator  are  needed 
against  low-frequency  noise. 

(3)  Degradation  of  sensitivity  caused  by  1/f  noise 
should  be  minimized. 

In  directly  converted  base  band  signals,  1/f  noise  from 
zero  frequency  appears.  The  influence  of  base  band  1/f 


noise  at  the  output  of  the  quadrature  mixer  may  be 
reduced  by  arranging  a  high-pass  filter  in  the  channel 
filter,  although  it  loses  the  low  frequency  signal 
information.  This  influence  become  greater  at  the  higher 
C/N  points. 

(4)  Ensuring  sensitivity  band  width  for  higher  data- 
rate  receiving  is  necessary. 

The  sensitivity  band  width  depends  upon  the  cutoff 
frequency  of  the  channel  low-pass  filter  and  the 
performance  of  the  demodulator  in  the  case  of  a  low 
equivalent  modulation  index.  In  particular,  the  adoption 
of  circuit  architecture  that  corresponds  to  maintain  the 
sensitivity  band  width  in  a  low  equivalent  modulation 
index  with  low  power  consumption  is  expected, 

(5)  Rising  time  of  the  demodulator  should  be 
shortened  in  some  cases. 

Demodulation  is  performed  in  lower  frequency  than 
heterodyne  receiver.  Therefore,  a  large  capacitance  is 
essentially  required  for  the  demodulator.  This  results  in  a 
longer  rising  time. 

4,  SOME  EXAMPLES  OF  OUR  RESULTS  AND 
TECHNOLOGIES 

4.1  HOMODYNE  RECEIVING  PAGER 

Paging  receivers  have  an  extreme  demand  for  small  size 
in  mobile  communications  equipment.  The  modulation 
method  is  around  1,200bps  data-rate  FSK,  so  far,  which  is 
an  advantage  to  designing  a  simple  receiver  configuration. 
In  paging  applications  there  are  some  barriers  to  realizing  a 
practical  homodyne  receiver,  although  mainly  het^odyne 
receivers  are  used.  We  developed  an  exclusively  designed 
quadrature  mixer  IC  with  low  distortion  and  high  isolation 
for  homodyne  receivers  and  demodulator  IC  which  adopt  our 
proposed  Frequency  Multiplied  Digital  Phase-Shifting 
Demodulator'^^^'*^  shown  in  Fig.2  having  a  wide  sensitivity 
band  width  using  a  multiplied  signal  in  digital  circuits. 
Several  million  homodyne  receivers  with  the  above 
mentioned  2  ICs,  shown  in  Fig.3,  are  on  the  market 
already. 


PS:DigitalP base  Shifter  SD:Signa I  Detecting  Circuit 
FM:£requency  j^ultiplierDC:^etected  Signal  Combining  circuit 
Fig. 2  Detector  of  proposed  demodulator  for  homodvne  receiver  I 
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Fig. 3  A  photograph  of  the  developed  ICs 

Higher  data-rate  multilevel  FSK  paging  systems  such  as 
ERMES  or  FLEX  are  planned  to  be  widely  use  in  the  next 
generation.  Corresponding  to  such  systems,  the  availability 
of  our  proposed  Quadrature  Frequency  Digital  (QFD) 
Demodulator”  using  a  phase  to  frequency  convertor  in 
digital  circuits,  shown  in  Fig.4,  is  confirmed.  As  shown  in 
Fig.5,  the  sensitivity  around  BER=10'^  point,  before  errcx* 
correction  of  the  receiver  using  a  QFD  demodulato-  suitable 
for  homodyne  configuration,  is  obtained  almost  equivalent 
performance  as  a  conventional  heterodyne  receiver. 


Fig. 4  Block  diagram  of  proposed  receiver  H 


Fig. 5  Experimental  result 

4.2  APPLICATION  TO  TRANSMITTER  /  RECEIVER 
The  RF  oscillator  can  be  used  both  as  a  transmitter  and  a 
local  signal  for  receiver,  especially  in  TDD  (Time  Division 
Duplex)  or  simplex  radio  equipment  using  the  same 
frequency  as  shown  in  Fig.6.  There  is  no  need  for  additional 
frequency  conversion  by  mixers,  except  for  the  homodyne 
quadrature  mixers.  Both  transmitter  and  receiver  aie 


homodyne.  In  this  case,  additional  frequency  covct  range  of 
the  oscillator  is  not  necessary. 

One  of  the  applications  of  this  Configuration  is  wireless 
remote  control  equipment  which  communicates  relatively 
simple  data.  This  equipment  can  use  the  regulatory  applied 
for  specified  low  power  radio  station  in  Japan. 


4.3  RADIO  SYSTEM  USING  REFLECTED  SIGNAL 
Microwave  AVI  (Automatic  Vehicle  Identification) 
systems*^,  especially  those  which  feature  access  areas  around 
10m  and  high-speed  data  communication,  have  become 
somewhat  popular.  One  of  the  examples  of  its  practical  use 
is  management  system  of  train  movement.  Usually,  these 
systems  have  reading  and  writing  function  for  two  way 
communication  using  passive  microwave  cards,  the 
principal  operation  is  as  follows. 

1)  An  interrogator  continuously  sends  microwave  earner 
during  the  reading  operation,  it  receives  a  reflected 
modulated  signal  from  the  microwave  card  and  a  reflected 
carrier  signal  from  other  objects.  Demodulation  is 
performed  from  these  signals  and  information  of 
memorized  data  in  the  microwave  card  can  be  read. 

2)  The  interrogator  sends  an  ASK  signal  during  the  writing 
operation,  and  the  microwave  card  detects  the  ASK  signal 
and  some  information  can  be  written  into  the  card. 

In  these  systems,  basically,  transmitting  and  receiving 
frequency  is  the  same,  and  a  homodyne  receiver  may  be 
used.  When  many  interrogators  are  placed  nearby, 
interference  between  interrogators  disturbs  the 
communication  between  interrogators  and  microwave 
caids,and  degrades  communication  quality.  To  solve  this 
problem,  the  structure  and  main  features  of  our  proposed 
high  interf^ence  immunity  microwave  card  system  is  as 
follows: 

(1)  The  interrogator  has  a  spread  spectrum  function  in  the 
transmitter  and  a  despreading  function  in  the  receiver  is 
performed  using  the  same  signal  as  the  transmitted 
signal. 
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5.  CONCLUSION 

Although  homodyne  receivers  are  not  widely  used  in 
present  mobile  communications  equipment,  this  technology 
is  effective  for  the  simplification  of  radio  communications 
equipment.  Some  examples  of  the  results  and  technologies 
of  our  homodyne  receivers  in  practical  use  were  presented. 
This  technology  is  expected  to  spread  to  various  fields  of 
mobile  communications  equipment  through  continuous 
study. 


Fiq.7A  photograph  of  the  experimental  system 
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Fig. 9  Interference  Immunity  characteristics 

(2)  A  quadrature  homodyne  configuration  is  adopted  for  the 
purpose  of  countermeasure  of  the  demodulation  level 
fluctuations  caused  by  the  phase  relation  between  local 
signal  and  reflected  received  signal. 

Fig.7  shows  the  photograph  of  a  licensed  radio  station^ 
for  such  experiments  and  the  block  diagram  is  shown  in 
Fig.8.  The  oscillator  is  used  as  both  a  transmitter  and  a 
local  signal  for  the  receiver  at  the  same  time. 

To  confirm  the  interference  immunity,  the  simulation  of 
the  relation  between  BER  and  CIR  (carrier  to  interference 
ratio)  is  performed.  Simulation  results  are  shown  in  Fig. 9. 

Millimeter-wave  vehicular  radar  for  a  collision  warning 
or  avoidance  is  another  radio  system  using  reflected  signals. 
In  this  application,  the  homodyne  receiver  configuration  is 
expected  to  be  used  for  simple  and  low  cost  realization, 
which  is  a  principal  target  at  this  moment,  although  the 
heterodyne  configuration  is  also  a  candidate. 


Acknowledgements 

The  authors  gratefully  acknowledge  the  assistance  and 

support  of  their  many  colleagues  at  Information  and 

Communications  Technology  Laboratory,  Matsushita 

Electric  Industrial  Co.,  Ltd.,  and  Personal  Communication 

Division  and  Radio  Telecommunication  Division, 

Matsushita  Communication  Industrial  Co.,  Ltd. 

REFERENCE 

[1]  K.Chiba  and  K.Murota.  Performance  of  direct 
conversion  FSK  receiver.  National  Convention  Record, 
I.E.I.C.E.,  Part  10,  No.2232.,  1987 

[2]  I.A.W.Vance,  B.Eng,  and  M.  Sc.  Fully  integrated  radio 
paging  receiver.  lEE  Proc.,  129,  Pt.F,l,pp.2-6,  Feb. 
1982 

[3]  M.Hasegawa,  M.Ohba,  and  M.Makimoto.  An  FSK 
Quadrature  Demodulator  Using  Digital  Phase-Shifting 
Method  with  a  Doubler.  National  Convention  Record 
I.E.I.C.E,,  Japan,  B-316,  March  1991 

[4]  K.Takahashi,  M.Mimura,  M.Hasegawa,  M.Makimoto, 
and  K.Yokozaki.  A  Direct  Conversion  Receiver 
Utilizing  a  Novel  FSK  demodulator  and  Low  Power- 
Consumption  Quadrature  Mixer.  IEEE  Proc.  VTC, 
pp.910-915,  1992 

[5]  M.Hasegawa,  M.Mimura  and  ■  M.Makimoto.  Direct 
conversion  Receiver  Technology  for  Pagers  and  Small 
Mobile  Communications  Equipment.  Proc. 
International  Workshop  on  Multi-dimensional  Mobile 
Communications  (MDMC),  Japan,  2-2,  pp.31-34,  Nov. 
1994 

[6]  N.Adachi,  Y.Namura,  M.Mimura  and  M.Hasegawa,  A 
study  on  the  configuration  of  AVI  system  with  high 
immunity.  Proc.  I.E.I.C.E.,  Japan,  B-352,  Autumn 
1993 

[7]  N.Adachi,  K.Maeda,  T.Fukagawa  and  M.Hasegawa.  A 
Microwave  Card  System  with  a  High  Interference 
Immunity  utilizing  a  Spread  Spectrum  Method.  Asia 
Pacific  Microwave  Conference,  20-1,  pp.463-466,  1994 


262 


SYSTEM  DESIGN  FOR  CONTIGUOUS  CELL  COVERAGE  IN  HIGH  DENSITY 

MICROCELLULAR  NETWORKS 


Brendan  C.  Jones  and  David  J.  Skellem 

Electronics  Department,  School  of  Maths,  Physics,  Computing  and  Electronics 
Macquarie  University  NSW  2109  Australia 

email :  brendan@mpce . mq . edu . au ,  daves@mpce . mq . edu . au 


ABSTRACT 

This  paper  examines  whether  microcellular  networks  can  be 
designed  to  provide  contiguous  cell  coverage  for  a  given  pro¬ 
portion  of  mobile  terminals.  Simulation  results  using  a  gen¬ 
eral  microcellular  interference  model  show  that  contiguous 
cell  coverage  may  not  be  possible  for  an  acceptable  proportion 
of  mobile  terminals  even  if  the  call  loss  rate  is  low. 


1.  INTRODUCTION 

System  design  methodologies  for  conventional  (large  cell) 
cellular  networks  have  matured  through  a  mix  of  theory  and 
operational  experience  since  the  first  systems  were  developed 
in  the  1970s  [l]-[2],  but  the  applicability  of  these  design  tech¬ 
niques  to  the  microcellular  case  is  questionable  [3]-[6]. 

The  wide  scale  deployment  of  an  extensive,  high  grade,  wire¬ 
less  telephone  system  will  require  engineering  tools  and  tech¬ 
niques  that  allow  rapid  and  accurate  system  design  [7].  The 
fundamental  problem  that  needs  to  be  addressed  is  of  model¬ 
ling  the  end  result  of  multiple  users  propagating  in  a  congested 
area  [7],[8]. 

Some  of  the  factors  which  affect  the  performance  of  micro¬ 
cells  include  adjacent  channel  interference,  the  distribution  of 
user  terminals,  the  close  spacing  of  fixed  stations,  and  spatial 
traffic  variability  [3]-[19]. 

One  of  the  important  system  design  issues  for  microcellular 
networks  is  the  radio  coverage  quality.  Users  will  increas¬ 
ingly  demand  wireline  call  quality  [8]  throughout  the  network 
and  this  will  require  reliable,  contiguous  radio  coverage. 


2.  CELL  COVERAGE  MODEL 

A  general  interference  model  has  been  developed  that  enables 
microcell  coverage  and  cell  radius  distributions  to  be  analysed 
in  terms  of  the  system  design  parameters  [3]-[6]. 

The  model  takes  a  spatial  approach  to  analysing  the  interfer¬ 
ence  effects  on  a  wanted  link  of  an  arbitrary  number  of  inter¬ 
fering  mobile  terminals  communicating  with  an  arbitrary 
number  of  fixed  stations. 


An  important  aspect  of  the  model  is  that  it  incorporates  chan¬ 
nel  spill  interference  from  all  other  users  regardless  of  the 
magnitude  of  the  interference  at  the  source.  Theoretical  ana¬ 
lysis  and  simulation  results  have  demonstrated  that  even  in  the 
absence  of  cochannel  and  immediately  adjacent  channel  inter- 
ferers,  interference  from  other  users  can  be  significant  [6]. 

The  model  can  be  used  to  analyse  cell  coverage  performance 
via  a  parameter  called  the  ‘interference  to  noise  ratio’  or  INR, 
given  the  symbol  rj  [3].  The  INR  is  the  total  interference 
power  at  the  input  to  a  receiver  divided  by  the  receiver  noise 
power  and  describes  the  extent  of  noise  or  interference  domi¬ 
nance  of  the  wanted  link.  Using  this  parameter,  the  maximum 
possible  range  r  of  a  mobile  terminal  from  its  fixed  station  is 
given  by  a  simple  expression  of  the  form  [4]: 


where  rjp  is  the  uplink  INR  (i.e.  the  INR  at  the  fixed  station 
receiver).  The  parameter  ^  is  a  system  constant  for  a 
particular  mobile  technology  and  is  a  function  of  frequency, 
transmitter  power,  cochannel  protection  ratio  and  the  receiver 
noise  floor,  y  is  the  path  loss  exponent  in  a  simple  distance- 
dependent  path  loss  propagation  model. 

In  a  noise  limited  system,  77/7  <^0.1  and  \l{YjF-^V)  ^  1,  thus  the 
maximum  terminal  range  r  is  relatively  stable.  As  a  system 
becomes  interference  limited,  77/r  >  10, 1/(7777+!)  —  I/7777,  and 
the  maximum  terminal  range  r  becomes  sensitive  to  rjf  and  the 
arrangement  of  the  interferers.  The  INR  provides  a  means  of 
computing  cell  radius  distributions  and  thus  the  proportion  of 
mobile  terminals  which  meet  some  cell  coverage  criterion. 

Previous  papers  [5]-[6]  have  examined  the  uplink  INR  and 
cell  radius  statistics  in  cellular  and  microcellular  networks  for 
example  scenarios,  both  through  Monte  Carlo  simulations  and 
theoretical  analysis.  The  next  step  is  to  consider  how  these 
statistics  are  affected  by  varying  the  user  load  and  fixed  station 
separation. 

If  a  microcell  network  is  to  provide  reliable,  contiguous  cover¬ 
age,  then  a  system  design  methodology  needs  to  be  developed 
so  that  the  required  proportion  of  mobile  terminals  continues 
to  meet  coverage  targets  as  the  user  density  increases. 
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3.  CELL  COVERAGE  SIMULATION 


The  microcell  simulation  program  used  in  [4]--[6]  can  be  used 
to  investigate  cell  size  distributions  as  a  function  of  user  den¬ 
sity.  The  program  performs  Monte  Carlo  simulations  in 
accordance  with  the  microcell  interference  model  described  in 
the  previous  section.  The  simulation  provides  call  blocking 
and  dropout  estimates  and  Probability  Density  Functions 
(PDFs)  and  Cumulative  Distribution  Functions  (CDFs)  for 
the  resultant  INR  and  cell  radii  statistics. 

The  simulation  was  loaded  with  the  technical  specifications, 
call  set  up  and  channel  allocation  procedures  for  CT2  (an 
existing  microcellular  system).  In  each  simulation  1 0  000  call 
attempts  were  made  within  a  regular  hexagonal  arrangement 
of  19  CT2  fixed  base  stations,  each  173.2  metres  apart,  so  that 
the  mobile  terminal  range  required  for  contiguous  coverage 
was  100  metres.  A  fixed  number  of  mobile  terminals  were 
placed  in  random  locations  but  with  a  uniform  distribution  in 
each  cell.  Transmitter  power  control  was  not  used. 

Call  attempts  in  the  simulation  proceeded  in  a  random 
sequence.  A  mobile  terminal’s  call  attempt  would  fail  if  it 
didn’t  meet  the  required  S/[N+I]  at  both  the  fixed  and  mobile 
ends  of  the  link.  An  initially  successful  mobile  terminal  could 
also  drop  out  if  the  success  of  other  mobiles  led  to  an  increase 
in  interference,  causing  its  S/[N+I]  to  fall  below  threshold. 
Channel  reassignments  and  retries  were  allowed  in  accord¬ 
ance  with  the  CT2  specifications.  All  calls  were  cleared  and 
the  process  repeated  after  every  terminal  had  completed  a  call 
attempt.  The  propagation  exponent  y  was  set  to  3  for  both  sig¬ 
nal  and  interference. 

Table  1  gives  a  summary  of  the  call  failure  statistics,  uplink 
INR  {rjF)  statistics,  and  contiguous  coverage  results  as  the 
traffic  load  was  increased  from  1  to  4  simultaneous  users  per 
CT2  cell.  The  cumulative  rjf  statistics  for  the  successful  calls 
are  plotted  on  a  lognormal  probability  scale  (which  amplifies 
the  tails  of  the  distribution)  in  figure  1.  The  cumulative  cell 
radius  statistics,  computed  from  the  rjf  CDF  using  equation 
(1),  are  plotted  on  a  log  probability  scale  in  figure  2. 


Table  1  -  Simulation  Results  for  CT2  Network 


PARAMETER 

1  user/ 
cell 

2  users/ 
cell 

3  users/ 
cell 

4  users/ 
cell 

Call  Blocking  (%) 

0.00 

0.63 

1.74 

3.08 

Call  Dropping  (%) 

0.02 

0.51 

1.54 

3.54 

Total  Call  Loss  (%) 

0.02 

1.14 

3.28 

6.62 

Log  Average  rjf 

0.050 

0.375 

0.869 

1.410 

Std  Deviation  log  t/f 

1.180 

0.802  ! 

0.622 

0.527 

Terminals  with  con¬ 
tiguous  coverage  (%) 

95.2 

88.5 

80.9 

74.1 

Figure  1  -  CDF  vs  cell  load 

Figure  1  shows  that  as  the  user  loading  per  cell  increased,  the 
INR  distribution  approached  a  lognormal  distribution,  its  log 
weighted  average  increased,  but  the  standard  deviation 
decreased  (the  dotted  lines  in  figure  1  show  the  lognormal  line 
of  best  fit  to  each  simulated  data).  These  results  are  consistent 
with  the  results  presented  in  [6]  for  interferer  ensembles. 


Figure  2  -  Cell  radius  CDF  vs  cell  load 

The  cell  radius  CDF  (figure  2),  derived  from  the  rjf  CDF  using 
equation  (1),  shows  that  as  the  cell  loading  was  increased,  the 
proportion  of  terminals  enjoying  a  cell  radius  sufficient  to  pro¬ 
vide  contiguous  coverage  (i.e.  100  m)  reduced  from  95%  at  1 
user/cell  to  an  unacceptable  74%  at  4  users/cell,  even  though 
the  total  call  loss  rate  stayed  well  under  10%  (see  table  1). 

The  dotted  lines  in  figure  2  show  the  cell  radius  CDF  based 
upon  the  lognormal  lines  of  best  fit  as  shown  in  figure  1 .  The 
shape  of  these  CDF  curves  varies  substantially  as  a  function  of 
the  mean  and  standard  deviation  of  the  lognormal  estimate. 

The  question  that  needed  to  be  answered  was  whether  the  cov¬ 
erage  with  4  simultaneous  users/cell  could  be  improved  by 
reducing  the  fixed  station  separation.  Say  a  target  is  set  that 
90%  of  terminals  should  have  contiguous  coverage  as  deter¬ 
mined  by  the  fixed  station  layout.  An  iterative  design  strategy 
could  then  involve  determining  the  90%  terminal  range, 
reducing  the  fixed  station  separation  in  accordance  with  this 
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range,  and  performing  the  simulation  again  to  establish 
whether  the  coverage  target  had  been  met. 

In  the  above  simulation,  90%  of  terminals  had  a  cell  radius  of 
84.5  metres  or  more.  The  fixed  station  separation  was  reduced 
so  that  contiguous  coverage  would  have  required  a  cell  radius 
of  84.5  metres,  and  the  simulation  was  performed  again.  Note 
that  this  also  increased  the  user  density  as  a  fixed  number  of 
users  made  call  attempts  in  each  cell. 

This  simulation  indicated  that  90%  of  terminals  had  a  cell 
radius  of  72.7  metres  or  more,  again  insufficient  for  the  cover¬ 
age  target.  The  fixed  station  separation  was  reduced  again  so 
that  contiguous  coverage  would  have  required  a  cell  radius  of 
72.7  metres,  and  the  simulation  repeated.  This  process  was 
continued  and  the  results  are  shown  in  figures  3  and  4. 


Figure  3  -  j/i?  CDF  vs  ceil  spacing 

Figure  3  shows  that  as  the  fixed  stations  were  placed  closer 
and  closer  together,  the  average  uplink  INR  increased 
approximately  in  inverse  proportion  to  the  fixed  station 
separation  but  the  standard  deviation  of  the  INR  samples 
essentially  remained  unchanged. 


Figure  4  -  Cell  radius  CDF  vs  cell  spacing 

This  had  the  effect,  as  can  be  seen  in  figure  4,  of  increasing  the 
spread  of  cell  sizes.  This  only  served  to  compete  against  the 
cell  coverage  improvement  being  attempted  by  this  process. 


Thus  contiguous  coverage  was  not  achieved  simply  by  reduc¬ 
ing  the  fixed  station  separation,  but  it  is  not  clear  from  figure  4 
whether  reducing  the  fixed  station  separation  actually 
improved  or  reduced  the  proportion  of  terminals  with  contigu¬ 
ous  coverage. 


Figure  5  -  Proportion  of  mobile  terminals  with 
contiguous  coverage  vs  cell  spacing 

Figure  5  shows  the  proportion  of  mobile  terminals  which 
achieved  contiguous  coverage  (as  defined  by  the  fixed  station 
layout)  versus  the  fixed  station  separation.  With  4  simulta¬ 
neous  users/ceil,  the  proportion  of  terminals  with  contiguous 
coverage  increased  slightly  as  the  fixed  station  separation  was 
reduced,  but  the  proportion  of  terminals  with  contiguous  cov¬ 
erage  flattened  out  at  approximately  80%  at  very  small  fixed 
station  separations. 

However,  the  total  call  loss  rate  improved  slightly,  from 
around  6.6%  to  5.6%,  as  the  fixed  station  separation  was 
reduced.  The  rates  for  both  call  blocking  and  call  dropping 
reduced  as  part  of  this  reduction  in  total  call  loss. 

Hence  even  with  a  low  call  loss  rate  and  a  very  small  fixed  sta¬ 
tion  separation  contiguous  coverage  was  not  achieved.  It 
would  appear  that  the  coverage  quality  limit  is  a  function  of 
the  user  load,  the  terminal  distribution,  and  the  cumulative 
off-adjacent  channel  spill  from  interfering  users. 

Cochannel  and  immediately  adjacent  channel  interferers  can 
largely  be  discounted  from  causing  this  limit  because  such 
interferers  cause  horizontal  jumps  in  the  INR  and  cell  radius 
CDF  traces.  Examining  figure  4,  such  jumps  only  occur  in  the 
last  0.5%  of  samples,  thus  coverage  for  the  last  10%  of  termi¬ 
nals  would  mostly  be  determined  by  the  effect  of  off-adjacent 
channel  interference. 

Overcoming  a  coverage  limit  may  be  difficult.  Solutions 
could  include  a  hard  limit  on  the  system  capacity  (i.e.  control¬ 
ling  the  number  of  fixed  network  trunks  available  per  cell),  or 
imposing  stricter  limits  on  system  channel  spill  specifications. 
A  better  solution  may  be  to  develop  new  terminal  admission 
controls  that  monitor  and  tailor  the  INR  (and  thus  cell  radius) 
statistics  in  order  to  maintain  some  coverage  target. 
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4.  CONCLUSION 

Cell  sizes  experienced  by  individual  mobile  terminals  in  a  cel¬ 
lular  system  are  a  function  of  the  uplink  “interference  to  noise 
ratio”  or  INR.  The  extent  of  contiguous  coverage,  or  the  pro¬ 
portion  of  terminals  meeting  a  coverage  target  in  a  cellular  or 
microcellular  system,  can  be  deduced  from  the  INR  statistics. 

Simulations  have  shown  that  the  proportion  of  mobile  termi¬ 
nals  meeting  a  contiguous  coverage  requirement  in  a  micro¬ 
cell  environment  decreases  as  the  per-cell  load  increases. 
Further,  when  the  proportion  of  terminals  with  contiguous 
coverage  drops  below  the  target  level,  reducing  the  fixed  sta¬ 
tion  separation  improves  the  proportion  only  slightly. 

Hence  contiguous  cell  coverage  may  not  be  possible  for  an 
acceptable  proportion  of  mobile  terminals  in  a  microcell  sys¬ 
tem  even  if  the  call  loss  rate  is  low.  This  coverage  limit 
appears  to  be  a  function  of  the  accumulated  interference  from 
off-adjacent  channel  spills  of  other  users. 

These  results  suggest  that  a  system-wide  microcellular  design 
methodology  will  need  to  address  the  INR  statistics  in  a  sys¬ 
tem,  and  possibly  tailor  them  through  terminal  admission  con¬ 
trols,  if  contiguous  coverage  is  required  for  a  certain  propor¬ 
tion  of  mobile  terminals. 
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ABSTRACT 

The  problem  of  receiver  collisions  in  multichannel 
multiaccess  communication  systems  is  studied  in  this 
paper.  Analytic  solution  is  developed  for  throughput 
performance  assuming  receiver  buffer  with  capacity  of 
one  packet.  We  calculate  the  average  rejection 
probability  at  destination  of  a  packet  in  order  to 
estimate  the  effect  of  receiver  collisions.  The 
evaluations  are  carried  out  for  Multichannel  Slotted 
Aloha-type  protocols  with  Poisson  ajTivals  and  finite 
population  and  examine  the  throughput  degradation 
due  to  receiver  collisions.  Also  numerical  results  are 
showing  the  throughput  reduction  as  it  compared  with 
the  protocol  case  without  receiver  collisions. 

L  SYSTEM  MODEL  AND  ASSUMPTIONS 

A  multichannel  multiaccess  communication  system 
consisting  of  v  parallel  broadcast  channels  all  of  the 
same  capacity  is  considered.  A  finite  number.  M.  of 
stations  each  one  connected  by  means  of  separate 
interface  to  every  channel  of  the  system  is  assumed. 
The  time  axis  is  considered  to  be  divided  into  slots  of 
equal  length.  Each  station  has  access  to  all  channels, 
i.e.  it  can  transmit  and/or  receive  constant  length 
packets  that  fit  to  slot  size.  The  round  trip  propagation 
delay  is  small  enough  (i.e.  less  than  packet 
transmission  time  the  slot  duration).  The  set  of  rules 
that  the  proposed  protocol  implies  for  the  stations  in 
the  multichannel  system  are  as  follows: 

1)  Every  station  is  equipment  with  a  receiver 
buffer  and  a  transmitter  buffer  each  one  with  capacity 
of  one  packet.  If  the  transmitter  buffer  is  empty,  the 
station  is  said  to  be  free,  otherwise,  it  is  harklog^^ecL 
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Packets  are  collectively  generated  in  a  Poisson  stream. 
If  a  station  is  backlogged  and  generates  a  new  packet, 
the  packet  is  lost  and  never  returns.  We  assume  a 
uniform  distribution  of  retransmisions  from 
backlogged  stations.  We  also  assume  that  the  total 
offered  traffic  from  new  generated  and  retransmitted 
packets  obeys  to  Poisson  statistics. 

2)  We  assume  that  each  packet  has  a  source  and  a 
destination  address  information.  A  station  ready  to 
(re)transmit  selects  randomly  one  among  the  v 
(2<v<M)  channels  at  the  beginning  of  the  slot  in  order 
to  attempt  its  (re)transmission.  Each  channel  is  chosen 
with  equal  and  constant  probability  Pj  =  1/v.  If  more 
than  one  station  select  the  same  channel  during  a  time 
slot  to  (re)transmit  a  collision  will  occur.  In  case  of  a 
successful  (re)transmission  the  packet  is  uniformly 
distributed  among  M  stations.  If  the  receiver  buffer  of 
the  randomly  selected  station  as  destination  is  Tull'  the 
packet  is  rejected.  This  phenomenon  is  called  receiver 
collision. 

3)  If  a  backlogged  station  retransmits  successfully 
during  a  time  slot  and  the  retransmission  is  not  aborted 
due  to  a  receiver  collision,  it  becomes  free  at  the  next 
time  slot.  A  free  station  becomes  backlogged  in  case  of 
a  unsuccessful  transmission  or  receiver  collisions. 

4)  The  channels  are  error  free  and  there  are  no 
capture  phenomena.  Thus,  packets  may  be  corrupted 
only  because  of  their  concuirent  transmission  or 
receiver  collisions. 

2.  ANALYSIS 

The  throughput  reduction  induced  by  receiver 
collisions  is  related  with  the  possibility  of  receiver 
buffer  overflow  and  this  is  associated  with  buffer 


capacity  and  the  system  throughput  without  the  effect 
of  receiver  collisions.  The  possibility  of  receiver  buffer 
overflow  gives  rise  to  rejection  probability  at 
destination  in  steady  state  which  substantiates  the 
thi*oughput  loss  in  quantitative  fashion.  The  analysis  is 
composed  from  two  paits.  a)  Throughput  evaluation  of 
conventional  multichannel  system  protocol  and  b) 
Throughput  analysis  of  finite  receiver  buffer  size 
protocol. 

2.1  Multichannel  Slotted  Aloha  Without 
Receiver  Collisions 

The  traffic  offered  to  idh  channel  is  given: 

Gj  =  GPi  =  G/v  (1) 

For  finite  population  of  stations  we  adopt  the 
Bertsekas’s  [1]  assumptions  for  Poisson 
approximations  of  the  overall  traffic  (G).  Thus  the 
throughput  per  channel  in  steady  state  is  evaluated  as: 

Sj  =  Gj  exp(-Gj)  =  G/v  exp(-G/v)  (2) 

Thus  the  total  throughput  is: 


The  probability  S^CG)  =  of  finding  r  succes.sfully 
(re)transmitted  packets  during  a  time  .slot  obeys  to 
binomial  probability  low. 

Pr[  Sv(  G)  =  r  ]  =  [  v!  /  (v-r)!  r!  ]  (I  -  )(  v-r )  [ 

Psuc  (4) 

^suc  “  ^  successful  (re)transmi.ssion  during  a  time 

slot  in  channel  i  (1=1. ..v)  }  =  G/v  exp(-G/v)  (5) 

The  probability  Ay(r)  -  k,  of  finding  k  correctly 
received  packets  at  destination  given  that  r  packets 
have  been  transmitted  successfully  during  a  time  slot 
(.see  Appendix) 

We  define  as  conditional  throughput  SRc(k)  the 
average  number  of  different  stations  selected  as 
destination  chosen  by  a  set  of  k  successfully 
(re)transmitted  packets  from  multichannel  system 
during  a  lime  slot. 

niin(M.v) 

Si^c(><)=  Pr[Sv(G)=r]  Pr[A^(r)  =  k] 

(6) 

The  expected  value  of  S^e^k)  is  given  by 


S  =  Sj  =  G  exp(-G/v)  (3) 

i=l 

2.2.  Multichannel  Slotted  Aloha  With 
Receiver  Collisions 

Let 

Sy  =  Random  variable  representing  the  number  of 
successful  (re)transmissions  from  multichannel  system. 

Ay  =  Random  variable  representing  the  number  of 
correctly  received  packets  at  destination. 

Sv(  G  )  =  The  number  of  successful  (re)transmissions 
over  the  multichannel  system,  given  that  the  total 
traffic  is  Poisson  with  a  mean  rate  G  packets/slot  1  < 
Sv(G)<v. 

Ay(k)  =  The  number  of  correctly  received  packets  at 
destination,  conditional  that  k  succe.ssfui 
(re)transmissions  occuired,  1^  Ay(k)^  Sy(r)  for  every 
Sv(k)>0.  “  “ 


min(M,v) 

Si^c  “  k )  I  =  k  Sj^(;’(k) 

k=I 

(7) 

Average  Rejection  Probability 


We  define  P,.^,j,  the  average  rejection  probability  at 
destination  in  steady  state  as  the  ratio  of  the  average 
number  of  packet  rejection  at  de.stination  due  to 
receivers  buffer  overflow,  to  the  average  number  of 
succe.ssfully  (rc)transmitted  packets. 


P,ei  =  (S-SKc)/S 

4.  NUMERICAL  RESULTS 


(8) 


Figure  1  illustrates  the  throughput  versus  the  offered 
traffic  G  for  a  v=5,10,20(channel)systems  with  M=50 
stations.  It  can  be  observed  that  throughput  measures 
are  on  decrease  as  they  compared  with  the  protocol 
case  without  receiver  collisions  for  all  values  of  traffic 
rates.  Also  for  a  given  value  of  traffic  the  difference  S- 
SjiQ  is  increasing  function  of  v.  The  explanation  is  that 
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as  V  grows,  the  throughput  S  increases,  so  the 
possibility  of  receiver  collisions  is  large.  For  example 
let  G=4,  we  have  for  v=5  (S=  1.797,  Sy^q-1.141  and 
Prej=2.761%)  for  v=10  (S=2.681,  Sj^c=2.559  and 
Prej=4.543%)  and  for  v=20  (S=3.275,  and 

Prej=5.772%). 


Figure  1:  The  throughput  versus  the  offered  traffic  G 
characteristics  for  a  v=5,10,20(channcl)systems  with 
M=50  stations.  Analytical  results  with  and  without 
receiver  collisions  schemes. 

Figure  2  presents  the  histogram  of  the  maximum 
average  percent  rejection  probabilities  for 
v=2,5,10,20(channel)systeins  with  M=:50,100  stations. 
It  is  obvious  that  the  maximum  rejection  probability 
Pj.gj(max)  in  a  multichannel  system  corresponds  to 
maximum  achievable  throughput  IF  we  set  the 
first  derivative  of  the  equation  (3)  with  respect  to  G 
equal  to  zero,  we  find  the  optima!  G  that  maximize  the 
throughput  of  the  system.  Thus  we  take  G^^p,=v.  From 
the  Figure  2  is  evident  that  as  M  increases  the 
Prej(niax)  decreases.  Also  as  v  increases  P,.^j(max) 
increases  too.  In  case  of  M=50  we  have  for  v=5, 
P^^j(max)=2.825%  and  for  v=20,  P,.^j=12.145%.  We 
can  say  that  three  paiameters  characterize  the 
performance  behaviour  of  the  multichannel  system 
(v,M,Pj.g.j(max)}. 

Figure  3  illustrates  the  average  rejection  probabilities 
versus  Traffic  G(packets/slot)  for 
v=5,10,20(channel)sy$tems  with  M=50  stations.  For 
low  value  of  the  traffic  G,  the  average  rejection 
probability  increases  lineaiiy  with  G  (low  values  of  the 


throughput  S).  As  the  G  increases  approaching  G^pt 
and  the  throughput  approaching  Pj-^j  begins  to 
saturate  increasing  slowly  towards  P,.gj(max).  For 
higher  values  of  G  (G>G^^pt)  the  S  is  reduced  due  to 
collisions  over  multichannel  system  so  Pj-^j  decreases 
because  the  possibility  of  collision  at  destination  is 
low. 


Figure  2:  The  histogram  of  maximum  average  percent 
rejection  probabilities  for  v=2,5,10,20(channel)systems 
with  M=5(),I00  stations 

CONCLUSIONS 

The  objective  of  this  paper  is  to  examine  the  effect  of 
receiver  collisions  and  to  derive  an  accurate  evaluation 
of  the  throughput  performance  of  a  parallel 
multichannel  architecture  with  receiver  buffer  capacity 
of  one  packet.  We  have  examined  a  model  that  based 
upon  Poisson  assumptions  approximation  for  the  total 
offered  traffic  over  the  multichannel  system  which 
reduces  the  computational  complexity  [2]  for  the 
throughput  perfonnance  evaluation  and  the  rejection 
probability  at  destination,  for  vmious  number  of 
stations  and  channels.  Numerical  results  shows  that  for 
fixed  M,  as  the  number  v  of  channels  increases  the 
probability  of  packet  rejection  P,.ej  at  destination 
increases  and  this  consists  the  main  disadvantage  on 
the  performance  behaviour  of  the  multichannel  system. 
Inversely  for  fixed  v,  the  rejection  probability  reduces 
as  M  increases  improving  the  performance  approaching 
the  performance  of  that  without  receiver  collisions 
protocol. 
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Figure  3:  Average  rejection  probabilities  versus  Traffic 
G(packets/slot)  for  v=5,10,20(channel)sysiems  wi{h 
M=50  stations 

APPENDIX 

The  proposed  unifonn  model  coiresponds  to  the 
occupancy  problem  of  distribution  indistinguishable 
balls(packets)  to  cells  (  destination  stations)  supposing 
that  arrangements  should  have  equal  probabilities.  We 
assume  indistinguishable  packets  (re)transmitted  to 
indistinguishable  stations  using  Maxwell-Boltzman 
statistics  [3] 

Let  M  the  number  of  stations  in  a  v-channel  system. 
We  seek  the  probability  Pr[Av(k)=r]  of  finding  r 
successfully  received  packets  at  destination  when  k 
free  and/or  backlogged  stations  have  been  coirectly 
(re)transmitted  from  the  multichannel  system  during  a 
time  slot. 

It  is  obvious  that  k-r  packets  are  aborted  due  to 
destination  conflicts  phenomena  while  M-r  stations 
remain  without  any  reception,  then. 

Pr[Ay(k)=r]  =  {  Probability  of  distribution  k  packets  to 
r  stations  }x 

{  Probability  that  r  stations  have  received  at 
least  one  packet} 

It  is  supposed  that  each  station  can  transmit  to  and 
receive  from  every  M  station  and  according  to 


Maxwell-Boltzman  statistics  there  are  possible 
iUTangements  of  the  k  correctly  (re)transmitted  packets 
to  M  stations,  each  one  with  probability  M'^.  Also  the 
k  packets  can  be  distributed  among  the  r  (r  <  k) 
stations  in  r^^  different  ways. 

The  ratio  of  favourable  to  possible  an*angements  gives 
the  probability  Pi.^  of  distribution  k  packets  to  r  stations 

P,,,=  M!/(M-r)!r!  v^  /M^  A.l 

Where  M!/(M-r)!r!  are  the  ways  in  which  r  among  M 
stations  are  chosen  as  destination. 

Let  U(k,r)  be  the  probability  the  r  stations  have 
received  at  least  one  packet  at  end  of  the  time  slot. 
Thus  according  to  inclusion  -  exclusion  method  we 
lake: 

r 

U(k.r)=  (l/r><)  (-1/  r!^(M!  (ry)'‘ 

/=«  A.2 

The  probability  that  exactly  r  among  k  correctly 
(rc)transmitted  packets  have  successfully  received  at 
end  of  the  time  slot  is  given  by 

Pr[A^(k)=rl  =  PkrU(k,r)  = 

r 

=  M!/(M-r)!r!  (4/  r!/1(ry)!  [(ry)/M]k 

A.3 
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1.  Abstract 

Low  power  electronics  are  the  key  enabling  technology  of  a 
new  class  of  highly  functional  portable  information 
technology  components  and  consumer  products.  The  key 
challenge  is  to  reduce  power  dissipation  with  minimal  effect 
on  computational  performance.  Silicon-on-insulator  (SOI) 
technologies  offer  the  capability  to  maintain  high  device 
integration  levels,  reduce  power  supply  voltages  to  <1V,  and 
concurrently  reduce  device  capacitance,  allowing  high  speed 
operation.  Potential  applications  and  technology  insertion 
challenges  for  SOI  are  described. 

2,  Introduction 

Advances  and  innovations  in  integrated  circuit  (IC) 
manufacturing  and  design  technologies,  including  first 
generation  low  power  techniques,  are  leading  to  new 
generations  of  wireless,  portable  information  technology 
products  and  components.  Increasing  power  dissipation 
levels  by  the  most  recent  families  of  ICs  is  limiting  their 
practical  application  in  battery  powered,  portable 
microelectronic  systems.  System  size  and  weight,  along 
with  functionality  and  cost,  are  also  among  the  factors  that 
differentiate  these  portable  microelectronic  systems. 
Additionally,  in  some  cases,  the  spatial  volume  occupied  by 
the  microelectronics  and  associated  support  systems  (i.e., 
power  supplies,  thermal  mitigation,  racks,  etc.)  is  of  a 
critical  concern,  especially  in  some  non-consumer, 
noncommercial  applications.  Substantial  reductions  in  IC 
power  dissipation  will  allow  smaller,  lighter  batteries  with 
longer  useful  lifetimes,  reduced  electronic  packaging  costs, 
elimination  of  heat  sinks  and  other  thermal  mitigation 
systems,  enable  reductions  in  the  sizes  and  quantities  of 
external  power  supplies,  and  may  generally  improve  the 
reliability  by  reducing  fatigue  and  breakdown. 

Designs  for  portable  products,  or  applications  that  require 
very  low  power  dissipation  but  high  performance  are 
complicated  by  a  lack  of  sophisticated  design  tools  to  fully 
explore  trade-offs  in  design  and  manufacturing,  optimize 
power  (nominal,  average,  maximum),  while  remaining 
constrained  by  required  performance.  Present  approaches  to 


low  power  system  designs  are  mainly  ad  hoc  and  are  not 
easily  extended  to  future  generations  of  products.  In  the 
past,  some  portable  applications  have  included  custom 
power  management  hardware  and  software  to  control 
unnecessary  power  dissipation  by  system  components.  This 
application-specific  practice  has  allowed  longer  battery 
lifetimes,  but  has  usually  also  compromised  performance 
and  ergonomics.  The  greatest  general  purpose  reductions  in 
power  dissipation  by  an  IC  will  result  from  decreasing  the 
power  supply  voltage.  However,  dimensional  and 
parametric  scaling  of  conventional  CMOS  can  not  keep  pace 
with  the  necessary  voltage  reductions,  so  conventional  bulk 
CMOS  technologies  become  too  slow  at  low  voltage  and 
also  exhibit  relatively  high  current  conduction  below  the 
threshold  voltage. 

The  Advanced  Research  Projects  Agency,  working  with  the 
semiconductor  industry  recently  began  an  aggressive 
program  to  develop  enabling  technologies  for  low  power 
electronics,  including  ultra-shallow  doping  technologies, 
film  silicon-on-insulator  (SOI)  substrate  technologies,  and 
new  approaches  to  power  management,  circuit  architectures 
and  design . 

3.  Low  Power  Electronics  Application  Areas 

Both  commercial  and  noncommercial  application 
areas  are  driving  low  power  electronics  technologies.  The 
spatial  volume  of  systems  has  decreased  by  50%  each  five  to 
seven  year  span,  with  superior  or  at  least  parity  in 
functionality  and  performance.  A  number  of  commercial 
products  are  now  demanding  very  low  power  dissipation  to 
realize  useful  battery  lifetimes  and  functionalities. 

Examples  are  in  portable  computers,  digital  assistants, 
cellular  telephones  and  similar  portable  communication 
systems,  “smart”  cards,  digital  cameras,  and  medical  devices 
(e.g.,  pacemakers  and  implants),  and  hand-held  instruments. 
Portable,  full  speed,  multi-media  systems  are  on  the  horizon 
for  commercial  products,  as  are  compact,  high  resolution,  all 
electronic  video  cameras  that  may  soon  appear  for 
commercial  and  industrial  applications,  all  enabled  by  low 
power  ICs  and  memory.  As  the  information  display  unit  is 
usually  a  large  fraction  of  the  power  budget,  it  is  likely  that 
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voice  commands  and  keyed  inputs  coupled  with  smaller, 
more  dense  flat  panels  will  emerge. 

Future  industrial  applications  might  couple  low  power 
electronic  circuits  with  sensors  and  actuators  and  rf 
communication  to  provide  intermittent,  addressable,  or 
continuous  information  on  conditions  or  surroundings, 
especially  applicable  for  hazardous  environments  or  where 
fine  granularity  is  important.  The  low  power  aspect  allows 
very  small  form  factors  and  long  lifetimes.  The  location  and 
status  of  mobile  or  distributed  resources  or  inventories  could 
be  instantly  updated  through  use  of  ubiquitous  “smart” 
electronics  unobtrusively  embedded  in  appropriate  structures 
or  surroundings.  As  semiconductor  technologies  continue  to 
improve,  any  number  of  additional  embedded  applications  is 
likely  to  emerge  as  well. 

The  importance  of  low  power  enabling  IC  technologies  to 
systems  of  military  interest  is  also  clear  for  both  existing 
platforms  and  those  under  plaiming  and  development.  In  the 
portable,  hand-held  regime,  tiieie  are  applications  for  tactical 
information  assistants,  analogous  to  Personal  Digital 
Assistants,  but  for  assisting  warfighters  in  completing 
mission  requirements.  Tactical  information  assistants  might 
contain  RF  communications  to  upload  or  download 
appropriate  information,  such  as  terrain  maps,  target 
intelligence,  positions,  or  orders. 

Mixed  signal  applications  are  of  great  importance  for 
military  systems  in  such  components  as  RADARs, 

SONARs,  and  communication  systems.  Power  reduction 
here  can  improve  reliability  and  allow  greater  functional 
signal  processing  densities.  Most  of  the  power  dissipated  by 
the  analog  section  is  in  standby  power  of  the  operational 
amplifiers,  roughly  proportional  to  the  circuit  capacitance, 
including  parasitics.  There  are  ongoing  research  efforts  to 
implement  these  analog  and  RF  functions  in  low  power 
CMOS,  and  the  extension  to  SOI  is  both  logical  and  highly 
possible. 

The  general  electronic  content  of  aircraft  avionics,  weapons, 
and  detection  systems  has  been  increasing,  limited  in  some 
cases  by  the  power  available  from  the  generator.  High 
power  dissipation  of  certain  electronics  requires  elaborate  and 
expensive  cooling  techniques  fliat  add  uruiecessary  weight 
and  use  up  valuable  space.  Potentially,  low  power 
electronics  could  eliminate  the  external  thermal  mitigation 
systems  and  reduce  the  size  or  quantity  of  onboard 
generators.  Munitions  such  as  missiles  have  been  moving 
toward  smaller  sizes,  lighter  weights,  greater  accuracy, 
greater  velocity,  and  greater  range.  Battery  powered,  high 
computational  processors  and  fast  memories  are  required 
here,  meeting  a  strict  power  budget  between  the  target  seeker 
and  the  guidance  system.  Ground  based  forces  are  also 


relying  more  on  electronic  systems.  Similar  to  aircraft, 
tanks  and  other  vehicles  have  a  limited  capacity  for 
generating  power  to  run  electronics  and  keep  Arose  systems 
thermally  stable.  Soldiers  could  use  a  variety  of  information 
and  communication  technology  products  to  complete 
missions  and  aid  in  survival,  but  issues  regarding  form 
factor  and  useful  battery  lifetimes  must  be  solved.  In  the 
future,  the  low  power  camera  technology  could  enable 
complete,  intimate,  undetectable  surveillance  of  a  battlefield 
or  other  hostile  territory.  It  is  conceivable  that  a  satellite 
could  be  miniaturized  to  the  size  of  a  wafer,  easily  and 
readily  replicated,  thus  forming  the  backbone  of  an 
uninterruptible,  inexpensive,  highly  redundant,  highly 
reliable  command,  control,  communication,  and  intelligence 
channel  for  global  and  highly  localized  application. 

4.  Technology  Trends  in  Power  Dissipation 

Extrapolations  contained  within  the  Semiconductor  Industry 
Association’ s  National  Technology  Roadmap  for  IC  voltage 
and  power  are  surtunarized  in  Table  1.*  Meeting  both  the 
power  and  power  density  projections  will  require  integration 
of  advanced  technologies,  especially  in  portable  applications. 
Otherwise,  large  power  dissipation  will  result  in 
unpractically  short  lifetimes  for  practical  battery  sources, 
expensive  packaging,  and  heavy  heat  sinks  or  requirement  of 
inconvenient  cooling  techniques.  Trends  in  the  power 
dissipation  of  commercial  volume  microprocessors  are 
shown  in  Figure  1.  These  microprocessors  were  fabricated 
in  bulk  CMOS  or  BiCMOS  technologies,  and  indicate  an 
approximate  tripling  in  dissipated  power  every  generation. 

A  typical  recent  microprocessor  dissipates  in  the  10-30W 
range  and  operates  around  lOOMHz. 


TART.E  1 :  STA  Roadmap  Extrapolations! 
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5.  Device  Power  Dissipation 

CMOS  is  the  dominant  technology  choice  for  digital 
applications.  There  are  three  principal  components  to  the 
power  dissipated  in  a  CMOS  ^vice, 

Pt  “  npCjoa(lVout^dd^clock'*‘Isc^dd'*"Ileakage^<Id  (1) 

where  the  activity  factor  np  is  determined  by  power 
management  schemes  and  all  other  quantities  have  their 
usual  meanings.  The  first  term  of  (1)  is  the  dynamic 
switching  component  and  is  often  the  dominant  factor,  but 
there  is  also  a  short  circuit  current  during  the  switching 
transient,  and  a  dc  leakage  current  due  to  the  diode  dark 
current  conduction.  Careful  device  design  and 
control  of  process  parameters,  such  as  threshold  voltage,  can 
minimize  and  Iieakage-  There  are  four  basic  options 
available  to  reduce  the  switched  power  dissipation.  The 
greatest  reductions  come  from  reducing  the  power  supply 
voltage  since  power  decreases  nonlinearly  as  the  square. 

But,  reducing  device  and  interconnect  capacitance  is 
necessary  as  well. 

With  architecture  changes,  the  clock  frequency  may  be 
reduced  without  sacrificing  performance.  In  some  fixed 
throughput  applications,  such  as  data 
compression/decompression,  parallel  circuits  may  be 
implemented  to  maintain  bandwidth  or  functionality,  while 
also  enabling  reductions  in  fdocb  which  allows  reductions 
in  Dynamic  control  of  the  clock  frequency  may 
provide  dramatic  reductions  in  power  consumption.  The 
underlying  idea  is  to  adaptively  control  the  clock  cycle  to 
provide  only  as  much  throughput  or  bandwidth  as  is  required 
at  that  instant.  SOI  material  technology  has  several 


compelling  potential  advantages  to  engineered-channel  scaled 
CMOS  structures. 

6.  Fabrication  Technology  Approaches 

Conventional  approaches  to  reducing  the  power  of  an 
integrated  circuit  are  tantamount  to  reducing  the  power 
supply  voltage.  To  retain  performance,  typically,  device  and 
process  parameters  are  scaled  by  the  same  factor  (as  the 
power  supply  reduction).  There  are  tremendous  challenges 
in  scaling  as  required  critical  dimensions  are  reaching 
practical  limits,  among  them  gate  oxides  are  nearing 
tunneling  limits,  and  junction  depths  are  shallower  than 
present  technology  can  reproducibly  manufacture.  Junction 
depths  are  particularly  critical  since  leakage  and  capacitance 
is  directly  proportional.  A  new  approach  to  junction  doping 
is  under  development  based  on  a  pseudo-lifliography 
approach,  called  projection  gas  immersion  laser  doping 
(PGILD).  A  high  power  laser  essentially  images  a  device 
region  and  melts  the  silicon  wafer  in  the  presence  of  a 
dopant  gas  to  form  a  very  shallow  junction.  Preliminary 
experimental  results  have  confirmed  the  ability  to  control 
junction  depth  through  varying  the  laser  pulse  to  increase  or 
decrease  the  melt  depth.  Initial  results  indicate  that  the 
PGBLD  technology  will  be  capable  of  reproducing  junctions 
under  lOnm.  Development  of  a  production  prototype  tool 
and  imit  process  are  now  underway,  as  are  process 
integration  efforts. 

While  attractive  and  simple,  scaling  conventional  bulk 
CMOS  structures  and  parameters  to  reduce  power  supply 
voltages  does  not  clearly  provide  a  robust  long-term  solution 
of  low  power  electronics.  Although  power  certainly  does 
scale,  power  density  does  not,  so  the  thermal  limitations 
remain.  In  addition,  while  minimum  feature  sizes  have 
decreased,  chip  active  areas  have  tended  to  increase,  providing 
greater  transistor  integration  levels  and  thereby  increasing 
power  dissipation.  It  is  not  altogether  certain  that  bulk 
CMOS  technologies,  even  with  engineered  channels,  will 
work  satisfactorily  with  supply  voltages  below  about  1.5V. 

Thin  film  SOI  substrate  technologies  are  attractive  for  use 
in  applications  that  require  low  power,  since  SOI  offers 
significant  advantages  in  operating  low  voltage, 
short-channel  transistors,  from  a  speed-power  perspective. 
SOI  is  also  entirely  compatible  with  conventional  process 
technology  and  may  also  provide  an  opportunity  for  process 
simplification  as  compared  to  conventional  bulk  CMOS 
technology.  It  has  been  estimated  that  a  nominal  SOI 
process  could  use  approximately  15%  fewer  steps  than 
silicon  CMOS,  eliminate  as  many  as  fliree  masks,  and 
provide  higher  levels  of  device  integration.^  SOI  starting 
substrates  for  future  leading-edge  microelectronics 
technology  generations  will  require  the  development  of 
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high-quality,  thin  buried  oxide  (BOX)  layers  and  growth  of 
gate  oxide  films  near  the  tunneling  limit.  A  series  of 
comprehensive  experiments  is  now  underway  in  coordination 
with  ARPA,  SEMATECH,  the  semiconductor  industry,  and 
SOI  wafer  suppliers.^ 

The  key  issues  for  either  SOI  technology  in  commodity 
applications  is  wafer  supply  and  wafer  quality  of  acceptable 
material  at  affordable  prices.  A  number  of  challenges  must 
be  solved  for  both  SOI  approaches  before  wide  scale  use  in 
VLSI  applications,  such  as  integrity,  defects  and 
contamination  in  the  oxide  and  the  thin  active  region.  The 
completeness  and  integrity  of  the  bond  is  an  issue  for 
bonded,  while  the  quality  of  buried  oxide  and  the  thin  silicon 
region  are  a  concern  for  SIMOX  wafers.  Particularly  critical 
is  the  thickness  control  of  the  thin  silicon  layer,  since  that 
directly  affects  the  threshold  voltage  for  fully  depleted 
devices.  For  SOI  to  remain  a  viable  a  technology  option, 
competitively  priced  SOI  wafers  must  be  readily  available 
with  excellent  thin  silicon  thickness  (under  lOOnm) 
uniformity  (<  3nm)  to  support  fully  and  partially  depleted 
transistor  channels.  Although  the  early  indications  from 
suppliers  is  positive,  the  control  of  intra-wafer,  intra-lot  and 
lot-to-lot  material  parameters  must  be  improved  and 
demonstrated  for  both  technologies  to  gain  long-term 
acceptance. 


7,  SOI  Devices 

The  cross-section  of  an  SOI  transistor  is  shown  in  Figure  2. 
By  making  the  silicon  thickness  very  thin,  under  600A  or 
so,  the  depletion  layer  extends  entirely  through  the  active 
area  (fully  depleted)  and  drastically  improves  the  electrical 
characteristics,  as  compied  to  conventional  CMOS  or  even 
thick  film  (partially-depleted)  SOI  (see  Fig.  3).  However,  it 
is  more  difficult  to  fabricate  substrates  for  fully  depleted 
technologies  and  partially  depleted  devices  can  have  a  better 
subthreshold  slope  at  high  drain  bias.  Other  potential 
advantages  of  SOI  include  a  reduction  in  device  and 
interconnect  capacitance,  as  well  as  parasitics.  It  may  be 
quite  reasonable  to  employ  doped  buried  layers  to  provide  a 
means  to  electrically  bias  devices  to  compensate  for  process 
variations  or  possibly  be  used  to  control  device  conduction 
dynamically  to  turn  on  or  off  unnecessary  areas  of  a  chip. 
Since  there  are  no  well  contacts,  complementary  circuits 
formed  in  SOI  are  not  susceptible  to  &e  “latch-up” 
phenomena  and  device  isolation  is  nearly  total.  The  effects 
of  the  back-side  oxide-silicon  interface  must  be  studied  since 
that  layer  floats  in  voltage  and  interface  states  may  trap 
holes  leading  to  long  term  reliability  problems  or  a  number 
of  first  and  second  order  modifications  to  device  I-V 
performance.  Susceptibility  of  SOI  devices  to  electrostatic 
discharge  (ESD)  and  methods  to  provide  fast  input  protection 
need  detailed  investigations. 


Figure  2  Cross  Section  of  an  SOI  Inverter 
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Figure  3:  SOI  and  conventional  CMOS  thresholds  voltages 


8.  Conclusion 


Low  power  electronics  technologies  will  have  a 
tremendous  impact  on  the  performance  and  functionality  of 
future  systems.  SOI  is  an  attractive  technology  option  for 
low  power,  high  performance  semiconductors.  Efforts  are 
underway  to  improve  material  properties  and  availability. 
Next  steps  in  the  development  of  an  SOI  technology  base 
are  better  understanding  of  how  materials  issues  affect 
device/circuit  operation,  and  which  device  design  paradigms 
are  best. 
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Abstract  -  A  self  aligned  complementary  GaAs  (CGaAs™) 
technology  has  been  developed  for  low-power,  high-speed  digital 
and  mixed-mode  applications.  A  single  process  flow  has  been  used 
to  build  low-voltage,  low-power,  full-complementary  digital 
circuits  at  200MHz,  high-speed  SCFL  digital  circuits  at  5GHz,  RF 
MMIC  and  power  circuits  (900MHz),  and  combinations  of  these. 
Complementary  digital  circuits  have  demonstrated  speed-power 
performance  of  O.ljiW/MHz/gate  at  0.9V.  A  mixed  SCFL/ 
Complementary  signal  processor  operated  at  >  IGHz  with  a 
speed-power  performance  of  0.16ia.W/MHz/gate. 


L  INTRODUCTION 

As  the  trend  in  electronic  devices  continues  to  move  towards 
lower  voltage,  lower  power,  and  higher  integration,  there  will 
be  an  ever  increasing  demand  for  circuit  technologies  which  do 
not  sacrifice  performance  for  low  power.  Ideally,  this 
technology  would  capable  of  operating  at  0.9  to  1.5V  which 
would  require  the  use  of  only  a  single  battery  in  portable 
applications.  The  CGaAs^^  (Complementary  heterostructure 
GaAs)  process  is  ideal  for  very  low  voltage  applications,  and 
has  an  optimum  speed-power  performance  at  supply  voltages 
near  0.9  V.  Its  flexibility  allows  a  variety  of  circuit  topologies 
to  be  used,  and  designs  can  be  tailored  to  meet  speed  and  power 
goals  through  both  architecture  and  power  partitioning.  In 
addition,  mixed  mode  applications  can  be  considered  which  do 
not  incur  large  sacrifices  in  either  digital  or  RF  performance 
and  allow  new  circuit  concepts  to  be  used  which  take 
advantage  of  this  integration. 

II.  CGaAs  TECHNOLOGY  DESCRIPTION 

A.  Epitaxial  Material 

CGaAs  devices  were  fabricated  on  4-inch  epitaxial  wafers 
consisting  of  a  30A  GaAs  cap,  250A  undoped  Alo,75GaQ  25AS, 
150A  undoped  Iuq  2Gao  gAs,  and  a  2000A  GaAs  buffer  layer 
grown  on  LEG  GaAs  substrates  (Figure  1).  The  threshold 
voltages  of  the  NFET  and  PFET  devices  depend  on  the  N-  and 
P-Schottky  barriers  heights,  the  conduction  and  valance  band 
discontinuities  of  the  AlGaAs/InGaAs  junction  respectively, 
and  the  pinch-off  voltage.  N-  &  P-channel  device  threshold 
voltages  were  adjusted  simultaneously  with  a  Silicon  pulse 
doping  30A  below  the  InGaAs  channel.  Thus  the  threshold 
voltages  are  set  by  the  epitaxial  material,  not  by  ion- 
implantation.  For  small  threshold  voltage  variations,  precise 


control  of  the  layer  thicknesses,  mole  fraction,  and  silicon  delta 
doping  were  required.  Molecular  Beam  Epitaxy  (MBE) 
technology  allows  excellent  control  of  layer  dimensions  and 
compositions  and  is  ideal  for  such  structures.  Metalorganic 
Chemical  Vapor  Deposition  (MOCVD)  grown  wafers  have 
also  been  used  with  success. 

B.  Device  Fabrication 

A  schematic  cross-section  of  the  CGaAs  process  is  shown  in 
Figure  1.  The  gate  metal  was  RF  sputtered  TiWN  (4000A) 
which  was  etched  with  a  low  damage  SF^CHF^/Ro  plasma  to 
a  nominal  gate  length  of  1.0  pm.  The  wafers  were  immediately 
capped  with  PECVD  SiN  to  prevent  oxidation  of  the  GaAs/ 
AlGaAs  layers.  N-channel  (Si'*')  and  P-channel  (Be'^/F^  co¬ 
implant)  source/drain  implants  were  self-aligned  to  the  gates. 
The  Fluorine  improved  activation  of  the  Beryllium  and  also 
reduced  the  in-diffusion  of  Be  under  the  gate  and  out-diffusion. 
The  implants  were  rapid  thermal  annealed  in  a  temperature 
range  from  TOO^C  to  800®C.  The  devices  were  then  isolated  via 
oxygen  implant,  and  capped  with  PECVD  Si02.  A  single 
refractory  ohmic  metal  contacted  both  device  types  and  was 
formed  using  dielectric-assisted  lift-off.  The  refractory  ohmics 
were  stable  up  ton  temperatufe  range  of  500®C  to  600®C  which 
allowed  an  interface  with  VLSI  Aluminum  interconnect 
metallization,  where  Al-Cu  deposition  took  place  at  a 
temperature  near  500^C. 

The  balance  of  the  process  was  similar  to  standard  Silicon 
processing.  TEOS  was  deposited  and  then  vias  were  formed  to 
contact  gate  and  ohmic  regions.  The  first  interconnect  metal 
consisted  of  a  TiW/AlCu/TiW  sandwich.  A  TEOS  interlevel 
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Figure  1 :  CGaAs  Process  Cross-Section 
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dielectric  was  deposited,  a  deposition/etchback  planarization 
was  performed,  vias  were  formed,  and  an  AlCu/TiW  second 
metal  was  added.  Finally,  devices  were  passivated  with  Si02 
and  SiN[l]. 

III.  DEVICE  RESULTS 

Typical  I-V  characteristics  of  1  x  10  |xm  N-  and  P-channel 
devices  are  shown  in  Figures  2  (&  3.  Threshold  voltages  for 
these  devices  were  designed  to  be  +0.55V  for  NFETs  and  - 
0.55V  for  PFETs.  While  this  gave  about  a  4:1  ratio  of  drain 
currents,  it  produced  the  optimum  speed-power  product  in 
complementary  circuits.  The  devices  had  very  good  pinch-off 


Vds  (V) 


Figure  2:  Drain  current  characteristics  of  1x10  ^im  NFET 
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Figure  3:  Drain  current  characteristics  of  1x10  iiim  PFET 


characteristics  and  output  conductances.  Typical  device 
parameters  are  shown  in  Table  1. 


Parameter 

NFET  (1x10  pm) 

PFET  (1x10  pm) 

Vt 

+0.55  V 

-0.55  V 

^dss 

1.5  mA 

0.4  mA 

Gm 

250  mS/mm 

60mS/mm 

Beta 

250  mAA^^-mm 

55  mAA^^-mm 

Subth  slope 

75  mV/dec 

90  mV/dec 

Subth  Current 
(Vgs=0V) 

<  InA 

<  lOnA 

Table  1 :  Device  Parameters 


Due  to  the  CGaAs  devices’  low  threshold  voltage  and  high 
current  drive,  performance  is  very  good  even  at  low  voltages. 
Figure  4  shows  unloaded  ring  oscillator  delay  versus  supply 
voltage  for  several  complementary  technologies.  The 
performance  of  the  1 .0  pm  CGaAs  is  superior  to  0.5  pm  CMOS 
or  Thin-Film  Silicon-on-Insulator  (TFSOI)  and  the  0.5  pm 
CGaAs  shows  delays  below  lOOps  at  1.2V. 


Vdd  Volts 


Figure  4:  Ring  Oscillator  Delay  vs.  Voltage 


IV.  ARCHITECHTURE  VERSATILITY 

A  variety  of  circuit  design  options  are  available  using 
CGaAs  technology.  For  digital  designs,  these  include  full 
complementary  designs  (CMOS-like),  single-ended  current 
steering  logic  (DCFL-like),  and  differential  current  steering 
logic  (ECL/CML-like).  Each  of  these  styles  present  the 
designer  trade-offs  in  power,  speed,  and  gate  complexity.  The 
operating  speed  and  power  per  gate  for  each  style  is  shown  in 
Figures  5  &  6.  All  of  these  styles  may  be  used  together  on  the 
standard  CGaAs  process  and  circuits  can  operate  over  a  range 
from  DC  to  greater  than  5  Gb/s,  with  world  class  speed-power 
in  each  regime  [2]. 
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Figure  5:  CGaAs  Circuit  Configurations 


Figure  6:  CGaAs  Circuit  Performance 


This  architectural  versatility  provides  many  of  the 
advantages  of  BiCMOS,  but  with  a  much  simpler  process.  A 
typical  BiCMOS  process  flow  adds  the  steps  required  to  make 
a  Bipolar  transistor  to  an  existing  CMOS  flow.  This  results  in 
process  flows  with  25  or  more  mask  steps.  The  standard 
CGaAs  flow  uses  13  masks  through  3rd  metal  interconnect  and 
provides  all  three  types  of  circuit  topologies  if  desired. 

The  complementary  circuit  style  gives  performance 
exceeding  advanced  CMOS  processes  while  operating  at  0.9  - 
1.5  V.  Complex  logic  gates  can  be  built  as  in  CMOS  while 
layout  is  simplified.  Since  the  CGaAs  process  does  not  require 
wells,  tubs,  or  substrate  contacts,  devices  can  be  laid  out 
without  the  “normal”  restrictions  of  putting  all  of  one  device 
type  together  in  a  common  well.  In  fact,  N-  and  P-type  device 
contacts  can  even  be  butted  together  with  the  ohmic  contact 
merged.  This  results  in  much  simpler  layouts  and  more 
compact  designs.  Most  CMOS  designs  can  be  mapped  directly 
into  CGaAs  designs,  with  little  or  no  changes  required.  The 
only  modifications  typically  required  involve  the  lower  supply 
voltage  (limits  on  device  stacking,  etc.)  and  would  also  be 
needed  for  low- voltage  CMOS  design. 


The  DCFL-like  circuit  topology  uses  a  PFET  in  place  of  a 
depletion-mode  MESFET  for  the  current  source,  and  offers 
speed  similar  to  E/D  MESFET  designs.  Due  to  the  higher  gate 
diode  turn-on  voltage  of  the  CGaAs  devices,  the  high-level 
output  of  the  gates  are  not  clamped  to  0.7V  as  is  typical  in  E/D 
MESFET  designs.  This  has  several  advantages  including: 
higher  noise  margins,  higher  gate  fan-in,  the  ability  to  stack 
input  devices  to  make  more  complex  gates,  and  the  ability  to 
directly  interface  with  complementary  circuits.  Another 
advantage  is  that  the  P-load  can  be  switched  off  when  the 
circuit  is  not  operating  which  reduces  the  standby  power  to  a 
negligible  level. 

The  source-coupled  FET  logic  (SCFL)  topology  provides 
the  highest  operating  speed  at  the  expense  of  higher  power. 
SCFL  designs  use  differential  current  steering  logic  analogous 
to  Silicon  ECL  or  CML.  Since  only  the  very  fast  N-FET  is  used 
in  SCFL,  circuit  speeds  can  exceed  5  Gb/s. 

V.  CIRCUIT  RESULTS 

Several  circuits  have  been  built  using  each  of  the  design 
techniques  and  demonstrated  the  high  performance  and 
flexibility  of  CGaAs  technology. 

A.  Complementary  Designs 

Several  large  circuits  have  been  built  with  the  full 
complementary  design  style.  These  include  a  4K  SRAM,  a 
16x16  bit  multiply/accumulator,  and  a  microprocessor  core. 
These  designs  have  demonstrated  speed-power  performance  as 
low  as  0.01  pW/MHz/gate  at  0.9V  and  0.1  jjiW/MHz/gate  at 
L2V.  Shift  registers  have  operated  at  greater  than  500MHz. 

The  4K  SRAM  had  nominal  access  delays  of  5.3ns  at  1.5  V 
and  15.0ns  at  0.9  V.  SRAM  power  ranged  from  0.36  mW  at 
0.9  V  to  16,2  mW  at  1.5  V  [3].  This  power  was  significantly 
lower  than  other  GaAs  SRAM  designs  (Fig.  7)  [4-12].  The 
SRAM  was  also  tested  over  temperature  and  showed  very  little 
delay  variation  (<  1%)  from  -2QPC  to  +120®C. 

B.  Mixed  Digital  Designs 

A  signal  processor  circuit  was  designed  with  a  mix  of  high¬ 
speed  SCFL  and  low  power  complementary  blocks.  It  operated 
at  over  IGHz  over  temperature  (-35®C  to  +110^C)  with  a 
speed-power  of  0.4iiW/MHz/gate.  The  SCFL  circuits  operated 
at  -4  Volts  while  the  complementary  blocks  used  a  -1.2V 
supply.  Further  testing  showed  that  the  SCFL  supply  could  be 
reduced  to  -2.5V  while  maintaining  operation  at  IGHz  and  this 
reduced  the  speed-power  to  0.16  pW/MHz/gate  [2]. 

C.  Mixed  Mode  Designs 

Some  initial  circuit  design  and  characterization  has  begun 
using  CGaAs  devices  for  analog,  MMIC,  and  RF  power 
applications.  While  some  device  improvements  still  need  to  be 
made  for  precision  analog  circuits,  there  are  many  applications 
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Figure  7:  GaAs  SRAM  Delay  and  Power  Comparison  [4-10] 

which  can  be  addressed  including:  digital  filtering,  A/D  &  D/ 
A  converters,  ATM,  etc.  Since  n-channel  CGaAs  devices  are 
very  similar  to  typical  GaAs  HFETs,  both  MMIC  and  power 
RF  circuits  can  be  designed.  Table  2  shows  typical  RF  device 
performance  for  0.7pm  gate  length  devices  at  l.OV.  A  0.7  pm 
X  3mm  NFET  was  used  as  a  power  amplifier  in  class  AB 
operation  with  26dBm  output  power  at  IGHz  for  a  3V  supply; 
power-added  efficiency  was  greater  than  60%. 


Parameter 

NFET  (Lg=0.7|J.m) 

PFET(Lg=0.7^im) 

ft(Vds=1.0V) 

20  GHz 

5GHz 

W(Vds=1.0V) 

30GHz 

lOGHz 

MAG  (Vds=1.0V) 

20dB 

6dB 

Table  2:  RF  Device  Parameters 


VL  CONCLUSIONS 

We  have  developed  a  complementary  GaAs  process  which 
is  ideal  for  low-power,  high-speed  digital  and  mixed-mode 
applications.  The  complementary  operation  along  with  the 
robust  metal  interconnect  process  allows  large  integration 
levels  (10-30  K  transistors)  and  provides  a  path  for  VLSI  GaAs 
ICs  in  the  future.  A  variety  of  circuit  topologies  gives  the 
designer  a  wide  range  of  options  in  both  architecture  and 
power  partitioning. 
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Abstract 

A  GaAs  BiFET  LSI  technology  has  been 
successfully  developed  for  high  speed  and  mixed 
signal  circuit  applications.  High  integration  levels 
and  functional  circuit  yield  have  been  achieved. 
Excellent  HBT  and  FET  characteristics  have  been 
produced,  with  the  noise  figure  of  the  FETs 
comparable  to  those  of  traditional  MESFETs, 
enabling  them  to  perform  well  in  front-end  receiver 
applications.  Through  this  technology,  several  LSI 
circuits,  including  a  2  Gsps  2-bit  prototype 
DRFM,  a  2  GHz  32  x  2  bit  shift  register,  a  sample 
and  hold  circuit  with  9-bit  resolution  at  200  Msps 
and  SRAMs  with  a  record  access  time  (330ps) 
have  been  successfully  demonstrated. 

1.  Introduction 

The  integration  of  GaAs  HBTs  and  MESFETs 
(Bipolar^^T  or  BiFET)  can  make  a  significant 
impact  on  the  design  of  high  speed  and  mixed 
signal  integrated  circuits  and  systems.  BiFET 
technology  offers  great  design  flexibility  and  novel 
circuit  opportunities  by  combining  the  device 
advantages  of  the  HBT’s  high  switching  speed, 
high  drive  capability,  excellent  threshold  voltage 
matching,  and  low  1/f  noise;  as  well  as  the  FET’s 
low  noise,  high  input  impedance,  high  density, 
and  low  power  dissipation  on  a  single  chip.  In 
particular,  the  addition  of  MESFETs  provides 
active  loads,  low  voltage  current  sinks  for 
minimizing  power  consumption,  and  floating 
current  sources  to  provide  high  gain. 

2.  Device  Structure  and  Fabrication 

To  achieve  high  integration  level  and  high  yield,  a 
planar  BiFET  IC  process  has  been  developed.  The 
typical  material  growth  sequence  and  layer 
structure  of  the  HBT  are  summarized  in  Fig.  1. 
The  most  obvious  advantage  of  this  process  is  the 
use  of  an  already  existing  emitter  epilayer  to  define 
the  FET  active  channel.  This  eliminates  the  need 


for  material  regrowth,  provides  a  more  planar 
structure,  and  avoids  the  need  of  forming  p-type 
buried  layer  to  reduce  both  short  channel  and 
sidegating  effects. 


n-GaAs 


n-  AIQaAs 


p+GaAs 


n+GaAs 


Sub-Gollector 


Semi-insulating  GaAs  Substrate 


Fig.  1  Schematic  cross  section  of  the  monolithic 
integrated  GaAs  HBT  and  MESFET. 

The  FET  was  fabricated  from  the  emitter  cap 
epilayers.  During  normal  operation  of  the  FET,  the 
AlG^s  layer  in  the  HBT  emitter  is  fully  depleted; 
thus  the  conduction  paths  remain  primarily  in  the 
GaAs  (rather  than  lower  mobility  AlGaAs)  apart 
from  real  space  transfer  of  electrons.  The  FET 
ohmic  was  on  the  heavily  doped  emitter  cap  layer. 
The  backgate  (BG)  is  a  unique  feature  of  the  FET, 
which  was  formed  on  the  p+  GaAs  during  the 
same  fabrication  process  of  the  HBT  base  contact. 
Like  the  front  gate,  the  backgate  can  also  be  used 
to  modulate  the  FET  channel.  Gate  recessing  was 
performed  using  an  NH40H:H202:H20  wet 
chemical  etch  with  the  end  point  determined  by 
monitoring  the  channel  current.  Several  steps  in  the 
NffiSFET  fabrication  process  have  been  combined 
with  the  HBT  process,  including  ion  implantation 
for  device  isolation  and  all  metalization  for 
interconnects.  A  standard  baseline  HBT  process 
was  implemented  for  the  device  and  circuit 
fabrication.  In  this  process  a  Schottky  diode  was 


0-7803-2516-8/95  $4.00  ©  1995  IEEE 


279 


formed  on  the  collector  layer;  thin  film  NiCr 
resistors  were  also  fabricated.  The  three  level 
interconnects  were  processed  along  with  the  HBTs 
using  standard  Ti/Au  metalizations  and  polyimide 
for  intermetal  isolation.  All  of  the  process  steps  are 
identical  to  the  baseline  HBT,  with  the  exception  of 
the  gate  definition. 

3.  BiFET  Device  Performance 

The  HBTs  fabricated  in  this  technology  are 
comparable  to  those  made  by  the  HBT-only 
process.  I-V  characteristics  are  shown  in  Figure  2. 
Good  current  gain  (>  50  at  1  mA)  and  excellent 
RF  properties  (ft  and  fmax  >  50  GHz  at  Ic  =  3 
mA)  have  been  achieved  for  a  1.4x3  jim^ 
device.  This  process  has  also  produced  both 
E-mode  and  D-mode  MESFETs  with  excellent 
characteristics.  Typical  I-V  characteristics  for  an  E- 
mode  MESFET  are  shown  in  Fig.  3.  The  ft  and 
fmax  of  a  0.5  |Lim  FET  were  measured  to  be  40  and 
35  GHz,  respectively;  the  gm  of  the  FETs  ranged 
from  250  to  350  mS/mm.  Further  reduction  in  the 
gate  length  will  result  in  significant  improvement 
of  ft»  fmax* 


0  0.5/div  5 

Vce(V) 

Fig.  2  I-V  characteristics  of  (a)  GaAs  HBTs  with 
emitter  dimension  of  1.4  jim  x  3  p,m. 

4.  Circuit  Demonstrations 

Several  key  GaAs  BiFET  circuits  have  been 
demonstrated.  Those  include  the  first  LSI  circuits: 
a  single  chip  prototype  DRFM,  200  Msps 
sample/hold  circuits  and  ultra-fast  SRAMs. 


Fig.  3  I-V  characteristics  of  GaAs  MESFETs  with 

gate  length  of  0.5  |im  fabricated  in  the 

BiFET  process. 

5.  DRFM  on  a  Single  Chip 

The  fabricated  DRFM  (Fig.  4)  consists  of  a  2-bit 
ADC,  a  2-bit  DAC,  32  x  2-bit  word  shift  register 
and  supporting  circuitry.  It  contains  518  HBTs, 
1572  pfeTs  and  332  Schottky  diodes.  The  DRFM 
system  diagram  is  shown  in  Fig.  5.  The  designed 
system  can  be  easily  expanded  in  both  length  and 
width.  Basic  functionality  consists  of  digitizing  an 
input  signal  through  a  2-bit  analog  to  digital 
converter  (ADC),  selectively  delaying  (or  storing) 
the  pattern  by  using  a  32  x  2-bit  word  shift 
register,  and  then  reconstituting  it  with  a  2-bit 
DAC.  The  output  waveforms  of  the  DRFM  are 
shown  in  Fig.  6.  The  DRFM  was  also  found  to  be 
operational  up  to  2  GHz. 


Fig.  4  Microphotograph  of  a  fabricated  BiFET 
DRFM. 
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BECIRCUUTING 
SHIFT  REGISTER 


Fig.  5  DRFM  system  diagram,  which  consists  of 
a  two-bit  ADC,  a  two-bit  DAC,  a  32-bit 
by  2-bit  word  shift  register,  and  support 
circuitry. 


6.  Sample  and  Hold  Circuit 

A  BiFET  sample  and  hold  circuit  (Fig.  7)  has  been 
fabricated  and  evaluated.  SINAD  (Signal  to  Noise 
And  Distortion)  was  measured  to  be  greater  than 
55  dB  (>  9  bit),  but  was  limited  by  the  test  setup. 
The  device  accepts  differential  analog  input  and 
produces  differential  output.  Clocks  at  up  to  200 
MHz  are  differential,  but  were  driven  single-ended 
for  this  test.  Twelve-bit  performance  (74  dB)  was 
expected  from  the  simulation;  and  nearly  9-bit 
performance  was  observed  at  200  Msps  with  an 
input  frequency  of  198  MHz,  despite  our  test  setup 
limitations.  The  very  low  droop  rates  (10  Msps) 
offered  by  the  BiFET  technology  were  also 
verified.  The  photos  (Fig.  8)  show  the  a  signal 
with  sampled  output  at  200  Msps  and  the 
spectrum  of  the  beat  frequency. 
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Input  frequency  shifted 
by  2-bit  DRFM 
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Fig.  6  (a)  DRFM  output  waveform,  (b)  Input  signal  frequency  down  shifted  by  2-bit  DRFM^ _ 


Fig.  7  BiFET  sample  and  hold  circuit  schematic.  g  BiFET  samplediold  citcuit  output 
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7.  Ultrafast  BiFET  SRAM 


A  small  size  (64  bits)  SRAM  was  fabricated  and 
tested.  It  is  a  conventional  design  in  which  six 
address  lines,  a  WR  strobe.  Din  and  Qout  control 
all  function.  The  bit  cell  consists  of  four  IVESFETs 
and  two  HBTs  (Fig.  9).  The  two  HBTs  and  two 
MESFETs  form  a  cross-coupled  latch,  while  two 
MESFETs  are  pass  elements  to  complementary  bit 
lines.  The  pass  elements  have  gates  connected  to 
word  lines.  All  peripheral  circuitry  is  implemented 
in  HBTs,  with  Schottky  diodes  used  for  row 
decoding. 


Vdd 


SCW>35lC.0e0SW 


An  access  time  test  was  performed.  The  access 
time  test  consists  of  selecting  an  arbitrary  address 
line  (A5)  and  toggling  it  a  1  GHz.  This  is  a  column 
select  and  ought  to  represent  a  worst-case  read 
access  time.  An  HP54120  oscilloscope  was  used 
to  capture  the  output  and  measure  the  delay. 
Access  time  of  around  330  ps  to  360  ps  was 
observed  (Fig.  10).  To  our  knowledge,  this  is  the 
fastest  SRAM  ever  demonstrated  by  any 
semiconductor  technology. 


Bit  Bit  BH  Bit 


Fig.  9  Schematic  and  microphotograph  of  BiFET  memory  cell. 

Read  access  Read  access 
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Time  (200  psA>Msldn} 

Fig.  10  BiFET  SRAM  with  read  access  time  of  330  ps. 


8.  Summary 

In  summary,  we  have  developed  a  manufacturable, 
high  speed  BiFET  technology  for  both  analog  and 
digital  circuit  applications.  The  technology  is  fully 
compatible  with  the  standard  HBT  material  growA 
and  fabrication  processes  (only  one  additional 
mask  needed  to  fabricate  MESFETs  and  integrate 
them  with  HBTs).  We  have  demonstrated  various 
BiFET  LSI  circuits  including:  a  2  Gsps  2-bit 
prototype  DRFM,  sample  and  hold  circuits  with 
9-bit  resolution  at  200  Msps  and  SRAMs  with 


very  fast  access  time  (330ps).  The  developed 
BireT  technology  is  anticipated  to  impact  both 
military  and  coimnercial  circuits  and  systems  in 
areas  of  functionality,  speed,  power,  noise  and 
system  architecture. 
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ABSTRACT 

Analog  fiber-optic  links  have  a  number  of  design  challenges 
that  are  not  found  in  their  more  popular  digital  link  counter¬ 
parts,  This  talk  begins  with  a  brief  introduction  to  the  field, 
then  concentrates  on  the  principals  and  experimental  state  of 
the  art  for  both  directly  and  externally  modulated  links.  The 
utility  of  these  links  is  then  demonstrated  by  applications  in 
cable  television  distribution,  cellular  telephone  and  phased 

array  antennas.* 


1,  INTRODUCTION 

Transmission  of  signals  over  optical  fiber  has  become  com¬ 
mon  place.  However  probably  99%  of  all  fiber  optic  links 
are  digital  links,  i.e.  links  wherein  the  original  data  stream 
modulates  the  light  without  first  being  put  on  an  RF  carrier. 
There  is  a  growing  list  of  so  called  analog  applications  where 
the  information  to  be  conveyed  over  fiber  is  riding  on  an  RF 
carrier,  e.g.  cable  TV  signals.  The  rational  for  using  analog 
fiber  optic  links  is  basically  economic:  by  avoiding  the  need 
to  convert  the  desired  signal  to  or  from  its  original  format,  it 
is  possible  to  reduce  the  cost  of  the  system.  For  example, 
high  quality  analog  links  can  transmit  an  80  channel  CATV 
feed  in  the  standard  NTSC  format  thereby  avoiding  the  need 
to  digitize  this  signal  at  the  head  end  and  then  reconvert  the 
signal  at  each  users  TV. 

Although  there  is  clearly  overlap  in  the  design  issues 
between  digital  and  analog  links,  there  are  a  number  of 
issues  that  are  unique  to  the  analog  link.  Common  to  both 
types  of  links  is  the  optical  fiber  and  the  wavelengths  of 
operation.  All  optical  links  use  silica  glass  fibers  in  which 
the  attenuation  at  the  wavelengths  of  interest  has  been 
reduced  to  about  1/10  that  of  clear  air.  Thus  the  fibers  pro¬ 
vide  a  transmission  medium  where  the  loss  is  independent  of 
the  signal  frequency  and  only  a  weak  function  of  length,  AH 
optical  links  use  wavelengths  in  the  near  infrared.  Since  the 
first  diode  lasers  operated  at  0.85  pm,  the  first  generation 
links  used  this  wavelength.  Second  generation  links  operate 
at  longer  wavelengths,  1.3  and  1.55  pm,  where  the  optical 
losses  are  lower  by  about  a  factor  of  4  and  10  respectively. 
Further  at  these  wavelengths  the  dispersion  is,  or  can  be 
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made  zero,  thereby  increasing  the  length-bandwidth  product 
of  the  fiber. 

All  analog  links  in  use  today  convey  the  signal  via  inten¬ 
sity  modulation  of  the  optical  carrier.  The  two  general  ways 
of  imposing  the  intensity  modulation  are  shown  in  Figure  1. 
In  direct  modulation  the  signal  modulates  the  laser’s  optical 
power  directly.  Since  laser  diodes  are  the  only  laser  with 
reasonable  bandwidth  for  this  type  of  modulation,  they  are 
the  only  type  of  laser  used  for  direct  modulation.  For  exter¬ 
nal  modulation,  the  laser  is  operated  CW  and  the  desired 
intensity  modulation  is  imposed  using  a  separate  device. 
Although  there  are  a  number  of  types  of  external  modulators, 
an  integrated-optic  version  of  a  Mach-Zehnder  interferome¬ 
ter  is  the  one  that  is  used  universally  today.  Since  the  modu¬ 
lator  eliminates  the  modulation  requirement  on  the  laser, 
there  is  a  broader  choice  of  lasers  for  external  modulation. 
Because  of  their  high  fiber-coupled  optical  power  and  low 
relative  intensity  noise,  RIN,  diode-pumped,  neodymium 
doped  yttrium  aluminum  garnet,  Nd:YAG  lasers  are  the  most 
commonly  used  sources  for  externally  modulated  links. 

DC 


OUTPUT 

PHOTO 

DETECTOR 


RF- 

INPUT 


DIODE 

LASER 


1- 


FIBER 


EXTERNALLY  MODULATED  FIBER  OPTIC  UNK 

_ DC 

RF 


RF_ 

INPUT 


€ 


^modulator 

FIBERSj^ 


PHOTO 
DETECTOR 

DC 


OUTPUT 


Figure  1 :  Analog  Fiber-Optic  Link  Options 


2.  DIRECTLY  MODULATED  LINKS 

As  shown  in  Figure  2,  there  are  basically  two  types  of  diode 
laser  cavities  in  use  today.  The  Fabry-Perot  laser  is  the  sim¬ 
pler  of  the  two;  the  dominant  reflections  come  from  the 
cleaved  end  faces  of  the  semiconductor  substrate.  Optical 
powers  from  microwatts  to  watts  are  commercially  avail¬ 
able.  The  cost  of  this  type  of  laser  is  low,  driven  mainly  by 
the  market  for  lasers  in  every  compact  disk  player.  However 
the  RIN  and  linearity  do  not  meet  the  needs  of  demanding 
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analog  link  applications,  such  as  CATV.  Consequently  there 
has  been  a  substantial  effort  to  develop  a  better  diode  laser. 
The  distributed  feedback,  DFB,  laser  presently  has  higher 
linearity  and  lower  RIN  than  a  Fabry-Perot  laser.  In  the  DFB 
laser,  the  facet  reflections  are  suppressed  and  the  reflections 
come  from  a  grating  that  is  distributed  along  the  length  of  the 
laser  cavity.  This  and  other  processing  changes  have 
resulted  in  a  diode  laser  that  can  meet  the  needs  of  CATV. 
However  the  more  limited  production  and  more  complex 
structure  result  in  a  laser  of  significantly  higher  cost. 
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Figure  2:  Diode  Lasers  for  Direct  Modulation 

Link  noise  figure  is  one  measure  of  the  link  improvement 
provided  by  a  DFB  laser.  In  Figure  3  the  link  noise  figure  is 
plotted  vs.  the  average  detector  current  for  links  using  Fabry- 
Perot  and  DFB  lasers.  With  careful  setup,  the  Fabry-Perot 
based  link  can  have  a  noise  figure  as  low  as  30  dB,  whereas 
the  DFB -based  link  has  a  noise  figure  of  20  dB  over  a  wider 
range  of  operating  currents.  These  data  are  valid  for  fre¬ 
quencies  up  to  about  500  MHz.  Above  this  frequency  the 
DFB  RIN  begins  to  increase,  thereby  degrading  the  noise 
figure. 


Figure  3:  Link  Noise  Figure  vs.  Detector  Current 

Increased  RIN  is  only  one  of  the  degrading  effects  as  one 
goes  to  higher  frequency  links.  Figure  4  shows  the  increase 
in  link  loss  for  some  experimental  high  frequency  links  as 
well  as  the  estimated  link  response  if  one  were  to  use  the  lat¬ 
est  diode  lasers  that  have  been  reported  in  the  literature.  As 
will  be  seen  below,  the  increase  in  link  loss  sets  a  lower 
bound  on  the  minimum  link  noise  figure.  In  principal  any 


link  loss  can  be  overcome  via  electronic  amplifiers,  however 
in  practice  the  higher  the  link  loss  the  more  difficult  it 
becomes  to  find  an  amplifier  which  has  the  gain  and  the 
dynamic  range. 


Figure  4:  Link  Gain  vs.  Frequency  -  Direct  Modulation 

Another  important  link  parameter  is  dynamic  range.  There 
are  several  measures  of  dynamic  range  including  maximum 
signal-to-noise  floor  or  SNR,  composite  triple  beat  or  CTB 
and  intermodulation-free,  IM-free,  dynamic  range.  The  limi¬ 
tation  on  dynamic  range  is  imposed  primarily  by  the  modula¬ 
tion  device,  i.e.,  either  the  diode  laser  or  the  external 
modulator. 

There  are  several  factors  that  effect  the  dynamic  range, 
among  them  are  the  laser  structure  -  Fabry-Perot  or  DFB, 
laser  design  and  the  laser  environment  such  as  optical  reflec¬ 
tions  into  laser.  Fabry  Perot  lasers  typically  have  an  IM-free 
dynamic  range  of  85-95  dB-Hz^^  whereas  DFB  lasers  gener¬ 
ally  have  100-110  dB-Hz^^"^  of  dynamic  range.  Controlling 
the  reflections  improves  the  dynamic  range  with  either  laser 
structure,  however  it  more  effective  with  DFBs.  There  are 
applications,  especially  the  distribution  of  cable  TV  signals, 
where  additional  IM-free  dynamic  range  is  required.  A  vari¬ 
ety  of  methods  for  increasing  the  dynamic  range  have  been 
designed,  however  virtually  all  of  them  are  propriety  and  so 
have  received  little  or  no  public  disclosure. 

3,  EXTERNALLY  MODULATED  LINKS 

Virtually  all  externally  modulated  links  use  a  modulator 
based  on  a  Mach-Zehnder  interferometer  whose  basic  opera¬ 
tion  is  shown  in  Figure  5.  For  modulations  frequencies 
below  about  3  GHz,  it  is  reasonable  to  assume  that  changes 
in  the  modulation  voltage  are  slow  compared  to  the  transit 
time  of  light  in  the  waveguide  arms.  At  higher  frequencies, 
the  transit  time  effects  must  be  taken  into  account.  This 
leads  to  the  traveling  wave  modulator.  The  most  efficient 
modulation  is  achieved  when  there  is  a  match  between  the 
propagation  velocities  of  the  RF  and  optical  fields.  To  date 
modulation  up  to  40  GHz  has  been  obtained  with  such 
designs. 
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Figure  5:  Mach  Zehnder  External  Modulation 

While  link  loss  is  also  an  issue  with  externally  modulated 
links,  this  type  of  link  has  an  additional  degree  of  freedom 
which  can  be  applied  to  overcoming  the  link  loss.  Link 
modeling  has  shown  that  unlike  directly  modulated  links,  the 
gain  of  externally  modulated  links  increases  as  the  square  of 
the  average  optical  power.  Thus  a  20  dB  increase  in  optical 
power,  for  example  from  100  pW  to  10  mW,  results  in  a  40 
dB  increase  in  link  gain.  Using  this  approach,  it  is  possible 
to  have  externally  modulated  links  which  are  lossless  or  even 
have  real  power  gain.  The  top  two  data  points  in  Figure  6 
show  a  bandpass  link  with  about  10  dB  of  gain  and  a  lossless 
broadband  link.  Present  modulators  have  insufficient  respon- 
sivity  at  higher  frequencies  to  permit  obtaining  such  results 
at  higher  frequencies  using  reasonable  optical  powers,  as 
evidenced  by  the  remaining  curves  in  Figure  6.  Also 
included  are  the  estimated  link  gain  using  two  recently 
reported  modulators.  Although  both  these  modulators  should 
provide  previously  unattainable  link  performance,  neither 
has  the  link  gain  necessary  for  low  noise  performance. 

Although  the  gain  attracts  attention,  the  link’s  RF  insertion 
loss  could  easily  be  overcome  using  an  RF  amplifier.  A 
more  significant  consequence  of  the  links  with  low  loss  or 
gain  is  a  lower  link  noise  figure.  For  the  link  with  10  dB 
gain,  the  noise  figure  was  6  dB.  While  this  is  high  compared 
with  the  state  of  the  art  for  electronic  amplifiers,  it  is  a  record 
low  value  for  optical  links.  In  fact  this  value  may  be  low 
enough  that  no  further  low  noise  preamplifier  is  needed. 
Consider  an  application  where  the  modulator  will  be  used  to 
remote  the  signal  from  a  dipole  antenna  operating  around 
100  MHz.  A  6  dB  noise  figure  corresponds  to  a  minimum 
detectable  field  strength  of  2.3  pV/m  when  the  noise  band¬ 
width  is  1  MHz.  Since  the  sky  noise  at  100  MHz  produces  a 
field  of  about  4  pV/m,  the  sensitivity  of  this  link  is  below  the 
sky  noise.  Therefore  the  dipole  terminals  can  be  connected 
directly  to  the  modulator  electrodes,  without  any  need  for  an 
electronic  preamplifier.  Several  system  applications,  such 
as  cable  TV  distribution  and  radar  antenna  remoting,  require 
IM-free  dynamic  ranges  beyond  that  attainable  with  present 
modulators.  Thus  there  have  been  a  number  of  methods  pro 
posed  for  increasing  the  IM-free  dynamic  range.  Recently 


Figure  6:  Link  Gain  vs.  Frequency  -  External  Modultion 
Bridges  and  Schaffner  ()  have  written  a  link  computer  simu¬ 
lation  which  compared  the  IM-free  dynamic  range  and  noise 
figure  of  externally  modulated  links  using  any  one  of  the 
various  all-optical  linearization  techniques.  The  results  of 
their  calculations  is  shown  in  Figure  7.  For  the  laser  power 
and  other  link  parameters  they  chose,  the  dynamic  range  of  a 
simple  Mach-Zehnder  modulator,  MZM,  and  a  simple  direc¬ 
tional  coupler,  SDC,  modulator  come  out  to  be  about  the 
same.  Linearization  either  via  a  combination  of  MZMs  or 
via  multiple  electrodes  on  a  SDC  improves  the  third  order 
IM-free  dynamic  range,  but  at  the  penalty  of  increased  noise 
figure.  There  is  one  case  in  their  analysis  that  does  not 
involve  a  noise  figure  penalty.  By  operating  a  SDC  modula¬ 
tor  at  the  bias  point  where  the  third  derivative  is  zero,  the 
third  order  distortion  will  be  minimized;  it  will  not  be  zero 
because  third  order  distortion  is  also  generated  by  the  fifth 
order  term  in  the  SDC  power  series  expansion.  This  bias 
point  increases  the  second  order  distortion,  which  limits  its 
application  to  systems  whose  bandwidth  is  less  than  an 
octave,  so  the  second  order  distortion  lies  outside  the  band¬ 
width  of  interest.  The  interesting  fact  is  that  the  noise  figure 
is  not  degraded  at  this  bias  point,  in  fact  it  is  actually 
improved  slightly. 


ALL  ENTRIES  IN  dB 

*  dB  BELOW  SIGNAL  AT  INPUT,  MAKING  IMO  =  NOISE 
REFERENCE:  BRIDGES  &  SCHAFFNER 

Figure  7:  Optical  Linearization  Techniques 

To  give  an  overview  of  the  state  of  the  art  in  analog  link 
performance,  Figure  8  plots  the  published  gain  and  noise  fig¬ 
ure  for  both  directly  and  externally  modulated  links.  Also 
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shown  on  this  figure  are  two  fundamental  limits  on  link  per¬ 
formance  which  have  been  derived  by  Cox  ().  From  this  plot 
it  is  clear  that  to  reduce  the  link  noise  figure,  it  is  essential  to 
improve  the  link  gain,  i.e.  to  reduce  the  link  loss.  This  will 
most  likely  require  the  development  of  modulation  devices 
with  higher  responsivity. 


Figure  8:  Summary  of  Recent  Link  Performance  Results 

4.  APPLICATIONS 

One  of  the  first  and  probably  most  widespread  applications 
of  analog  fiber  optic  links  is  to  use  fiber  to  replace  the  coax¬ 
ial  cable  for  the  distribution  of  “cable”  TV,  CATV,  signals. 
There  are  a  number  of  ways  to  do  this,  the  method  employed 
by  AEL  0  is  shown  in  Figure  9.  There  block  diagram  is  a  lit¬ 
tle  more  complex  than  normal  because  they  are  using  much 
of  the  same  hardware  to  also  distribute  cellular  telephone 
signals.  There  approach  uses  external  modulation  through¬ 
out.  Wavelength  division  multiplexing  permits  full  duplex 
operation,  which  is  required  for  the  telephone,  and  permits 
the  CW  lasers  for  both  outgoing  and  incoming  links  to  be 
located  at  the  head  end  while  using  only  two  fibers.  By 
using  a  proprietary  linearization  scheme,  AEL  is  able  to 
exceed  the  distortion  requirements  for  CATV 


RFIN  TOOTHER 


HEAD  END  '  FIBER  ‘  REMOTE  NODE 


Figure  9:  Duplex  Fiber-Optic  CATV  -  Block  Diagram 

Distribution  of  cellular  signals  from  the  telephone  inter¬ 
face  to  the  antenna  site,  even  without  CATV  signals,  is  a 


growing  application  area  for  fiber  optics.  The  hardware 
offered  by  Allen  Telecom  Group  uses  direct  modulation  to 
remote  the  1 6  telephone  channels  of  the  standard  900  MHz 
cellular  band.  A  photograph  of  the  pole  mounted  portion  of 
their  fiber-fed  equipment  is  shown  in  Figure  10. 

Phased  array  radars  are  another  potential  application  of  ana¬ 
log  fiber  optics.  ARPA  and  Rome  Labs  contracted  Hughes 
build  a  prototype  phased  array  antenna  that  used  analog  fiber 
optics  A  block  diagram  of  the  radar  is  shown  in  Figure  11. 
This  technology  eliminated  the  change  in  beam  direction,  or 
beam  squint,  as  the  radar  center  frequency  was  changed. 
Direct  modulation  feeding  various  lengths  of  fiber  was  used 
to  implement  this  time  delay.  Fiber  optics  was  also  used  to 
distribute  the  radar  wave  form  to  all  the  subarrays.  External 
modulation  was  used  for  this  function  because  the  output  of 
a  high  power  CW  laser/external  modulator  combination 
could  be  divided,  via  an  eight  way  splitter,  to  feed  each  of 
the  subarrays. 


Figure  10:  16-Channel,  900-MHz  Fiber-Fed 
Microcellular  Base  Station 


REMOTE  StTE  .  ANTENNA 


Figure  1 1 :  ARPA/Rome  Labs/  Hughes  -  Photonic 
Phased  Array  Radar 
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1.  Introduction 

There  are  many  techniques  for  optically  carrying  an  RF 
signal  from  an  antenna  element  to  its  receiver,  thus, 
optically  isolating  the  antenna.  However,  if  the  antenna 
needs  to  be  tuned  or  the  matching  network  needs  to  be 
controlled  then  it  is  desirous  to  perform  that  function 
optically  as  well.  We  show  that  an  optically-linked 
electrically-small  loop  can  be  tuned  over  almost  an  octave 
of  bandwidth  by  using  remote  optical  control  via  an 
optically  variable  reactance.  Both  theoretical  and 
experimental  results  are  presented. 


2.  Tuning  of  Photonically  Linked  Antennas 

Connecting  optical  modulators  and  fiber  optic  links  to  an 
antenna  allows  flexibility  in  the  system  design  beyond  that 
available  by  conventional  systems.  This  is  due  to  two 
inherent  properties  of  the  optical  links  and  modulators:  1) 
The  broad  bandwidth  of  optical  modulators  and  optical  links 
can  be  very  large;  and  2)  The  effective  impedance  of  the 
optical  modulator  is  reactive  with  a  very  small  (a  few  ohms) 
resistive  component. 

The  broad  bandwidth  allows  the  potential  for  connecting  a 
broadband  antenna  to  the  link  and  remoting  the  receiver 
electronics.  For  example,  in  a  measurement  one  might 
connect  a  probe  antenna  with  more  than  a  decade  of 
bandwidth  (such  as  the  cavity  backed  spiral  antenna)  to  the 
optical  link.  Using  photonic  modulators  and  links  for  this 
only  makes  use  of  the  broad  bandwidth  and  low  loss  of  the 
optics.  It  still  uses  conventional  antennas  which  must  be 
mechanically  supported  and  can  have  a  fairly  large 
interaction  with  the  antenna  or  system  under  test. 


An  electrically  short  dipole  antenna  or  electrically  small 
loop  can  be  connected  to  an  MZI  with  a  matching  /  tuning 
circuit  such  that  the  overall  circuit  Q  is  very  high  and  the 
voltage  gain  of  the  circuit  can  be  20  dB  or  more.  This  is 
possible  because  the  resistive  losses  in  an  electrically  short 
antenna  due  to  radiation  and  attenuation  are  just  a  few 
Ohms,  Hence,  the  impedance  of  the  antenna  can  be 
combined  with  the  impedance  of  the  MZI  and  a  tuning 
circuit  to  form  a  series  resonant  circuit  with  only  a  few 
Ohms  of  loss.  This  resonant  circuit  can  then  be  tuned  over 
a  large  band  by  changing  the  capacitance  of  the  tuning 
circuit.  Figure  1  presents  a  block  diagram  of  this  concept 
using  a  small  loop  antenna  connected  through  a  matching 
network  to  an  optical  modulator.  Figure  2  presents  a 
schematic  diagram  for  the  antenna  system  of  Figure  1. 
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Figure  1  -  Block  diagram  of  a  photonically  connected 
and  controlled  electrically  small  loop  antenna. 


The  highly  reactive  impedance  of  the  MZI  modulator 
allows  one  to  construct  a  field  sensor  system  that  has  high 
sensitivity  while  small  in  both  physical  size  and 
electromagnetic  cross  section.  As  mentioned  above,  the 
MZI  impedance  is  generally  capacitive  with  some  resistive 
losses  due  to  the  small  diameter  of  the  electrodes  and  their 
leads.  This  difference  in  the  load  lends  itself  to  narrow  band 
matching  with  small  antennas,  such  as  short  dipoles  or  small 
loops. 


Conventional  approaches  for  tuning  the  antenna  /  matching 
network  described  above  require  electrical  (hard  wire) 
connectivity  to  the  tuning  circuit.  This  is  not  appropriate  for 
optical-based  sensor  systems  because  it  defeats  the  passive 
nature  of  the  fiber  optical  link.  However,  with  the  use  of 
Toyon’ s  optically  variable  capacitor  (OVC'*'^,  it  is  possible 
to  control  the  reactance  of  devices  optically,  while 
maintaining  the  all-dielectric  characteristic  of  the  link.  The 
capacitance  can  be  varied  over  almost  a  decade.  Tuning 
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circuits  have  been  built  that  allow  an  antenna’s  resonance  to 
be  swept  over  nearly  an  octave  of  bandwidth  by  changing 
the  intensity  of  light  incident  on  the  tuning  circuit. 


Loop  Optical  MZI 

Antenna  Tuning  Model 


Circuit 


Figure  2.  Schematic  diagram  of  the  equivalent  circuit  for 
the  photonically  connected  antenna  of  Figure  1. 

To  test  the  complete  photonic  link  and  control  of  a  loop 
antenna  as  shown  in  Figure  1,  we  built  a  half-loop  antenna 
above  a  ground  plane  and  reactively  matched  (See  Figure 
2)  it  to  a  MZI  modulator  using  an  OVC™,  The  diameter  of 
the  loop  was  1.5  cm.  A  half  loop  antenna  was  used  because 
we  planned  to  illuminate  it  in  a  TEM  (Crawford)  cell  [1] 
and  needed  to  feed  it  against  a  ground  plane. 

Before  performing  the  measurements,  we  modeled  the 
entire  system  in  our  PHLASH™  code.  PHLASH  is  a 
PCAVindows-based  system  engineering  tool  for  predicting 
the  performance  of  sensor  and  antenna  systems  which 
include  a  fiber  optic  link  in  their  architecture.  This 
particular  link  included  an  MZI  with  a  of  5.6  volts.  The 
link  was  driven  with  only  6  mW  of  laser  power.  Hence,  the 
link  was  not  designed  for  optimal  link  gain  or  sensitivity. 
Figure  3  shows  the  predicted  link  gain  values  for  the  two 
extreme  values  of  our  OVC^  For  the  dark  case  the 
capacitance  was  19  pF  and  the  loop  was  tuned  to  248  MHz. 
For  the  case  of  maximum  light,  the  OVC™  produced  a 
capacitance  of  3  pF  and  the  loop  was  tuned  to  408  MHz. 

Figure  4  shows  the  measured  gain  for  the  tuned  half-loop 
link.  The  measured  gain  is  uncalibrated  because  of  the  TEM 
cell  measurements.  The  tuning  range  and  the  Q  values  were 
of  principal  interest  to  us.  From  Figure  4  the  tuning  range 
is  seen  to  go  from  242  MHz  to  413  MHz  or  71%  which  is 
greater  than  that  which  we  had  predicted.  The  measured  Q 
is  seen  to  be  fairly  close  to  the  predicted  value. 

The  above  test  was  not  optimized  for  link  sensitivity.  It  was 
done  to  prove  that  a  small  antenna  could  be  tuned  and 
connected  photonically.  Subsequent  to  this  test,  we  used 


PHLASH  to  predict  what  the  link  gain  and  sensitivity  for  a 
loop  antenna  with  a  4-cm  loop  diameter.  We  optimized  the 
photonic  link  parameters  to  reflect  the  state-of-the-art 
available  today.  We  used  a  MZI  with  a  of  0.75  volts  and 
a  laser  with  1 50  mW  of  output  power. 


Frequency  (MHz) 


Figure  3.  Predicted  link  gain  for  the  1 .5-cm  half-loop 
antenna  reactively  matched  to  the  MZI-based  link 
showing  the  two  extreme  states  of  the  OVC™  tuning 
capacitor. 
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Figure  4.  Measured  link  gain  (uncalibrated)  for  the  1.5- 
cm  diameter  half-loop  antenna  reactively  matched  to  the 
MZI-based  link. 


Figure  5  presents  the  predicted  link  gain  for  the  4-cm  loop 
and  optimized  link.  In  this  study,  the  same  OVC™  was  used. 
The  larger  antenna  caused  the  tuning  range  to  be  lowered  to 
180  to  300  MHz.  The  tuning  bandwidth  was  66%  which  is 
the  same  as  that  predicted  with  the  smaller  antenna. 


288 


However,  in  this  case,  the  link  power  gain  is  predicted  to  be 
about  22  dB.  Figure  6  shows  that  this  translates  into  a 
minimum  detectable  field  of  less  than  10  pV/meter  at  the 
tuned  value.  It  is  interesting  to  note,  for  the  light-off  case, 
that  the  minimum  sensitivity  stays  lower  than  100  pV/meter 
from  below  100  MHz  to  above  350  MHz. 


Figure  5.  Predicted  link  gain  for  a  4-cm  diameter  loop 
antenna  reactively  matched  to  an  optimized  MZI-based 
photonic  link.  The  gain  is  shown  for  the  two  extreme 
settings  of  the  OVC™  capacitor. 

Figure  7  presents  the  predicted  dynamic  range  in  a  10  kHz 
bandwidth  for  this  system.  The  spurious-free  dynamic  range 
(SFDR)  is  nearly  90  dB  everywhere  except  at  the  point  of 
maximum  link  gain.  The  dynamic  range,  as  it  is  calculated 
for  the  photonic  link,  is 

SFDR  =  ^{IP3  - 

=  j(/P3  +  174  dBm  -  NF  -  101og(5)  -  G^)  , 

where  IP3  is  the  third  order  intercept  point  for  the  MZI, 
Po.mds  is  the  minimum  detectable  signal  at  the  output  of  the 
link,  NF  is  the  link  noise  figure,  Ga  is  the  link  gain,  and  kT 
=  -174  dB. 

The  minimum  detectable  signal  at  the  output  of  the  link  is 
defined  as 


Figure  6.  Predicted  minimum  detectable  electric  field 
for  the  4-cm  diameter  loop  antenna  reactively  matched  / 
tuned  to  an  optimized  MZI-based  photonic  link. 

From  the  expression  for  SFDR,  it  is  clear  that  as  the  link 
gain  goes  up,  the  dynamic  range  goes  down.  Hence,  design 
tradeoffs  can  be  performed  to  maximize  dynamic  range  at 
the  expense  of  link  gain. 


Frequency  -  MHz 


Figure  7,  Predicted  dynamic  range  for  the  4-cm 
diameter  loop  antenna  reactively  matched/tuned  to  the 
optimized  MZI-based  photonic  link. 


P^^j^idBm)  =  -\lAdBm  +  10  log5  + 
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3.  Summary 


In  this  paper  we  presented  experimental  data  demonstrating 
that  an  photonically-linked  electrically-small  loop  antenna 
can  be  tuned  over  a  71%  bandwidth  using  an  optically 
variable  capacitor.  We  also  showed  theoretically  that  this 
technique  can  potentially  give  a  very  sensitive  small  receive 
antenna  that  is  completely  isolated  optically.  This  bodes 
optimistically  for  building  antenna  near-field  or  chamber 
probes  that  do  not  greatly  perturb  the  fields  they  are 
measuring. 
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ABSTRACT 

In  this  paper,  a  simple  model  is  presented  to  ana¬ 
lyze  the  performance  of  optical  fiber  Time-Division 
Multiple  Access  (OF-TDMA)  systems.  The  sensi¬ 
tivity  analysis  of  OF-TDMA  receivers  is  carried  out 
by  taking  account  of  the  effects  of  optical  transmit¬ 
ter’s  extinction  ratio,  photodetector  and  electrical 
amplifier  noise.  It  is  shown  that  the  non-zero  ex¬ 
tinction  ratio  is  a  major  impairment  which  signif¬ 
icantly  degrades  the  receiver  sensitivity  when  the 
number  of  users  N  is  large.  Compared  with  the 
single-user  system,  an  OF-TDMA  system  requires 
far  lower  extinction  ratio  to  alleviate  the  perfor¬ 
mance  degradation  when  N  is  larger. 

1.  INTRODUCTION 

Optical  fiber  Time-Division  Multiple  Access 
(TDMA)  is  a  synchronous  scheme  which  shares  the 
huge  bandwidth  of  single-mode  optical  fibers  by  di¬ 
viding  time  into  slots  and  specifying  distinct  time 
slots  of  a  frame  for  different  users.  Optical  fiber 
TDMA  (OF-TDMA)  can  offer  a  high  throughput 
by  using  optical  signal  processing  and  does  not  suf¬ 
fer  from  the  cumulative  delay  as  encountered  when 
using  asynchronous  methods  [1].  Thus,  OF-TDMA 
networks  are  attractive  to  fixed  data-rate  or  homo¬ 
geneous  services,  continuous-type  traffic,  and  high- 
data-rate  terminals,  such  as  digital  TV  or  HDTV 
distributions  and  broadcasting,  digital  telephone, 
and  circuit- switching  based  Synchronous  Transfer 
Mode.  In  order  to  feasibly  design  OF-TDMA  net¬ 
works,  the  analysis  of  system  performance  would 
be  required.  In  this  paper,  we  develop  a  theo¬ 
retical  model  for  OF-TDMA  systems  and  present 
an  analytical  expression  to  the  receiver  sensitivity 
of  OF-TDMA  systems,  by  which  fast  performance 
evaluation  can  be  easily  carried  out. 


2.  SYSTEM  MODEL 

The  block  diagram  of  an  OF-TDMA  network  is 
illustrated  in  Figure  1.  A  common  clock  of  rate  l/T 
is  used  to  synchronize  all  the  OF-TDMA  transmit¬ 
ters.  Then  N  optical  transmitters  with  the  same 
wavelength  operate  at  a  data  rate  =  IfT,  and 
they  are  assumed  to  have  the  identical  peak  out¬ 
put  power.  Short  optical  pulses  of  width  T jN  are 
generated  by  each  electrooptic  (EO)  modulator  at 
a  transmitter  when  data  bits  “ONEs”  are  issued. 
Each  modulated  pulse  stream  is  further  delayed  to 
the  assigned  slot  per  data  frame,  and  it  is  multi¬ 
plexed  with  other  N  -1  streams  through  &  N  X  N 
optical  star  coupler  with  uniform  outputs.  On  the 
other  hand,  an  optical  clock  signal  is  distributed 
to  N  receivers  through  separate  fibers.  In  order 
to  achieve  the  correct  frame  synchronization,  the 
length  of  optical  fiber  from  each  station  (contain¬ 
ing  a  transmitter-receiver  pair)  to  the  star  coupler 
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Figure  I:  Block  diagram  of  an  OF-TDMA  net¬ 
work  using  optical  processing. 
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must  be  properly  chosen  [1].  At  each  receiver,  the 
optical  clock  stream  is  appropriately  delayed  by  a 
tunable  optical  delay  line  (TODL)  as  designed  in 
[2],  and  therefore,  is  superimposed  to  the  incoming 
OF-TDMA  signal  in  the  desired  time  slot  by  using 
a  2  X  1  optical  combiner  (see  Figure  1).  The  com¬ 
bined  optical  signals  are  then  converted  into  the 
electrical  one  at  a  wideband  photodetector  (PD) 
followed  by  the  electrical  amplifier  (E-AMP)  and 
low  pass  filter  (LPF).  Finally,  a  high-speed  thresh¬ 
old  detector  (THRES-DET)  is  used  to  regenerate 
the  data  sent  from  the  desired  transmitter. 

For  the  convenience  of  analysis,  we  assume  that 
an  optical  transmitter  consists  of  a  continuous- 
wave  (CW)  laser  and  an  ultrafast  EO  modulator. 
Then  the  extinction  ratio  r  of  optical  transmitters 
is  equal  to  that  of  EO  modulators,  while  for  the 
mode-locked  laser  scheme  [1],  we  can  stiU  use  an 
equivalent  r  which  is  normally  very  small  [3].  For 
convenience,  dispersion  and  distortion  of  fibers  are 
neglected,  and  OF-TDMA  decoders  are  treated  as 
ideal  incoherent  optical  signal  processors.  More¬ 
over,  optical  pulses  at  each  photodetector  are  as¬ 
sumed  to  be  ideal  rectangular. 

Since  all  the  transmitters  are  well  synchronized, 
the  multiplexed  signal  Y (/)  at  the  output  of  a  iV  x 
N  optical  star  coupler  can  be  written  as: 

A/^  f  -1-00  \ 

>'(')  =  El  E  (1) 

i=l  [^j=-oo  J 


where  denotes  the  i-th  user’s  optical  power  in 
the  j-th  OF-TDMA  frame.  It  takes  on  a  value  E 
corresponding  to  the  data  bit  1  sent  or  rE  cor¬ 
responding  to  the  data  bit  0  sent.  Tp  =  T/N  is 
the  slot  width.  Here  hp{t)  represents  a  rectangular 
pulse  in  a  time  slot,  i.e.. 


otherwise 


(2) 


At  receiver  1,  TDMA  decoder  1  adds  io(0  to  the 
received  TDMA  signal  because  the  desired 

pulse  and  delayed  clock  signal  occupy  the  same 
time  slot  per  frame.  The  output  of  decoder  1  is 


given  by 

pi{t)  = 

e{Yi{t)  +  Yo{t)} 

-\-GO 

7=r  — CO 

N 

+  Y^f-LKit-jT-{i-l)Tp) 

i=2 

+00 

=  Y,  bmhp{t-mTp)  (4) 

m=—oo 

where  i  is  a  constant  integer,  or  A, 

and  T  =  rC  or  C,  here  A  and  C  are  peak  optical 
power  of  TDMA  and  optical  clock  signals  at  the 
PD,  respectively,  bm  is  the  received  optical  power 
in  the  m-th  slot,  which  can  take  one  of  four  levels 
depending  on  the  OF-TDMA  pulse  sequence  pat¬ 
tern  [4]. 

Since  the  received  optical  pulse  is  ideal  rectan¬ 
gular,  the  intersymbol  interference  between  optical 
pulses  is  zero.  Then  we  can  simply  need  to  consider 
the  isolated  optical  pulse  p{t)  =  bQhp{t)  instead  of  a 
sequence  pi{t)  when  we  analyze  the  noise.  For  the 
convenience  of  analysis,  the  filter  is  assumed  to  be 
an  ideal  low  pass  filter  (LPF)  and  the  intersymbol 
interference  at  the  LPF  is  neglected. 

3.  RECEIVER  SENSITIVITY  OF 
OF-TDMA  SYSTEMS 

In  this  paper,  we  consider  the  APD-based 
optical  receivers  using  a  FET-front-end  tran¬ 
simpedance  amplifier  of  which  the  output  noise 
power  <  >  is  written  as  [4] 


Without  any  loss  of  generality,  we  treat  user  1  as 
the  desired  user.  Considering  periodical  repetition 
characteristics  of  clock  signal  and  TDMA  frames, 
we  can  write  the  optical  TDMA  signal  at  the  input 
of  decoder  1  as 

H-co  ^  N  ^ 

uw=  E 

j'  —  —  0O  =  l  1 

Similarly,  the  clock  signal  Ib(^)  at  the  input  of  de¬ 
coder  1  can  be  expressed  as 

+  00 

Yo{t)=  Y  <Php{t-j'T)  (3) 

j'=r-00 


9m  \  3  / 

where  k  is  Boltzmann’s  constant,  0  is  the  absolute 
temperature,  and  B  is  the  LPF  bandwidth,  e  is 
the  charge  of  an  electron,  Cj  is  the  equivalent  total 
inpnt  capacitance,  Rb  is  the  bias  or  load  resistance, 
and  Rj  IS  the  feedback  resistance.  F  is  a  numerical 
constant,  fc  is  the  1 //-noise  corner  frequency.  Ig 
is  the  FET  gate  leakage  current,  and  Qm  is  the  FET 
transconductance  [5]. 
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At  the  output  of  a  LPF,  the  power  of  shot  noise 
at  the  sampling  time  f  =  0  is  given  by  [4] 


4eRG^F{G)boB 

TT 

y27rrpS  sin(a;)  2eRG^F{G) 

Jo  X  TT^Tp 

•6o  [1  -  cos(27rrp5)]  (6) 


and  the  average  signal  voltage  <  Vs{t)  >  at  t  =  0 
is  written  as 


<  Vs{0)  >  = 


2RGbo  sm{y) 


TT 


L 


dy 


(7) 


where  R  =  ^  is  the  photodetector  responsivity  [5]. 
G  is  the  average  value  of  the  avalanche  gain,  and 
F{G)  =  KG  +  (2  -  1/G)(1  ”  K)  is  the  avalanche 
excess-noise  factor,  here  K  is  the  ratio  of  the  ioniza¬ 
tion  coefficients  of  electrons  and  holes  of  the  APD. 

The  noise  power  due  to  the  APD  dark  current  is 
then  written  as 

<  nj  >=  2eB  {Uu  +  G^F{G)Idm}  (8) 

where  Idu  and  Idm  are  the  dark  current  compo¬ 
nents  which  is  not  subject  to  and  undergoes  the 
avalanche  multiplication  process,  respectively  [5]. 

After  the  tediously  mathematical  treatment,  the 
minimum  peak  optical  power  required  to  ensure  a 
given  bit  error  rate  (BER)  at  OF-TDMA  receivers 
is  thus  expressed  as 


P 

-*■  m 


(Nr  —  r  +  q  +  2) 


U1FU2 

1 


+ 


(9) 


where  q  =  is  the  excess  clock-to-TDMA  signal 
ratio,  and  Q  =  6  for  BER  =  1  x  10“^. 

Ug  =  2eB  +  G'^F{G)Idm]  +  <  nl  >  (10) 

rr  / f  ^eRG^F{G)B 
U\  =  {Nr  —  r  +  g  +  2)  < - - - 

«/o  ^ 


1  -  cos{2‘KTpB) ) 

/ 

(11) 

U2 

Nr  —  r  +  q  +  2 

(12) 

Uo 

2RG{Nr  —  r  -t-  g  +  2) 

TT 

rrTpB  sin(?/)  , 

•  /  .  ^ 

(13) 

Jo  y 

and 

{Nr  +  q-\-l)U2, 

(14) 

Nr  —  r  -|-  g  4-  2 

The  OF-TDMA  receiver  sensitivity  Sr 

is  thus 

defined 

as 

5,.  =  lOlogio  (dBm) 

(15) 

4.  NUMERICAL  RESULTS 
AND  DISCUSSIONS 

At  present,  mature  optical  receivers  can  have 
a  wide  bandwidth  of  10  GHz  [6].  We  can  then 
reasonably  choose  jB  =  10  GHz  and  the  chip  rate 
IjT^p  =  10  Gbit/s.  The  other  parameters  used  for 
the  calculation  of  receiver  sensitivity  are:  Ct  =  0.5 
pf,  Rh  =  10  Mfi,  Rf  =  390  fi,  A  =  1.55  /^m,  and 
the  parameters  of  InGaAsP  APD  and  GaAs  MES- 
FET  are  taken  from  [6].  Since  1/Tp  is  fixed,  the 
data  rate  l/T  decreases  as  N  increases. 

Figures  2a  and  3  show  the  effects  of  the  non¬ 
zero  extinction  ratio  v  on  the  sensitivity  of  OF- 
TDMA  receivers.  For  a  fixed  r,  the  OF-TDMA 
receiver  sensitivity  varies  with  the  number  of  users 
JV,  which  gets  worse  as  N  increases,  because  the 
effect  of  the  accumulated  DC  optical  power  caused 
by  the  non-zero  r  is  more  severe  when  r  or/and  N 
becomes  larger.  For  example,  the  degradation  of 
optimum  receiver  sensitivity  is  17.6  dBm  when  N 
increases  to  2000  for  r  =  0.01,  whereas  the  sensi¬ 
tivity  degradation  is  only  5.2  dBm  for  r  =  0.001 
and  N  to  2000.  Thus,  for  a  given  N  the  lower 
T  is  required  to  alleviate  the  performance  degra¬ 
dation.  Since  using  the  mode-locked  laser  scheme 
as  given  in  [1]  can  achieve  a  very  low  equivalent 
r,  for  the  fixed  output  peak  power  of  transmit¬ 
ters,  this  scheme  can  offer  more  users  than  the  CW 
laser  scheme.  Figure  2b  shows  the  effect  of  optical 
clock  amplitude  on  the  sensitivity  of  OF-TDMA  re¬ 
ceivers,  which  becomes  more  significant  for  a  lower 
A,  while  the  degradation  is  smaller  when  N  is 
larger.  It  means  that  the  receiver  sensitivity  can 
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be  improved  by  separating  OF-TDMA  signal  from 
clock  pulses  for  the  signal  decision  (i.e.,  q  =  —1),  at 
the  expense  of  increasing  the  receiver  complexity. 
It  is  important  to  note  that,  as  N  increases,  the 
optimum  G  decreases  and  the  suboptimum  G  win¬ 
dow  becomes  sharp.  This  requires  the  more  critical 
control  of  G,  Moreover,  from  Figure  2a,  one  can 
find  that  there  would  be  no  optimum  G  (>  1)  if  iV 
becomes  very  large  for  r  =  0.01,  which  means  that 
there  is  no  advantage  to  use  APDs  in  this  case. 


Figure  2:  The  OF-TDMA  receiver  sensitivity 
Sr  versus  the  average  APD  gain  G  (a)  as  a 
function  of  N  and  r  for  ?  =  0  and  (b)  as  a 
function  of  q  and  N  for  r  —  0.005. 

5.  CONCLUSION 

An  explicit  expression  for  the  OF-TDMA  re¬ 
ceiver  sensitivity  has  been  presented  in  this  pa- 


Figure  3:  The  OF-TDMA  receiver  sensitivity 
Sr  versus  the  number  of  users,  iV,  as  a  function 
of  r  for  5  =  0  and  (?  =  6. 

per,  by  which  the  fast  performance  evaluation  can 
be  easily  obtained.  Further  improvement  on  the 
proposed  model  is  to  involve  the  effects  of  pulse 
spreading  and  distortion  as  well  as  timing  jitter. 
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ABSTRACT 

A  technique  for  reducing  the  error  floor  of 
differential  phase-shift-keying  (DPSK)  receivers 
impaired  by  phase  noise  is  introduced  and  ana¬ 
lyzed.  In  the  proposed  DPSK  receiver,  the  band¬ 
width  of  integrate-and-dump  (I&D)  filters  are  de¬ 
creased  so  that  multiple  samples  per  symbol  du¬ 
ration  of  the  received  signal  are  made  available  to 
the  post-detection  and  decision  circuitry.  The  pro¬ 
posed  post-detection  circuitry,  hereafter  referred 
to  as  one-bit- shifted  expanding-window  (OBS- 
EW)  post-detection  processing  scheme,  correlates 
the  acquired  samples  that  are  centered  about  the 
signal  transition  point  in  a  two-symbol  signal¬ 
ing  interval.  For  a  phase-noise-limited  operation, 
it  is  shown  that  the  proposed  OBS-EW  scheme 
achieves  an  error  floor  that  is  significantly  smaller 
than  that  of  a  conventional  DPSK  receiver. 

1.  DPSK  Signaling  Format 

DPSK  receivers  provide  phase-coherent  de¬ 
tection  of  differentially-encoded  signals  in  the  ab¬ 
sence  of  phase  synchronization  when  the  phase 
ambiguity  at  the  receiver  is  assumed  to  be  con¬ 
stant  for  at  least  two  consecutive  symbol  intervals 
[1,  2].  This,  in  turn,  reduces  receiver  complex¬ 
ity,  which  ultimately  results  in  a  reduction  in  the 
receiver  cost.  Optical  DPSK  receivers  have  re¬ 
ceived  a  great  deal  of  attention  in  recent  years 
(for  example,  see  [3,  4,  5]).  The  performance  of 
a  DPSK  receiver,  however,  is  quite  sensitive  to 
any  fluctuations  in  the  signal  phase  that  may  vi¬ 
olate  the  aforementioned  constant-phase  assump¬ 
tion.  In  practice,  phase  instability  that  is  com¬ 
monly  modelled  as  a  Brownian  motion  is  an  ever¬ 
present  feature  of  an  optical  or  RF  receiver.  This 
impairment  ultimately  imposes  an  error  floor  on 


the  performance  of  DPSK  systems,  see  [3].  To 
combat  this  effect,  a  number  of  techniques  have 
been  proposed  [4,  6, 7,  8].  In  [6],  a  post-detection 
strategy,  hereafter  referred  to  as  fixed-window 
(FW)  processing,  was  introduced,  where  the  inte¬ 
gration  interval  of  I&D  filters  of  a  DPSK  receiver 
is  decreased.  This  results  in  the  acquisition  of 
multiple  samples  per  symbol  time.  In  this  paper, 
an  alternate  scheme  is  introduced  where  the  sam¬ 
ples  that  are  centered  about  the  signal  transition 
point  in  a  two-symbol  interval  are  correlated.  In 
a  DPSK  receiver,  the  intermediate  frequency  (IF) 
version  of  a  DPSK  signal  at  the  receiver  is  given 
by 

=  A  cos  [oJiFi  +  (t>{t)  -I-  0]  -I-  n{t) 

(1) 

where  A  denotes  the  signal  amplitude  in  V, 
is  the  intermediate  frequency  in  rad/sec,  ^>(0 
signifies  the  total  phase  noise  at  the  receiver, 

03 

and  4){t)  =  -  kT)  is  the  information- 

fc=  — oo 

bearing  phase  process  with  <l>k  denoting  the  itth 
DPSK  symbol  taking  on  {O.^r}  with  equal  prob¬ 
ability,  T  representing  the  symbol  interval,  and 
P{t)  denoting  a  non-return-to-zero  (NRZ)  pulse 
with  unit  amplitude  and  a  duration  of  T  sec. 
Moreover,  e  is  an  unknown  (random),  but  con¬ 
stant  phase  that  is  uniformly  distributed  on  n] 
and  n{t)  is  an  additive  white  Gaussian  noise 
(AWGN)  with  a  two-sided  power  spectral  den¬ 
sity  (PSD)  of  ^  V'^fHz,  denoting  the  receiver 

t 

thermal  noise.  Also,  =  /  /i(r)rfr,  where 
/i(t)  is  a  white  Gaussian  noise  process  with  two- 
sided  PSD  level  27rAz/  in  {radj j Hz  [Ai/  de¬ 
notes  the  fuU-width-at-half-maximum  (FWHM) 
linewidth  Au  (in  Hz)  of  the  oscillator  and  flicker 
noise  is  assumed  to  be  negligible].  Moreover, 
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n{t)  =  •\/2nc(<)  cos  (w/ft)  —  \/5nc(i)  sin  Next, 

we  proceed  to  provide  expressions  for  A  and  ^ 
for  the  case  of  optical  heterodyne  detection.  In 
this  case,  the  above  signal  represents  the  amplified 
voltage  across  the  photodetector’s  load  resistance. 
Hence  [5,  9],  A  =  in  F  and  ^  = 

GAqr^RPio  in  V^/Hz;  /[  <  |/|  <  /h,  where  r  is 
the  resistance  in  n  of  the  photodetector’s  load  re¬ 
sistor,  Ga  is  the  power  gain  of  the  IF  amplifier,  q 
is  the  charge  of  an  electron,  R  is  the  photodetec¬ 
tor’s  responsivity  in  amps/watt,  Plo  is  the  power 
of  the  local  laser,  P,  is  the  received  optical  signal 
power,  and  /i,  and  fa  denote  the  3  dB  frequencies 
in  Hz  (about  the  IF  frequency)  of  the  IF  ampli¬ 
fier’s  response.  Based  upon  the  above  definition 
of  n{t)  and  assuming  that  the  amplifier  response 
is  symmetrical  about  the  IF  frequency,  nc(0  and 
nj(t)  may  be  modelled  as  two  independent,  identi¬ 
cally  distributed  Gaussian  random  processes  with 
two-sided  power  spectral  density  levels  and 

respectively,  which  are  given  by  ^  = 

^  =  GAqr^RPLO  in  VVHz;  o  <  |/|  < -^^  Hz. 
Next,  we  resort  to  a  complex  representation  of  the 
DPSK  receiver  operation.  To  that  end,  we  have 

xiF{t)  =  Re{YiF{t)  +  (2) 

where  Y/f  (<)  =  +  ^’1  and 

Z(t)  =  V2[ncit)  + jn,(t)].  The  upper  case  letters 
are  used  hereafter  to  represent  complex  signals. 
The  IF  signal  is  processed  by  the  inphase  and 
quadrature  phase  (I&Q)  processing,  followed  by 
the  I&D  circuitry.  This  may  be  viewed  as  a  shift 
to  baseband  followed  by  an  integration  of  the  IF 
signal.  That  is,  we  need  to  be  concerned  with 
Y (<)  =  =  A  exp  +  <f>{t)  +  0]}  and 

Z{t).  At  this  point,  we  need  to  describe  the  oper¬ 
ation  of  the  I&D  device.  First,  the  symbol  interval 
T  is  divided  into  N  equally-spaced  subintervals, 
hereafter  referred  to  as  the  chip  interval,  each  of 
duration  %  =  T/N.  The  I&D  circuit  then  provides 
an  integration  over  N  chip  intervals  of  the  signal 
at  the  output  of  the  I&Q  unit  for  later  processing, 
as  in  [6].  Hence,  for  every  symbol  interval,  N 
distinct  samples  are  offered  to  the  post-detection 


device.  Let  the  integration  interval  Tj  be  substan¬ 
tially  smaller  that  the  correlation  time  of  the  phase 
noise.  The  resulting  sample  for  the  kth  signal¬ 
ing  interval  and  the  Ith  subinterval,  is  given 
by:  where  7  =  AT,, 

denotes  the  Ith  sample  of  the  phase  noise  in  the 

IkT+IT, 

^h  signaling  interval,  and  z}  ’  =  f  Z(t)dt. 

kTA-{\-l)T. 

n.  Post-Detection  Processing  Schemes 
and  Error  Floor  Calculation 

Let  be  the  decision  variable  of  the  ith 
post-detection  processing  technique  for  the  ^h 
signaling  interval.  For  the  FW  scheme[6]. 


1=1 

For  the  proposed  OBS-EW  scheme  (see  Fig.  1), 

=  (4) 

1=1 


where  M  is  a  design  parameter  that  may  be 
used  to  optimize  performance.  Since  M  may 
be  varied  for  a  fixed  N,  the  proposed  scheme 
may  be  referred  to  as  an  expanding  window 
detection  technique.  An  optimum  receiver  for 
the  above  post-detection  processing  schemes,  in 
the  absence  of  phase  noise,  then  suggests  the 
following  decision  rule:  0;  i  g  {1,2} 


for  bipolar  NRZ  binary  signaling.  The  re¬ 
ceiver  error  floor  for  the  ith  DPSK.  receiver 

where 

Let 


is  EF^'^  =  Pr|f:cos  <o|, 


N  i=l 
M  i  =  2 


Pi  -  ^  and  L;  =  ( 

[N-l+l  t  =  2  \ 

=  [AV>i,i,  AV’2,2,"-,  AV’Jv,iv]^ 

=  [AV’i.Af,  AV'2,Ar-i,  •••,  AV’A/,Af-M+i]\  and 
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With  some  effort,  it  may  be  shown  that 


i  €  {1,2},  Li  >  1, 

(5) 

where  is  the  nth  eigenvalue  of  the  covariance 
matrix  with 


^(1) 


Jl  J2 


=/3min  (ii,i2)+ 

/3min  (iV  +  ji,A^  +  ;2)- 

^mm{ji  +  N,j2)- 

/9min(ji,j2  +  N). 


(6) 


and 


Ki2) 


31,32 


=/3mm  {ji  +  NJ2  +  N)+ 

/3mm(N  -ji  +  l,N  -j2  +  1)- 
/?niin(7V  -ji  +  1,  j2  +  N)- 
-j2  +  l,ii  +  N). 

(7) 


ni.  Significance  of  the  Proposed  Architecture 

First,  in  Fig.  2,  the  error  floors  are  depicted 
as  a  function  of  N.  In  this  case,  as  expected, 
the  error  floors  in  general  improve  (i.e.,  decrease) 
as  a  result  of  increasing  N.  Recall  that  an  in¬ 
crease  in  N  directly  increases  the  bandwidth  of 
the  I&D  filters,  which  in  the  absence  of  additive 
noise  (phase-noise-limited  operation)  does  not  in¬ 
crease  the  bit  error  rate.  A  rather  remarkable  re¬ 
sult  is  the  substantial  improvement  in  the  error 
floor  that  is  gained  by  increasing  N  for  the  OBS- 
EW  scheme.  This  confirms  our  earlier  conjecture 
as  to  the  effectiveness  of  the  OBS-EW  in  reduc¬ 
ing  the  error  floor  of  a  DPSK  receiver.  In  fact. 


as  N  is  increased,  the  FW  scheme  offers  little  dr 
no  improvement  in  error  performance,  whereas 
the  OBS-EW  scheme  offers  an  exponential-like 
decrease  in  the  error  floor  with  an  increase  in 
N.  This  result  is  of  significant  importance  to  the 
implementation  of  nearly  noiseless  DPSK  opti¬ 
cal  communications  systems  that  are  impaired  by 
phase  noise  due  to  non-ideal  laser  sources.  In 
Fig.  3,  the  error  floors  of  the  OBS-EW  and  FW 
receivers  are  depicted  as  a  function  of  linewidth- 
symbol-duration-product  (LSDP).  Once  again,  the 
OBS-EW  scheme  yields  a  performance  that  is  far 
superior  to  the  FW  scheme  for  all  values  of  iV  >  1 
and  for  a  wide  range  of  phase  noise  levels.  The 
effect  of  increasing  N  on  the  error  performance 
for  the  FW  scheme  seems  to  be  rather  subtle.  This 
is  the  main  reason  the  plots  for  different  values  of 
N  for  the  FW  scheme  in  Fig.  3  almost  overlap. 
However,  for  the  OBS-EW  scheme,  an  increase 
in  N  has  a  significant  impact  on  the  error  perfor¬ 
mance.  This  is  an  encouraging  result,  which  un¬ 
derscores  the  effectiveness  of  the  OBS-EW  tech¬ 
nique  in  reducing  the  error  floor  of  a  DPSK  re¬ 
ceiver  impaired  by  phase  noise. 


T 


TT 


TT 


6X, 


Fig.  1.  The  integration  intervals  for  the  proposed 
one-bit-shifted  expanding -window  (OBS-EW)  post-detection 
processing  schemes  for  TV  =  6  and  ikf  =  3. 
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Fig.  2.  Bit  error  floor  as  a  function  of  N  for  DPSK  receivers 
with  the  OBS-EW  (O)  and  FW  (X)  post-detection 
processing  for  LSDP  =  0.01  ( - ),  0.03(-.-.-  ),  0.05  ( — ). 


-JB.e  -».*  -i.SS  -2.0  -t.B  -I.e  -1.4  -f.2 

Lo  g  (  L-inev/i.  d  t  h  Symbol  Duraidon  Product) 

Fig.  3.  Bit  error  floor  as  a  function  of  LSDP  for  DPSK  receivers 
with  the  OBS-EW  (O)  and  FW  (X)  post-detection  processing. 
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ABSTRACT 

The  opportunity  exists  to  apply 

spread  spectrum  concepts  into  the  enormous 
bandwidth  of  optical  fibers.  We  introduce  a 
new  optical  CDMA  network  architecture 
based  on  passive  linear  unitary  filtering  of 
the  optical  carrier  signals. 

1.  INTRODUCTION 

The  opportunity  exists  to  apply 

spread  spectrum  concepts  into  the  enormous 
bandwidth  of  optical  fibers.  Recently  we 
have  seen  the  commercial  emergence  of 
spread-spectrum  radio.  In  spite  of  radio 
being  a  relatively  narrow  band  medium,  the 
spread-spectrum  concept  has  sufficient 

advantages  in  overall  capacity  and  quality 
for  it  to  compete  commercially  with 


conventional  time-division-multiplexed 
cellular  telephones. 

Surely  if  spread  spectrum  is  viable  in 
a  narrow  band  medium  such  as  radio,  it 
should  be  even  more  promising  in  a 
broadband  medium  such  as  optical  fibers. 
Included  among  the  usual  advantages  of 
coded  communications,  are  the  prospects  for 
Tera-bit  switching  and  very  high  speed 
parallel  signal  processing. 

The  form  of  optical  code  division 
multiplexing  which  we  have  in  mind  is 
different  from  previous  proposals  employing 
so-called  “optical  codes”  ,  which  are  non¬ 
negative,  and  therefore  cannot  be  truly 
orthogonal.  Instead  we  propose  an  optical 
code  division  multiplexed  system  which  is 
mathematically  and  conceptually  similar  to 


Modulating  Signal  m(t) 


S(t)+X(t) 

Transmitted 

Signal 


Fig.l  Transmitter  Block  Diagram 
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radio  code-division  multiplexing.  The 
specific  hardAvare  implementation  however 
must  match  the  opportunities  in  modem 
optical  components,  but  also  their 
limitations. 

2.  THE  TRANSMITTER 

Figure  1  shows  the  block  diagram  of 
the  transmitter  in  the  proposed  architecture. 
The  transmitter  uses  a  mode  locked  laser  to 
generate  the  unencoded  carrier  waves  at 
multiples  of  the  laser  repetition  frequency. 
The  unencoded  carrier  is  passed  through  a 
linear  filter  which  basically  does  a  unitary 
transform  to  generate  a  linear  combination 
of  the  components  of  the  unencoded  carriers. 

Each  user  has  a  different  linear  filter 
and  therefore  a  different  linear  code  which  is 
truly  orthogonal  to  the  codes  of  all  other 
users.  It  has  been  recently  shown  by  Reck 
et  al  that  every  unitary  transformation  can  be 
implemented  in  optics  using  just  mirrors, 
beam  splitters  and  phase  shifters.  The 
encoded  carrier  is  then  modulated  by  the 
message  signal  through  a  double  balanced 
modulator  to  obtain  side  bands  on  the 
modulated  codes.  All  the  user  channel  side 
bands  and  part  of  the  unencoded  carrier  are 
combined  together  using  a  star  coupler 
which  is  not  shown  in  the  block  diagram. 
The  unencoded  carrier  will  be  recovered  at 
the  receiver  end  to  avoid  the  use  of  a  local 


oscillator. 

The  linear  filters  used  in  this  system, 
apart  from  being  the  encoders  for  different 
channels,  can  also  be  used  for  some  linear 
computations  such  as  fourier  transforms  etc.. 
Because  photons  obey  the  superposition 
principle,  instead  of  carrying  out  the 
operations  sequentially  as  in  digital 
computers,  most  of  the  linear  computation 
can  be  parallized  in  a  photonic  system.  This 
has  significant  implications  for  applications 
such  as  those  involving  image 
transformation  because  part  of  the 
computational  load  can  be  moved  from  the 
electronic  system  to  the  optical  system. 

One  way  to  implement  the  double 
balanced  modulator  is  to  use  an  optical 
phase  modulator  inserted  between  two  50/50 
biconical  fused  fiber  beam  splitters  as  shown 
in  Fig.  3.  By  setting  the  phase  modulation 
constant  kg  small,  it  can  be  shown  that  the 
modulator  output  will  be  the  product  of  the 
carrier  signal  and  the  message  signal. 

3.  THE  RECEIVER 

The  receiver  block  diagram  is  shown 
in  Figure  2.  It  takes  the  multiplexed  signal 
side  bands  from  different  users  and  the 
unencoded  carrier  as  the  input.  The 
multiplexed  signal  side  bands  are  separated 
from  the  unencoded  carrier  signal  by  a 
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Fig.2  Receiver  Block  Diagram 
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m(t) 


Fig.  3  An  example  implementation  of  a  double  balanced  modulator 


Fabry-Perot  type  filter  which  has  very 
narrow  pass  bands  centered  at  the  pure 
carrier  frequencies.  By  transmitting  the 
unencoded  carrier  to  the  receiver  and 
recovering  it  by  the  carrier  separation  filter, 
we  can  avoid  the  use  of  the  local  oscillator 
for  signal  detection  which  involves 
complicated  phase  locking  mechanism  and 
polarization  tracking  in  the  optical  domain. 
The  unencoded  carrier  and  the  signal  side 
bands  suffer  the  same  phase  shift,  dispersion 
and  polarization  changes  because  they  travel 
the  same  path.  It  thus  maintains  the  CDMA 
system  complexity  to  be  compatible  with  a 
WDM  system. 

The  code  for  the  desired  channel  is 
generated  from  the  recovered  unencoded 
carrier  by  the  same  linear  filter  used  in  the 
transmitter.  The  multiplexed  signal  side 
bands  and  the  carrier  code  are  then  passed  to 
a  double  balanced  mixer  to  obtain  the 
message  signal  out.  The  double  balanced 
mixer  is  composed  of  a  50/50  coupler  and 
two  square-law  detectors  which  also 
function  as  low  pass  filters.  The  double 
balanced  modulator  output  will  give  the 
product  of  the  code  and  the  multiplexed 
signal  side  bands.  Because  of  the 
orthogonality  between  the  codes,  all  the  side 
bands  from'  other  channels  will  be  rejected 
except  the  one  which  has  the  correct  code. 


3.  CONCLUSION 

To  summarize,  we  proposed  a  novel 
optical  CDMA  architecture  that  uses  truly 
orthogonal  codes  based  on  a  passive  linear 
filtering  concept  which  in  addition  to  code 
generation  can  also  be  explored  for  other 
linear  computations  in  the  optical  domain. 
The  proposed  system  involves  no  high  speed 
coherent  optical  devices  and  uses  direct 
detection  at  its  output.  The  system 
complexity  is  compatible  to  a  WDM  system. 
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Abstract 

A  unifying  framework  for  solving  several  problems  in 
image  understanding  is  proposed  based  on  the  exploita¬ 
tion  of  subspace  fitting  techniques  originally  developed 
for  antenna  array  processing  applications.  The  SLIDE 
(Subspace-based  Line  DEtection)  algorithm  is  a  frame¬ 
work  for  introducing  the  concept  of  subspace  fitting  into 
problems  in  image  understanding  and  computer  vision. 
The  basic  formulation  of  SLIDE  provides  a  solution  to 
the  problem  of  multiple  line  fitting  in  binary  or  gray¬ 
scale  images.  However,  as  is  shown  in  this  paper,  its 
modified  versions  can  be  applied  to  several  other  image 
analysis  problems. 

1.  Introduction 

An  interesting  relationship  can  be  established  be¬ 
tween  image  analysis  and  the  notion  of  subspace  fitting 
by  thinking  of  the  image  under  consideration  in  a  wave 
propagation  context  in  which  the  digitized  image  is  re¬ 
garded  as  a  snapshot  of  a  spatial  wave  field  at  a  fixed 
instant  in  time.  This  formulation  leads  to  an  efficient 
solution  to  several  image  understanding  problems  by  ex¬ 
ploiting  ideas  and  tools  from  sensor  array  processing, 
communications  theory,  and  projection-based  transfor¬ 
mations. 

SLIDE  is  a  model-based  algorithm  based  on  the  par¬ 
titioning  of  the  induced  measurement  space  into  a  signal 
subspace  that  is  defined  by  the  few  desired  parameters, 
and  a  noise  subspace  that  includes  all  the  undesired  con¬ 
tributions.  The  signal  subspace  is  determined  by  ex¬ 
ploiting  the  spatial  (and/or  temporal)  coherency  that 
exists  between  contributions  of  the  desired  components, 
e.g.  straight  lines,  in  the  image. 

After  establishing  the  framework  for  multiple  straight 
line  fitting  applications,  it  is  possible  to  extend  the  co¬ 
herency  accumulating  notions  of  SLIDE  to  other  vi¬ 
sion  applications  by  exploiting  such  tools  of  communi¬ 
cations  theory  as  chirping/ dechirping  and  modulation. 
Circle  and  ellipse  fitting,  uniform  motion  estimation, 
linewidth  measurement  and  alignment  in  microlithog¬ 
raphy  for  manufacturing  of  integrated  circuits,  skew  de¬ 


tection  in  text  analysis,  and  estimation  of  the  location 
of  the  axis  of  symmetry  are  examples  of  such  extensions. 
In  this  paper,  the  fundamental  form  of  the  SLIDE  al¬ 
gorithm  will  be  briefly  introduced  first,  followed  by  its 
application  to  several  of  the  mentioned  application  ar^ 
eas. 


2.  The  SLIDE  Algorithm 

Starting  with  a  simplified  case  of  fitting  a  straight 
line  to  a  binary  image  with  a  set  of  colinear  pixels,  let 
us  imagine  that  there  is  an  array  of  sensors  in  front  of 
the  vertical  axis  of  the  image.  A  simple  sketch  of  such 
arrangement  is  shown  in  Fig.  1.  If  we  now  consider  the 
straight  line  to  be  the  wavefront  of  a  propagating  wave, 
the  measurements  received  at  the  sensors  will  have  the 
form: 

z(y)  = 

where  /x  is  a  constant  parameter,  and  can  be  interpreted 
as  the  speed  of  propagation. 


Figure  1:  Image  matrix  and  hypothetical  sensors. 


In  our  model,  p  is  free  to  choose,  and  its  choice  gives 
us  a  handle  to  develop  different  applications.  The  mea¬ 
surements  z{y)  have  the  form  of  a  complex  sinusoid  with 
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a  frequency  related  to  the  line  angle  6\  the  line  offset 
has  been  separated  and  encoded  in  a  constant  complex 
number.  The  varying  part  of  the  measurements  can  be 
lumped  into  a  term  ay  (6)  =  and  be  called  the 

array  response.  The  above  formulation  readily  general¬ 
izes  to  the  multiple  line  case, 

d 

z{y)  =  +n(2/)  (2) 

k  =  l 
d 

=  '^ay{0k)sk +‘n{y).  (3) 

k=l 

This  equation  is  the  starting  point  of  extensive  re¬ 
search  in  the  last  decade  on  the  so-called  subspace-based 
high  resolution  direction  finding  (and  signal  copy)  algo¬ 
rithms  (known  as  MUSIC,  ESPRIT,  WSF,  etc.,  see  e,g. 
[1]).  In  these  methods,  a  sample  covariance  matrix  is 
computed  in  a  certain  way  from  the  measurements,  and 
its  eigendecomposition  is  examined.  The  basic  concept 
of  subspace  fitting  is  that  the  d  dominant  eigenvectors 
of  this  covariance  matrix  span  the  same  subspace  that 
is  spanned  by  the  array  response  vectors  for  the  desired 
angles. 

The  next  step  would  be  to  estimate  the  line  offsets. 
It  can  be  shown  that  by  modifying  the  propagation 
scenario,  new  measurements  are  obtained  that  contain 
chirp  (quadratic  frequency)  contributions  from  the  an¬ 
gles,  and  linear  contributions  from  the  offsets.  In  other 
words,  the  offsets  will  be  encoded  as  frequencies  of  com¬ 
plex  sinusoids.  Dividing  the  chirped  measurements  z(y) 
by  the  array  response  ay  (6)  results  in  new  dechirped 
measurements  w{y) 

on  which  fast  high  resolution  spectral  estimation  meth¬ 
ods  can  be  applied  to  obtain  estimates  of  the  line  offsets. 
The  above  formulation  also  generalizes  to  gray-scale  im¬ 
ages  by  assigning  the  value  of  the  gradient  at  each  pixel 
to  the  amplitude  of  the  wave  emanating  from  it.  Details 
of  the  implementation  of  the  SLIDE  algorithm  can  be 
found  in  [2]. 

3,  2D  Motion  Estimation 

An  interesting  application  of  the  SLIDE  algorithm  is 
in  finding  the  velocities  and  positions  of  multiple  moving 
objects  in  a  frame  sequence. 

If  the  projections  of  the  frames  on  one  of  the  main 
(horizontal  or  vertical)  axes  are  stacked  up  to  produce 
a  synthetic  image  in  t  and  x,  say,  then  the  moving  pat¬ 
terns  in  the  original  sequence  will  be  represented  by 


skewed  bands  in  this  image.  Fig.  2  shows  a  simple  case, 
where  there  are  two  moving  objects  and  a  stationary  ob¬ 
ject  in  the  sequence.  Propagating  the  image  parallel  to 
the  x-axis  produces  measurements  z{t)  that  encode  the 
X  components  of  the  velocities  of  the  moving  patterns 
as  frequencies  of  complex  sinusoids: 

z{t)  =  (5) 


Figure  2:  Stack  of  the  frame  projections. 

A  similar  approach  would  result  in  the  estimation  of 
the  positions  of  the  moving  patterns.  An  advantage  of 
SLIDE  is  that  it  can  handle  multiple  moving  objects  in 
the  same  frame  sequence,  whereas  most  other  methods 
have  to  narrow  down  their  working  window  such  that 
only  one  moving  pattern  appears  in  it. 

In  Fig.  3,  the  first  and  last  frames  of  a  sequence  are 
shown,  in  which  two  helicopters  move  and  the  back¬ 
ground  is  stationary.  The  goal  is  to  find  the  velocities 
and  the  positions  of  the  helicopters. 

Fig.  4  shows  the  synthetic  images  obtained  by  stack¬ 
ing  the  projections  of  the  frames  on  the  x  and  y  axes. 
The  derivative  of  the  projections  in  time  is  taken  by 
simply  subtracting  each  frame  projection  from  its  neigh¬ 
boring  projection.  As  we  can  observe,  each  helicopter 
is  represented  by  a  skewed  band  in  these  images.  Then, 
the  algorithm  is  applied  to  these  synthetic  images  in¬ 
dependently,  and  estimates  of  the  velocities  and  loca¬ 
tions  of  the  center  of  gravity  of  the  moving  objects  are 
obtained.  These  estimates  have  been  used  to  plot  the 
trajectories  of  the  objects. 

4.  Text  Skew  Detection 

Another  application  of  the  SLIDE  algorithm  is  the 
estimation  of  the  skew  angle  in  scanned  text  images. 
Although  the  lines  of  text  contribute  to  more  than  one 
pixel  per  image  row,  and  the  character  pixels  introduce 
fuzziness  on  the  lines,  the  estimate  obtained  by  SLIDE 
is  accurate  enough  for  pursuing  the  other  steps  of  pro¬ 
cessing  the  text.  This  robustness  stems  from  the  fact 
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Figure  3:  First  and  last  frames  of  the  test  sequence. 
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Figure  4:  Stack  of  projections  on  the  x  and  y  axes,  and  detected  motions. 


that  SLIDE  exploits  the  great  amount  of  coherency  that 
exists  in  the  text  images  between  the  locations  of  the 
pixels  on  parallel  lines.  This  coherency  contributes  to 
enhancing  the  signal  subspace  component  of  the  space 
spanned  by  the  sample  covariance  matrix.  An  example 
of  applying  SLIDE  to  a  scanned  text  image  is  shown  in 
Fig.  5.  The  original  image  has  been  scanned  with  some 
amount  of  skew.  The  estimated  value  of  the  skew  has 
been  used  to  rotate  the  text  image  back  to  the  proper 
orientation. 

5.  Axis  of  Symmetry 

By  exploiting  the  tools  of  communication  theory, 
other  application  areas  can  be  added  to  the  framework 
of  the  SLIDE  algorithm.  One  such  tool  is  signal  modu¬ 
lation,  in  which  a  passband  signal  can  be  represented  by 
the  location  of  a  central  carrier  frequency,  surrounded 
by  two  sidebands.  A  particular  case  of  interest  in  this 
application  category  is  the  estimation  of  the  axis  of  sym¬ 
metry  of  patterns. 

Consider  the  simple  case  of  Fig.  6.  Assuming  that 
the  pattern  profile  in  each  row  of  the  image  can  be  rep¬ 
resented  by 

f{x-xo  +  ytand),  (6) 

the  measurements  after  dechirping  would  be 

w{y)  =  (7) 

which  have  the  structure  of  a  modulated  signal  with  a 
central  carrier  frequency  axo-  The  shape  of  the  side¬ 
bands  around  the  carrier  frequency  in  the  frequency 


domain  of  'w{y)  is  a  scaled  version  of  /(.),  and  hence 
is  symmetric.  It  can  be  shown  that  applying  subspace 
fitting  techniques  to  such  modulated  signals  results  in 
an  estimate  of  the  mean  frequency.  In  this  application, 
the  mean  frequency  is  proportional  to  the  position  of 
the  axis  of  symmetry,  xq.  Fig.  7  presents  an  example 
of  applying  the  SLIDE  algorithm  to  finding  the  axis  of 
symmetry  of  an  alignment  mark  on  a  wafer.  The  de¬ 
tected  axis  of  symmetry  has  been  superimposed  on  the 
original  gray  scale  image  taken  from  the  wafer. 

Detection  of  horizontal  and  vertical  axes  of  symmetry 
can  be  extended  to  arbitrary  symmetric  shapes.  The  ba¬ 
sic  observation  in  applying  subspace  fitting  methods  is 
that  due  to  the  symmetry  in  the  original  shape,  the  mea¬ 
surements  will  have  symmetric  sidebands  around  the 
carrier  frequency  in  the  frequency  domain.  Subspace 
fitting  results  in  an  estimate  of  the  carrier  frequency. 
Fig.  8  shows  an  image  with  a  pattern  symmetric  in  the 
horizontal  direction.  The  location  of  the  detected  axis 
of  symmetry  has  been  plotted  on  the  image. 

6.  Circle  and  Ellipse  Fitting 

The  circle  is  the  most  symmetric  pattern.  In  addi¬ 
tion  to  the  estimation  of  the  location  of  its  center,  it 
can  be  shown  that  SLIDE  can  also  find  an  estimate  for 
the  value  of  the  circle  radius.  This  value  appears  as 
the  carrier  frequency  of  a  modulated  signal,  which  in 
this  case  is  itself  variable  with  the  measurement  index 
y.  This  technique  is  directly  applicable  to  both  hollow 
circles  and  disks,  and  can  readily  be  applied  to  finding 
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Figure  5:  Scanned  text  image  with  a  skew  and  the  skew-compensated  image 


Vy  tane 


Figure  6:  Image  with  a  symmetric  pattern. 


Figure  8:  Image  with  a  vertical  axis  of  symmetry. 


the  parameters  of  ellipses  as  well. 
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ABSTRACT 

The  paper  presented  deals  with  phase  retrieval  prob¬ 
lem  for  image  reconstruction  from  only  spectrum  mag¬ 
nitude.  Only  two-dimensional  spatially  limited  non¬ 
negative  objects,  which  are  characterized  by  analytical 
spectra,  are  considered  assuming  that  the  unique  solu¬ 
tion  of  phase  problem  exists.  In  this  paper  it  is  pro¬ 
posed  to  use  nonlinear  optimization  approach,  namely, 
well-known  maximum  entropy  method  (MEM)  which 
has  very  good  extrapolation  features  and  high  stabil¬ 
ity  to  noise.  For  solving  phase  retrieval  problem  we 
propose  to  introduce  into  the  optimized  entropy  func¬ 
tional  additional  unknowns  related  to  a  real  and  imag¬ 
inary  parts  of  an  object  spectrum  and  represent  the 
constraints,  which  are  derived  from  measured  spectrum 
magnitude  data,  as  linear  constraints,  in  order  to  re¬ 
duce  the  optirnization  problem  to  the  standard  MEM. 
The  whole  computational  algorithm  is  constructed  as  a 
combination  of  the  standard  MEM  algorithm  and  ad¬ 
ditional  nonlinear  constraint  for  a  real  and  imaginary 
parts  of  the  spectrum  data  which  is  realized  during 
computational  iterations.  Images  reconstructed  by  the 
proposed  MEM  approach  may  be,  if  necessary,  further 
improved  by  Fienup’s  iterations.  In  this  case  the  previ¬ 
ous  image  is  used  as  a  starting  point  ensuring  reliable 
convergence  of  Fienup’s  algorithm  to  sought  for  solu¬ 
tion.  Numerous  simulation  results  demonstrate  validity 
and  high  efficiency  of  the  approach  proposed. 

l.INTRODUCTION 

Some  problems  of  signal  processing  are  connected  with 
solving  phcLse  problem,  i.e.  image  or  signal  reconstruc¬ 
tion  from  only  spectral  magnitude.  The  most  of  phys¬ 
ical  instruments  are  able  to  register  only  spectrum  in¬ 
tensity  data,  but  for  correct  representation  of  an  ob¬ 
ject  it  is  necessary  to  have  information  both  on  spec¬ 
tral  magnitude  and  pha^e.  It  is  proved  that  in  the  case 
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of  two-dimensional  spatially  limited  non-negative  func¬ 
tions,  which  have  analytical  spectra,  the  phase  problem 
has,  except  of  cases  determined  on  null  set,  unique  so¬ 
lution  within  a  class  of  equivalent  functions  [1].  We 
will  assume  that  solution  of  phase  problem  exists.  But 
existence  of  the  solution  does  not  guarantee  conver¬ 
gence  of  a  phase  retrieval  algorithm  to  it.  The  prob¬ 
lem  of  synthesis  of  the  algorithms,  which  converge  reli¬ 
ably  to  sought  for  solution,  is  a  very  difficult  problem. 
At  present  the  most  popular  algorithms  are  algorithms 
based  on  Fienup’s  iterations  [2],  But  it  is  known  that 
sometimes  Fienup’s  algorithm  fails.  The  main  possi¬ 
ble  reason,  as  was  discussed  in  [3],  is  a  starting  point 
which  is  chosen  in  the  iteration.  In  this  paper  we  pro¬ 
pose  at  first  to  seek  for  the  solution  of  phase  problem  by 
maximum  entropy  method  modified  to  spectrum  mag¬ 
nitude  data  and  then  improve  this  result,  if  necessary, 
by  Fienup’s  iterations  using  it  as  a  starting  point  which 
ensures  reliable  convergence  of  the  algorithm  to  sought 
for  solution. 

2.BASIC  ALGORITHM 

In  this  section  the  part  of  the  combined  phase  retrieval 
algorithm,  proposed  in  the  section  4,  which  is  based  on 
maximum  entropy  method,  is  described.  The  standard 
MEM  [4]  for  image  reconstruction  from  complex  spec¬ 
trum  can  be  represented  as  the  following  optimization 
problem.  (We  consider  only  discrete  form.) 


min  ^  ^  Xml  ln(a:ml), 

m  J 

(1) 

'^J2^miaml  =  Ank, 

m  1 

(2) 

~  Bnkj 

m  1 

(3) 

Xml  >  0. 

(4) 

where  x^i  are  unknowns;  Ank  and  Bnk  are  known 
spectrum  data,  real  and  imaginary  parts  respectively; 
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and  are  constants,  related  to  Discrete  Fourier 
Transformation: 

=  cos(27r(mn  +  lk)/N)/N  ^  (5) 

b^j  =  —  sin(27r(mn  +  lk)fN)/N,  (6) 

Ank  and  Bnk  can  be  represented  in  the  following 
way 

Afik  —  ^nk  COS  (f>nky  (7) 

Bnk  =  Mnk^iri(j)nky  (8) 

where  Mnk  and  (j>nk  are  spectral  magnitude  and  phase 

respectively. 

In  the  case  of  phaseless  data  let  cos  (f>nk  and  sin(^„jfe 
be  unknowns  as  well.  Using  the  following  identities 

cos  (l>nk  ”  2  cos  ((^yi^/2)  1,  (9) 

sijl<j)nk  “  “h  C0s(i^^^/2))  —  1,  (10) 

and  determining  the  following  new  variables 

ink  -  2coS^((/»„fc/2),  (11) 

Snk  =  (sin(0nfc/2)  +  cos(<^„fc/2))^  (12) 

which  are  nonnegative,  it  is  possible  to  rewrite  (1)  -  (4) 
as  follows 

min^  ^  Xml  ln(a;mi)  + 

m  I 

+EE‘»  k  lll(^nfc)  “1"  ^nk  lll-(Sj^/b),  (18) 


n  k 

^  ^  ^  V  Qynf  ~  ^nk^nk  ~  '^^nky  (1^) 

m  } 

^  ^  ^  ^  ^mlbml  ^nk^nk  ^  ^nky  (1^) 

m  I 

^ml  y^nky  ^nk  ^  0.  (10) 

In  addition  we  demand  meeting  the  following  non¬ 
linear  condition 

(tnk  -  If  +  {Snk  -  If  =  1.  (17) 


For  obtaining  solution  of  considered  phase  problem 
it  is  proposed  to  solve  (13)  -  (16)  numerically  using 


steepest  ~  descent  method  and  require  after  each  itera¬ 
tion  meeting  (17). 

The  optimization  problem  written  as  (13)  -  (16) 
looks  like  the  standard  one  (1)  -  (4).  Only  additional 
variables  tnk  and  Snky  which  are  non-negative  as  well 
as  sought  for  function  Xmh  are  included  into  entropy 
functional  and  linear  constraints. 

Using  Lagrange  method  it  is  easy  to  obtain  the  so¬ 
lution  of  the  (13)-(16)  expressed  via  dual  factors 

Xml  =  exp(-  Y 
n  k 

tnk  —  ^^pi^^nkAInk  1))  (1^) 

Snk  =  exp{l3nkMnk  “  !)•  (20) 

The  dual  factors  Onk  and  /?„jt  refer  to  (14)  and  (15) 
respectively. 

As  it  is  seen  from  (18)-(20),  requirements  of  (16) 
are  met. 

Dual  unconditional  optimization  problem  appears 
as  follows 

min  EE  EE(^"*  +SnA:)- 

ml  n  k 

-EE  {oink^nk  "H  i^nk^nk^-  (21) 

n  k 

We  propose  to  solve  (21)  using  steep est-descent 
method.  After  each  iteration  dual  factors  get  increases 

^ank{l3nk')’’ 


O^nk  ^  ^nk  A  ^anky 

(22) 

l^nk  ^  Pnk  "1"  ^pnk* 

(23) 

Simulation  results  for  relatively  simple  image  are 
shown  in  Fig.l.  Analysis  of  simulation  results  [5]  in 
the  case  of  more  complicate  images  shows  that  the  so¬ 
lutions  obtained  only  by  MEM,  i.e.  (13)- (16),  are  char¬ 
acterized  by  good  reconstruction  of  low-frequency  re¬ 
gion  of  spectrum  which  may  be  narrow.  Supplement 
of  the  nonlinear  constraint  (17)  allows  to  considerably 
widen  the  reconstructed  region.  Further  improving  is 
possible  by  Fienup^s  iterations  if  MEM-image  is  used 
as  a  starting  point. 

3.COMBINED  MAXIMUM  ENTROPY  AND 
FIENUP’S  ITERATION  METHOD 

The  most  well-known  algorithm  for  phase  retrieval  is 
Fienup^s  iterative  reconstruction  [2].  It  is  proved  that 
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Fienup’s  algorithm  does  not  have  a  character  of  com¬ 
pressibility  and  error  at  each  iteration  can  stay  the 
same.  In  [3]  a  potential  source  of  error  in  the  numerical 
implementation  of  the  algorithm  is  discussed.  Besides 
it  was  established  that  Fienup^s  algorithm  sometimes 
fails  converging  to  a  fixed-point.  Authors  of  the  paper 
have  presented  a  conjecture  that  converging  of  the  algo¬ 
rithm  towards  the  fixed  point  depends  on  the  starting 
point  which  is  chosen  in  the  iteration. 

For  avoiding  failure  of  Fienup’s  algorithm  caused 
by  unsuccessful  starting  point  it  is  proposed  to  com¬ 
bine  phase  retrieval  algorithm  from  two  parts.  In  the 
first  part  we  propose  to  obtain  the  starting  point  using 
maximum  entropy  method  which,  as  was  mentioned  in 
the  Section  2,  ensures  good  reconstruction  of  spectral 
phase  in  low  frequency  region.  The  second  part  as¬ 
sumes  reconstruction  by  Fienup^s  algorithm  using  the 
image  reconstructed  by  MEM  as  a  starting  point.  As 
simulation  results  demonstrate,  the  combination  pro¬ 
posed  ensures  reliable  convergence  of  the  algorithm  to 
sought  for  solution.  Error  study  shows  high  stability  of 
the  algorithm  to  noise  [5] . 


may  be,  if  necessary,  further  improved  by  Fienup^s  it¬ 
erations.  In  this  case  the  previous  image  is  used  as  a 
starting  point  ensuring  reliable  convergence  to  sought 
for  solution.  Numerous  simulation  results  demonstrate 
validity  and  high  efficiency  of  the  approach  proposed. 
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4.CONCLUSION 

In  this  paper  the  algorithm  for  image  reconstruction 
from  only  spectrum  magnitude  based  on  maximum  en¬ 
tropy  method  is  proposed.  It  is  suggested  to  intro¬ 
duce  into  the  optimized  entropy  functional  and  lin¬ 
ear  constraints,  which  are  derived  from  measured  spec¬ 
trum  magnitude  data,  additional  unknowns  related  to 
a  real  and  imaginary  parts  of  the  spectrum  of  an  object. 
The  whole  computational  algorithm  is  constructed  as  a 
combination  of  the  standard  MEM  algorithm  and  ad¬ 
ditional  non-linear  constraint  for  a  real  and  imaginary 
parts  of  spectrum  data,  which  should  be  realized  during 
computational  iterations  of  steepest-descent  method. 
Images  reconstructed  by  the  proposed  MEM  approach 
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ABSTRACT 

The  problem  of  nonlinear  distortions  of  images  recon¬ 
structed  from  incomplete  and  noisy  spectrum  data  us¬ 
ing  nonlinear  optimization  methods  such  as  maximum 
entropy  method  is  considered.  To  decrease  the  level  of 
nonlinear  distortions  it  is  proposed  to  seek  solution  in 
the  space  of  complex  functions  using  generalized  max¬ 
imum  entropy  method  instead  of  classical  one. 

l.STATEMENT  OF  THE  PROBLEM 

Let  us  consider  the  problem  of  image  reconstruction 
from  incomplete  and  noisy  spectrum  data.  Images  un¬ 
der  the  consideration  are  images  of  intensities  and  de¬ 
scribed  by  real  non-negative  distributions.  In  order  to 
reconstruct  missing  spectrum  components,  especially 
in  high  frequency  domain,  it  is  appropriate  to  use  non¬ 
linear  reconstruction  procedures  having  super  resolu¬ 
tion  effect.  Such  a  problem  arises,  for  example,  in 
radio  astronomy  where  aperture  synthesis  principle  is 
used  for  high  resolution  source  imaging.  Most  known 
and  effective  computational  method  for  processing  such 
images  is  maximum  entropy  method  (MEM)  [1].  Clas¬ 
sical  MEM  ensures  non-negative  solution  expressed  as 
exponential  function  of  an  argument.  MEM  is  charac¬ 
terized  by  high  stability  to  input  data  noise.  But  when 
the  level  of  noise  becomes  too  large  (such  a  situation 
is  typical,  for  example,  in  radio  astronomy  where  mea¬ 
surements  of  visibility  function  are  made  with  low  sig¬ 
nal  to  noise  ratio)  classical  MEM  produces  nonlinear 
distortions  in  the  form  of  many  artifacts  with  ampli¬ 
tudes  commensurable  with  amplitude  of  sought  signal. 
What  is  the  cause  of  such  distortions?  The  matter  is 
that  in  spite  of  non-negativity  of  an  actual  object  dis¬ 
tribution,  generally,  a  complex  image  corresponds  to 
noisy  input  data.  Therefore  the  solution  of  image  re¬ 
construction  problem  should  be  sought  in  the  space  of 
complex  distributions.  By  the  same  token  it  is  possible 
to  reduce  a  level  of  nonlinear  distortions  moving  them 
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to  real  negative  and  imaginary  parts  of  output  which 
should  be  rejected. 

The  aim  of  the  paper  is  to  show  on  the  example 
of  maximum  entropy  method  how  the  generalization  of 
output  solution  until  complex  functions  can  consider¬ 
ably  improve  the  quality  of  reconstructed  images. 

2.GENERALIZED  MAXIMUM  ENTROPY 
METHOD  FOR  SOLVING  THE  PROBLEM 

Originally  generalized  maximum  entropy  method 
(GMEM)  was  proposed  for  reconstruction  of  complex 
coherent  images  formed  in  accordance  with  radio  holog¬ 
raphy  principle  [2,3].  In  [4]  the  GMEM  was  considered 
for  reconstruction  of  ISAR  images.  Below  it  will  be 
shown  that  the  GMEM  is  more  preferable  than  classi¬ 
cal  MEM  also  in  the  case  of  real  non-negative  images  of 
intensities  if  data  are  measured  with  low  signal  to  noise 
ratio.  Here  let  us  recall  basic  aspects  of  the  GMEM 
[2,3].  The  GMEM  assumes  seeking  solution  in  complex 
form:  rmi  +  jqmi  •  Both  a  real  and  an  imaginary  parts  of 
an  image  must  be  represented  as  a  difference  between 
two  non-negatively  determined  sequences  as  follows: 

—  ^ml  yml  j  QmJ  ^  '^ml  j  (1) 

where  Xml^yrnh  >  0. 

If  sequences  XmJ.ymi  and  ZmhVmi  do  not  overlapp, 
than  Xml  and  Zmi  determine  positive  parts  and  ymi  and 
Vmi  determine  negative  parts  of  the  sequences  rmhQmi^ 
The  GMEM  assumes  minimization  of  the  functional 
which  is  written  as  follows: 

miny^y^a^m/  ln(axrn0  +  ymi  ln(ai/m/) 

m  I 

where  a  is  a  positive  parameter  responsible  for  not  over¬ 
lapping  positive  and  negative  parts  of  the  sought  for 
sequences. 
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Linear  constraints  derived  from  measured  spectrum 
data  are  written  in  the  following  way: 

“  i^rnl  “  =  ^njb)  (3) 

m  J 


y^  y^  (^m?  "h  ^nk)  (4) 

m  / 

^mh  Vm}  ^  ^mh '^ml  ^0,  (5) 

where  are  constant  coefficients  which  deter¬ 

mine  image  formation  system,  Ank^Bnk  are  measured 
real  and  imaginary  parts  of  spectrum  samples  respec¬ 
tively. 

Solutions  for  (xmi^ymi)  and  (zmhVmi)  are  connected 
by  the  expression: 

Xmiymi  =  ZmiVmi  =  exp(-l  -  ln(a)).  (6) 

Making  parameter  a  larger  we  can  reach  not  overlap¬ 
ping  effect.  So,  the  GMEM  approach  allows  to  obtain 
solution  of  reconstruction  problem  in  the  space  of  com¬ 
plex  functions.  Because  of  the  optimization  problem 
(2)-(4)  is  solved  with  respect  to  non-negatively  deter¬ 
mined  variables  (5)  the  method  possesses  super  reso¬ 
lution  effect  as  classical  MEM.  If  it  is  required  to  re¬ 
construct  an  image  of  a  real  non-negative  object  Xmi 
it  is  necessary  to  represent  the  sought  for  sequence  in 
complex  form  as  (1).  But  as  a  final  result,  which  must 
agree  a  priori  information  about  non-  negativity  of  an 
object,  only  a  real  non-negative  part  of  the  solution 
should  be  taken  into  account.  Non-zero  values  of  the 
sequences  ymUZmU  should  be  rejected  because  they 
arise  only  due  to  noise  in  input  data  and  carry  no  in¬ 
formation  about  real  object  distribution.  So,  by  using 
the  GMEM  it  is  possible  to  transform  most  part  of  in¬ 
put  noise  to  those  parts  of  output  solution  which  can 
be  easy  rejected.  On  the  contrary,  if  we  use  classical 
MEM,  we  can  not  reject  distortions,  caused  by  input 
data  noise,  and  they  all  appear  in  the  form  of  njnlinear 
artifacts  in  the  output  image. 

3.SIMULATION  RESULTS 

In  Fig.l  simulation  results  proving  advantage  of  the 
GMEM  compared  to  classical  MEM  for  reconstruction 
of  intensity  images  from  noisy  spectrum  data  are  shown. 
The  model  includes  two  gaussian  components.  The  ob¬ 
ject  spectrum  with  large  ” holes”  in  both  low  and  high 
space  frequency  domain  represents  input  data.  Lin¬ 
early  estimated  image  obtained  using  inverse  Fourier 
transform  is  characterized  by  linear  distortions  because 


of  missing  spectrum  data.  As  is  seen,  the  image,  recon¬ 
structed  by  classical  MEM,  has  large  non-linear  distor¬ 
tions  and  the  image,  reconstructed  by  the  GMEM,  has 
much  less  distortions. 

4.CONCLUSION 

As  is  shown,  using  for  reconstruction  of  intensity  im¬ 
ages  instead  of  classical  nonlinear  optimization  meth¬ 
ods  such  as  classical  maximum  entropy  method  with 
real  non-negative  output,  their  generalized  forms  with 
complex  output,  allows  to  considerably  decrease  the 
level  of  nonlinear  distortions  caused  by  noise  in  input 
data.  As  a  final  result  only  the  real  non-negative  part 
of  the  corresponding  complex  solution  should  be  taken 
into  account. 
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Netherlands,  p.  407 

4.  Frieden  B,R.,  Bajkova  A.T.  1994,  Bayesian  cross¬ 
entropy  reconstruction  of  complex  images,  Applied  Op¬ 
tics,  33,  p.  407 


Reconstructed  by  MEM 


Reconstructed  by  GMEM 

Fig.l 
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Linear  filtering  remains  an  attractive  method  of 
image  recognition  because  of  simplicity  of  its 
performance  in  standard  computing  systems.  Moreover, 
even  nonlinear  procedures  of  image  processing  include, 
as  a  rule,  linear  filtering  for  preliminary  selection  and 
transformation  of  classifying  signs  and  compression  of 
processed  information  to  acceptable  values.  EfBciency  of 
linear  filtering  is  defined  mainly  by  the  proper  choice  of 
filter  set,  that  must  correspond  to  the  peculiarities  of  the 
certain  problem.  Generally,  it  is  desired  that  the  total 
number  of  used  filters  would  be  as  small  as  possible. 
From  this  point  of  view  standard  orthogonal  transforms 
(Fourier,  Haar,  etc.)  are  usually  not  optimal  and 
ineffective.  It  seems  much  more  promising  to  use  filters, 
constructed  for  some  optimum  criterion  with  account  of 
statistical  properties  of  image  general  population.  The 
well  known  example  of  such  approach  is  given  by 
Karhunen-Loeve  decomposition.  Different  authors  differs 
in  estimation  of  KL-transform  efficiency  for  image 
processing  and  recognition,  but  in  any  case  it  is  evident, 
that  KL-  decomposition  is  optimal  for  representation  of 
the  images,  not  for  their  discrimination.  In  the  number  of 
papers  [1,2]  there  were  suggested  criteria  of  discriminant 
filters  construction,  based  on  a  priori  information  about 
in-class  and  inter-class  correlations  of  the  general 
population.  But  practical  application  of  these  criteria, 
including  KL-decomposition,  involves  some  difficulties. 
Firstly,  to  construct  the  filters  it  is  necessary  to  solve  the 
spectral  problem  of  matrix  of  dimension  MxM,  defined 
by  the  number  of  samples  M  in  the  image.  If  M  value  is 
about  2^^  -2^^,  solution  of  such  problem  is  possible  only 
with  strong  additional  suppositions.  Secondly,  the 
complete  statistical  information  about  image  ensemble  is 
never  available  to  investigator,  who  has  usually  only 
limited  number  of  experimentally  obtained  images  on  his 
disposal.  Nevertheless,  even  the  comparatively  small  set 
of  images  often  is  informative  enough  to  obtain 
estimation  of  statistical  parameters  of  general  population, 
sufficient  for  constructing  effective  processing  algorithm. 
In  this  paper  we  suggest  the  procedure  of  discriminant 
filters  construction,  based  on  a  priori  classified  training 
set,  and  represent  some  results  of  its  practical 
application. 


We  consider  image  ensemble  {X},  that  consists 
of  m  classes  {Xa},  a=l,m;  each  forming  normal 
population  with  the  average  Ma  and  variance  S.  We 
suppose,  that  the  ensemble  is  represented  by  a  training 
set,  consisting  of  N  a  priori  classified  images.  In  matrix 
representation  dispersion  of  elements  in  each  class  is 
described  by  spread  matrix  Zi  ,  and  dispersion  of  classes 
is  given  by  matrix  Zz  [3].  In  terms  of  the  training  set  the 
spread  matrices  are  estimated  approximately  as: 


+ 


^  m 

a=l 
.  m 

Mo=-2:m„ 

(1) 


where  N„  is  number  of  training  set  images,  rated  in  a 
class,  SNa  =N.  We  construct  filters  for  the  criterion  of 
maximum  of  some  value  J,  that  increases  with  divergence 
between  classes  and  decreases  with  increasing  in-class 
dispersion.  To  construct  the  set  of  K  filters  we  define  Jk 
as  trace  of  the  principle  minor  Hk  of  symmetric  ratio  of 
spread  matrices  H: 

J  =  tr[HJ 

H  =  0.5(Z,-Z,+Z,Z.-) 

Criterion  J  achieves  its  maximum,  when  matrix  H  is 
represented  in  diagonal  form,  ordered  in  decreasing 
diagonal  elements.  As  criterion  matrix  H  is  self¬ 
conjugate,  it  is  brought  to  diagonal  form  with  the  unitary 
transform  F; 

F=[fl,f2,...,fMl  (3) 

where  fj  are  eigen  vectors  of  H.  Hence,  as  filters,  optimal 
for  J  criterion  one  must  choose  the  vectors  fj,  cor¬ 
responding  to  the  maximal  eigen  values  Xj  of  matrix  H. 
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We  solve  the  spectral  problem  of  matrix  H, 
following  the  procedure,  suggested  by  Kurashov  and 
Chumakov  for  Karhunen-Loeve  basis  construction  in 


approximation  of  training  set  [4].  This  procedure 
radically  decreases  number  of  necessary  operations,  if 
number  of  images  in  the  training  set  is  much  less  than 
dimension  of  the  image  itself.  According  to  this 
algorithm  we  orthogonalize  the  training  set  with  Gram- 
Schmidt  procedure  with  the  resulting  formation  of  some 
basis  {Ci},  that  consists  of  N  vectors  equal  in  dimension 
to  the  initial  images  {X}.  Then  we  represent  the  training 
set  in  basis  {Ci}: 


With  representation  (4)  we  receive  according  to  (1),  (2) 
the  expressions  for  spread  matrices  Zj ,  Z2  and  criterion 


matrix  H  that  have  dimension  NxN,  NxN«MxM.  The 
important  notice  is,  that  the  dimension  is  decreased  with 


no  loss  in  information,  and  the  recovery  of  original 
representation  is  performed  with  inverse  transform  [Ci]'^ 
.Hence  the  final  expression  for  the  required  filters  is: 

fj  =  [ei,e2,...,eN]-fj,  j  =  (5) 

where  f  j  are  eigen  vectors  of  matrix  H,  derived  with  any 
standard  algorithm. 

We  studied  the  efBciency  of  the  described 
procedure  for  recognition  of  images,  formed  by  raster 
representation  of  binary  coded  harmonic  and  LFM 
signals  (fig.l).  We  have  chosen  such  objects  in  order  to 
have  a  chance  of  comparing  the  results  with  theoretical 
resolution  limit.  In  our  experiments  the  training  set 
consisted  of  six  groups  of  signals,  with  six  realizations  in 
each  group  (fig.  2). 


frequency 


Fig.l.  Raster  representation  of  binary  coded  Fig.  2.  Spectra  of  signals  of  the  training  set. 

LFM  and  harmonic  signals  without  noise  0)3-02  =02  -Oi  =Ao=7.50r;  o^  — 2.5or.Oi.  is  limiting  resolution 

fi*equency,  Od  is  frequency  deviation  in  signals  (l)-(3). 
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Each  image  had  128x128  samples  and  included  wide 
band  noise  component,  noise  power  was  twice  that  of 
regular  signal.  It  is  evident,  that  such  training  set  can  be 
classified  in  three  different  manners:  1)  for  type  of 
modulation  -  in  two  classes;  2)  for  central  spectrum 
frequency  -  in  three  classes;  3)  for  both  signs  -  in  six 
classes.  For  each  type  of  classification  we  found  the 
respective  set  of  discriminant  filters,  that  were  tested  on 
the  examining  set.  The  examining  set  coincided  in 
structure  with  the  training  one  and  differed  by  noise 
realizations.  The  results  of  recognition  for  modulation 
type  and  for  central  fi-equency  are  represented  on  the 
fig.  3  as  two-dimensional  projection  of  the  sign  space. 
Good  clustering  of  responses  shows,  that  for  such  types  of 
classification  one  can  construct  a  deciding  rule  for 


Based  on  the  described  results  we  have  arrived 
at  two  following  conclusions.  Firstly,  a  priori  information 
about  general  population,  derived  from  a  training  set, 
enables  construction  of  the  set  of  orthogonal  filters] 
which  provide  effective  recognition  of  images  of  very 
large  dimension.  Secondly,  Mahalanobis  distance  is  a 
good  measure  for  quality  of  image  recognition  not  only 
for  two  normal  populations,  but  also  for  images  with 
different  statistics. 


confident  recognition  with  combination  of  two  filters.  As 
quantitative  assessment  of  class  discrimination  we  used 
Mahalanobis  distance  d  [5].  The  choice  of  such  measure 
seems  quite  natural,  because  it  is  closely  linked  to 
probability  of  error  of  Bayes  classificator.  Particularly,  for 
two  normal  classes,  each  including  n  images,  error 
probability  is  P=0.5exp(-nd/8).  In  the  case  of  several 
classes  we  determined  Mahalanobis  distance  dap  for  each 
pair,  recognition  quality  was  assessed  by  the  value  of  do  = 
Min{da|3  }.  The  dependence  of  d  on  number  of  used 
filters  (fig.  4)  shows,  that  the  error  of  recognition  of  M 
classes  approaches  its  low  limit  when  (M-l)  filters  are 
used,  the  further  increase  of  their  number  has  little  effect 
on  the  recognition  quality. 


N 


a)  b) 

Fig.3.  Two-dimensional  projection  of  the  sign  space. 

(a)  -  classification  for  type  of  modulation;  (b)  -  classification  for  central  fi'equency. 


Fig.4.  The  dependence  of  the  minimal  Mahalanobis 
distance  on  number  of  used  filters  for  the  case  of 
classification  for  six  classes. 
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Abstract 

The  principle  analytic  continuation  is  used  to 
generate  broadband  information  from  narrow  band 
data.  The  given  information  can  be  theoretical  data- 
points  or  measured  data.  The  given  data,  as  a  func¬ 
tion  of  frequency,  is  modeled  as  a  ratio  of  two  polyno¬ 
mials.  The  Cauchy  method  can  be  used  in  numerical 
electromagnetics,  optics,  to  reduce  the  size  of  large 
databases. 

1*  Introduction 

In  a  host  of  applications  in  engineering,  it  is  nec¬ 
essary  to  obtain  information  about  a  system  over 
a  broad  range.  In  most  cases  it  is  not  possible  to 
evaluate  the  parameter  of  interest  in  closed  form. 
However,  either  approximate  or  experimental  data 
is  available  in  a  narrow  band.  Generation  of  the 
data  over  the  broadband  is  not  possible  or  may  be 
extremely  time  consuming.  The  principle  of  analytic 
continuation  is  utilized  to  extrapolate/interpolate 
the  data  over  a  wide  band.  A  variation  on  the 
method  of  Cauchy,  as  described  in  [1],  has  been  cho¬ 
sen  in  this  paper  to  implement  the  analytic  contin¬ 
uation. 

The  Cauchy  method  deals  with  approximating 
an  analytic  function  by  a  ratio  of  two  polynomials. 
Given  the  value  of  the  function  and,  optionally,  its 
derivatives  at  a  few  points,  the  order  of  the  poly¬ 
nomials  and  their  coefficients  are  evaluated.  Once 


the  coefficients  of  the  two  polynomials  are  known, 
they  can  be  used  to  generate  the  parameter  over  the 
entire  band  of  interest. 

As  an  application,  Cauchy’s  method  has  been  uti¬ 
lized  to  generate  broadband  currents  on  a  body  from 
which  its  Radar  Cross  Section  (RCS)  is  calculated. 
This  is  done  from  narrowband  calculations  of  the 
currents.  Particularly  in  the  Method  of  Moments 
[2],  [3]  generation  of  the  response  at  each  frequency 
point  is  very  time  consuming.  However,  the  cur¬ 
rent  and  its  derivatives  with  respect  to  frequency 
can  be  calculated  at  a  few  points  using  the  Method 
of  Moments.  Then  Cauchy’s  method  can  be  used  to 
extrapolate/interpolate  the  current  over  a  broad  fre¬ 
quency  range  from  which  the  RCS  can  be  calculated. 

Another  area  of  application  for  the  Cauchy 
method  is  that  of  device  characterization.  A  very 
useful  tool  in  computer  aided  design  would  be  an 
online  description  of  the  characteristics  of  many  de¬ 
vices.  But,  since  each  device  could  be  used  under 
different  operating  conditions,  each  with  its  own  fre¬ 
quency  characteristic,  the  memory  required  to  de¬ 
scribe  all  devices  would  be  prohibitive.  Here  the 
Cauchy  method  could  be  used  to  generate  broad¬ 
band  information  while  storing  the  measured  data 
at  only  a  few  points. 

In  this  paper  the  Cauchy  technique  is  used  to  solve 
the  above  problems.  In  each  of  the  cases  mentioned 
above  the  Cauchy  technique  would  save  a  signifi- 
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This  can  be  written  in  the  matrix  form 


cant  amount  of  program  execution  time  or  computer 
memory  while  still  producing  accurate  results  over 
broadband  frequencies.  We  present,  as  examples, 
the  application  of  the  Cauchy  method  to  generate 
broadband  currents  and  to  device  characterization. 

2.  The  Cauchy  Method 

Consider  a  system  function  II{s).  The  objective  is 
to  approximate  H{s)  by  a  ratio  of  two  polynomials 
A(s)  and  B(s)  so  that  H{s)  can  be  represented  by  a 
few  variables. 

Consider 


H{s)  ~ 


^  ELo  aks'^ 

Eto  hs’‘ 


(1) 


A|  -B 


(6) 


The  order  of  matrix  A  is  iV  X  (P  +  1)  and  that  of 
B  is  iV  X  (Q  + 1)  and  [a]  and  [6]  are  the  vectors  with 
the  polynomial  coefficients. 

This  matrix  equation  can  be  solved  using  a  singu¬ 
lar  value  decomposition.  The  solution  vector  is  the 
right  eigenvector  corresponding  to  the  zero  singular 
value.  Hence,  the  number  of  non-zero  singular  val¬ 
ues  gives  an  estimate  of  the  required  orders  of  the 
polynomial  coefficients. 


3.  Applications  of  the  Cauchy  Method 


Here  the  given  information  could  be  the  value  of 
H (s)  and  its  Nj  derivatives  at  some  frequency  points 
Sj,  j  =  1, ..  .J.  If  represents  the  deriva¬ 

tive  of  H(s)  at  point  s  =  Sj,  the  Cauchy  problem  is: 

Given  for  n  =  0,...Nj,  j  =  1,...J,  find 

P,Q,  {ak,k  =  0, . .  .P},and{bk,k  =  0, . .  .Q}. 

By  enforcing  the  equality  in  equation  (1),  mul¬ 
tiplying  out,  differentiating  n  times,  and  evaluating 
the  expressions  at  point  Sj ,  one  obtains  the  binomial 
expansion, 

(2) 

1=0 

Using  the  polynomial  expansions  for  A(s)  and 
B{s),  equation  (2)  can  be  rewritten  as 

P  Q 

12  =  12  ^{j,n),kh  (3) 

A:=0  fc=0 

where 

A(j,n),fc  =  (^) 

-  i) 
(5) 

n  =  0, 1,  -—Nj,  j  =  1, ..  J,  where  u(k)  =  0  for  A;  <  0 
and  =  1  otherwise. 


An  example  of  the  use  the  Cauchy  method  we  in¬ 
terpolate  the  currents  on  a  sphere  being  illuminated 
by  a  plane  wave.  The  Method  of  Moments  program 
evaluates  the  current  and  its  first  four  derivatives 
with  respect  to  frequency.  This  information  was 
used  as  input  to  a  Cauchy  subroutine.  The  origi¬ 
nal  Method  of  Moments  program  was  used  to  calcu¬ 
late  the  RCS  without  the  Cauchy  method.  The  two 
RCS  plots  were  compared  to  show  the  accuracy  of 
the  Cauchy  method. 

The  current  and  its  first  four  derivatives  are  eval¬ 
uated  at  5  frequency  points  in  three  ranges.  0.6m  < 
A  <  1.0m,  1.0m  <  A  <  1.8m,  1.8m  <  A  <  6.0m.  Us¬ 
ing  this  information  the  RCS  on  the  sphere  was  cal¬ 
culated  at  51  in  the  same  range.  Using  the  Method 
of  Moments  program  to  generate  the  same  informa¬ 
tion  would  increase  the  execution  time  by  a  factor 
of  eight.  In  Figure  1  we  see  the  results  of  applying 
the  Cauchy  method  to  the  evaluation  of  the  RCS 
of  a  sphere.  Here  the  RCS  is  plotted  over  a  decade 
bandwidth. 

Another  application  of  the  Cauchy  method  is  in 
creating  a  database  of  many  devices  working  in 
varying  operating  conditions.  The  Cauchy  program 
would  require  the  value  of  a  parameter  at  a  few  fre¬ 
quency  points  and  use  this  information  to  evaluate 
the  parameter  over  a  wide  frequency  band.  Over 
many  devices,  and  their  operating  conditions,  this 
would  yield  significant  savings  in  memory  require¬ 
ments. 

To  test  this  application  the  Y-parameters  of  a 
UM  PHEMT  were  measured  over  the  range  of  1.0- 
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Figure  1:  Radar  Cross  Section  of  a  sphere 


40.0GHz.  Just  five  of  these  points  were  used  as  in¬ 
put  to  the  Cauchy  program.  The  points  chosen  were 
at  the  frequency  points  l.OGHz,  lO.OGHz,  20.0GHz, 
30.0GHz,  and  40.0GHz.  This  resulted  in  a  numer¬ 
ator  polynomial  of  order  1  and  denominator  poly¬ 
nomial  of  order  2.  Figure  2(a)  shows  the  magni¬ 
tude  (|Fii|)  reconstructed  over  this  broad  frequency 
range.  Figure  2(b)  shows  the  phase  (/Fn)  over  the 
same  range.  As  can  be  seen  the  agreement  with  the 
measured  values  and  the  interpolated  values  is  ex¬ 
cellent. 

4.  Conclusions 

This  paper  has  presented  a  technique  with  many 
practical  applications.  The  Cauchy  method  starts 
with  assuming  that  the  parameter  of  interest,  as  a 
function  of  frequency,  can  be  approximated  by  a  sim¬ 
ple  rational  polynomial  function.  This  technique  has 
applications  to  many  practical  problems.  In  our  re¬ 
search  the  technique  is  applied  to  the  Method  of 
Moments,  optical  systems,  filter  analysis,  and  de¬ 


vice  characterization.  In  all  applications  the  Cauchy 
method  has  shown  to  save  time  and  memory. 
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Figure  2:  The  Cauchy  method  applied  to  Device  Characterization. 
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ABSTRACT 

In  this  paper  the  applicability  of  the  compact 
Green  functions  technique  in  the  problems  of 
signal  restoration  and  device  characterization 
is  discussed.  It  is  shown  that  the  compact 
Green  functions,  in  a  straightforward  way,  can 
be  used  to  restore  the  incident  signal  from  the 
distorted  signal  which  is  received  at  the  other 
end  of  the  line.  Further,  an  exact  method  to 
reconstruct  two  of  the  electrical  parameters  of 
a  nonuniform  transmission  line  from  the 
transient  response  is  presented.  This  method 
may  be  used  to  characterize  a  microwave 
devise  in  terms  of  transmission  line 
parameters. 

1.  INTRODUCTION 

The  basic  theoretical  model  of  distributed 
linear  networks  and  transmission  lines  is  the 
telegrapher’s  equations.  They  describe  the 
wave  propagation  characteristics  of  a 
transmission  line  (device)  in  terms  of  the 
distributed  electrical  parameters:  the 
inductance  L{x),  capacitance  C{x),  resistance 
R{x),  and  conductance  G{x).  In  this  paper  we 
present  the  latest  results  on  using  the  wave¬ 
splitting  technique  and  the  compact  Green 
functions  technique  for  time-domain  signal 
processing  and  device  characterisation. 

The  signal  processing  problem  we  address  is 
the  problem  of  restoring  signals  that  have 
been  distorted  by  a  nonuniform  transmission 
line.  The  method  we  present  is  exact  and 
works  as  long  as  the  attenuation  on  the  line  is 
moderate,  i.e.,  the  method  fails  when  the 
losses  are  so  high  that  the  signal  propagation 
is  more  diffusion-like  than  wave-like. 

In  the  device  characterisation  case,  we  assume 
that  the  telegrapher’s  equations  can  be  used 


as  a  model  for  the  device,  and  focus  on  a 
reconstruction  of  the  spatial  dependence  of 
two  of  the  four  electrical  parameters  of  the 
transmission  line  from  the  time-domain 
transient  response  of  the  line. 

2.  THE  MODEL  EQUATIONS  AND 
THE  WAVE-SPLITTING  TECHNIQUE 

Consider  a  lossy  nonuniform  transmission 
line  of  length  /  between  x=0  and  1.  The 
nonuniform  line  is  assumed  to  be  connected 
to  two  lossless  uniform  lines  at  the  two  ends. 
The  voltage  and  current  then  satisfy  the 
telegrapher’s  equations. 


-d,Vix,t)  =  -^d,I{x,t)  +  R{x)I{x,t) 
c{x) 

-dj{x,t)  =  —^—d,V{x,t)  +  G(x)V(x,t) 
Z(x)c{x) 

(1) 

where  and  d,  denote  differentiation  with 
respect  to  x  and  t,  respectively,  and  where  the 
c  h  a  r  a  cteristic  impedance, 

Z(x)  =  ^jL(x)IC{x) ,  and  the  wavefront 
velocity,  c{x)  =  ij L{x)C(x) ,  are  substituted 
for  the  inductance  and  capacitance.  The  use  of 
Z  and  c  instead  of  L  and  C  simplifies  the 
mathematics  and  is  in  fact  necessary  in  order 
to  reconstruct  the  parameters  L  and  C  from 
the  transient  response. 

The  wave-splitting  technique  is  based  on 
transforming  the  voltage  and  current  into  right 
and  left  moving  components  V*  and  V .  The 
choice  of  transform  is  arbitrary  as  long  as  it 
gives  split  components  that  are  equal  to  the 
physical  right  and  left  moving  signals  at  the 
lossless  and  uniform  ends  of  the  line.  The 
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transform  we  use  fulfil  that  criterion  and  gives 
simple  mathematical  expressions: 

V\x,t)  =  \{V{x,t)±Z{x)I{x,t))  (2) 

From  (1)  we  get  the  partial  differential 
equations  (PDFs)  for  V~ , 

ry"(x,ol  1  J  v\x,t) 

d,  + - d. 

v-(x,t)  c{x)  -y-(x,o 

L  V  ’  L  J 

a{x)  Pix)  V^{x,t) 

Y(x)  d(x)\_V~(x,t)_ 

where 


express  all  responses  in  terms  of  the 
transmitted  signal  instead  of  the  incident 
signal.  Using  the  transmitted  signal  is 
advantageous  in  both  signal  restoration  and 
parameter  reconstruction  problems.  It  can  be 
shown  that  the  split  components,  V^(x,t),  are 
given  by  the  following  expressions: 

V"(x,t  +  T(0,x))‘_ 
y(x,r  +  T(0,x)) 

.  r 

aix)V(t)  G'^^{x,t)*V'(t) 

0  J  [G‘=-{x,t)*r(t)_ 

where  t(0,x)  is  the  travel  time  from  0  to  x. 


a(,x)  =  ^-GZ-RY+ZJ) 

P(x)  =  i{-GZ  +  RY-ZJ) 

7W  =  i(GZ-RK-Z,y) 
e(x)  =  i{CZ+RY  +  ZJ) 

and  where  s  3^Z(x),  Y  =  yZ(x). 

By  transforming  the  problem  from  voltage 
and  current  to  right  and  left  moving 
components,  we  get  a  mathematical  system 
that  is  amenable  to  physical  interpretations 
[1],  and  suitable  for  numerical  treatment  by 
the  method  of  characteristics.  But  most 
important,  we  obtain  natural  boundary 
conditions  for  V~  at  the  ends  of  the 
nonuniform  line,  i.e.,  P*  are  equal  to 
outgoing  and  incoming  waves  at  the 
boundaries.  This  means  that  we  have  exact 
absorbing  boundary  conditions  for  V*'  in  this 
one-dimensional  problem. 

3.  THE  COMPACT  GREEN 

FUNCTIONS  TECHNIQUE 


and  a{x)  is  the  attenuation  factor; 

a{x)  =  exp|j^a(x')dx'j  = 

/  /  \ 

expm^(GZ  +  i?7)djc' 

G‘'~  are  compact  Green  functions,  and  * 
denotes  a  convolution  integral  in  time. 


The  attenuation  factor  and  the  compact  Green 
functions  comprise  all  information  of  the 
scattering  on  the  nonuniform  line.  O'"*  are 
called  compact  Green  functions  because  they 
are  equal  to  zero  outside  the  time-interval 
from  0  to  2t(x,/).  This  property  is  explicitly 
used  in  the  parameter  reconstruction  method. 
The  compact  Green  functions  satisfy. 


A  basic  property  of  linear  systems  is  that  the 
response  to  a  specific  signal  can  be  expressed 
as  a  convolution  between  the  fundamental 
solution  -the  Green  function-  and  the  signal. 
In  our  case,  the  input  is  an  incident  signal,  V‘, 
from  the  left  at  x=0,  and  the  output  is  a 
reflected  signal,  V,  at  x=0,  and  a  transmitted 
signal,  V‘ ,  at  x=l.  However,  in  one¬ 
dimensional  wave  propagation  it  is  possible  to 


\G^-ix,t)\  c{x)  '[g‘^-(x,0. 

a{x)  P(x)  G‘'''(x,t) 
Yix)  d(x)\[G^-(x,t)_ 

and  the  boundary  conditions. 
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(volt)  (volt) 


G"  {x,2r{x,l))  =-^c(x)Pix)G  ix,2T{x,l)) 

,  X 

G^  (x,0)  =-\c{x)Y{x)a  (x) 

4.  SIGNAL  RESTORATION 

Assuming  that  we  know  the  electrical 
parameters  of  the  transmission  line,  by 
expressing  the  split  components  in  the 
transmitted  signal,  the  signal  restoration 
problem  is  in  fact  solved.  Note  that  the 
original  signal  {Q,t) .  According  to 

equation  (5),  once  the  compact  Green 
functions,  G^^ix^t),  and  the  attenuation  factor, 
a(x),  have  been  calculated  using  equations  (4), 
(6),  (7)  and  (8),  we  get  the  original  shape  of 
the  signal  from  the  signal  received  at  x  =  I  by, 

V‘{t)  =  a{0)V'(t)  +  G'^(0,r)  *  V'it)  (9) 


Original  and  restored  signals 


Figure  1.  The  upper  graph  shows  the 
original  and  the  restored  signal.  The 
lower  graph  shows  the  received  signal 
with  and  without  added  noise. 

The  results  of  a  numerical  test  is  shown  in 
figure  1.  The  test  was  based  on  a  fictitious 
lossy  nonuniform  transmission  line  of  10  m 
length.  The  electrical  parameters  where 
chosen  so  that  the  signal  was  heavily  distorted 
by  the  line.  The  solid  lines  in  figure  1 
represent  the  original  and  received  signals.  To 
illustrate  the  well-posedness  of  the  restoration 
procedure,  noise  was  added  to  the  received 
signal  (the  broken  line  in  the  lower  graph). 


The  result,  given  by  equation  (9),  was  the 
broken  line  in  the  upper  figure.  It  is  clearly 
seen  that  the  restoration  works  well,  which  is 
a  consequence  of  that  the  method  is  exact. 
For  some  lossy  transmission  lines  a  simple 
RC-circuit  can  be  used  for  an  instantaneous 
restoration  of  the  signal  [2]. 

5.  PARAMETER  RECONSTRUCTION, 
DEVICE  CHARACTERISATION 

Assuming  that  a  device  can  be  modelled  by 
the  telegrapher’s  equations,  the  compact 
Green  functions  technique  can  be  used  to 
reconstruct  the  transmission  line  parameters 
from  the  transient  response  of  the  device.  If 
the  device  physically  is  not  a  transmission 
line,  the  algorithm  provides  the  equivalent 
transmission  line  that  corresponds  to  the 
given  transient  response. 

From  measurement  (or  specification,  in 
design  problems)  of  the  transient  response,  in 
terms  of  a(0),  G’^*{0,t)  and  G‘'“(0,r),  the 
boundary  conditions  (8)  and  the  PDFs  (7) 
can  be  solved  for  two  unknown  parameters 
provided  the  other  two  are  known.  However, 
the  reconstruction  is  only  unique  where  the 
conditions  in  table  I  are  satisfied  [3]. 


Parameters 

Condition  for  reconstruction 

c  andZ 

GZ-RY^Q 

c  and  R,  c  and  G 

d,Z{x)  ^  0 

Z  and  R,  Z  and  G 

no  restriction 

R  and  G 

9^Z{x)^Qor  {GZ-RY)^Q 

Table  I:  Conditions  for  reconstruction 
of  different  pairs  of  parameters. 

For  example,  the  reconstruction  of  the 
characteristic  impedance  and  the  wavefront 
velocity  is  only  possible  where  GZ  -  RY^O. 
In  the  numerical  example  shown  in  figure  2, 
GZ  -  RY  =  0  at  a:  =  0.85  and  it  is  seen  that 
the  reconstruction  of  Z  and  c  fails  around  that 
point. 
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c  (m/s)  ^  (ohm) 


Figure  2.  Reconstruction  of  Z  and  c. 
The  error  increases  in  the  vicinity  of 
X  =  0.85  since  GZ  -  RY  is  zero  there. 


6.  DISCUSSION 

The  objective  of  this  paper  was  to  show  the 
usefulness  of  the  wave-splitting  technique  and 
the  compact  Green  functions  technique  in  the 
area  of  signal  processing  and  device 
characterization.  An  exact  method  to  filter  the 
distortion  caused  by  a  nonuniform 
transmission  line  was  presented,  as  well  as  an 
exact  method  to  reconstruct  two  of  the 
transmission  line  parameters  from  the 
transient  response  of  the  line. 
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I.  Introduction 

The  Finite  Difference  Time  Domain 
Method  (FDTD)  is  a  very  powerful 
numerical  method  for  solving 
electromagnetic  (EM)  problems.  Due  to  its 
flexibility,  it  can  be  used  to  solve  problems 
which  have  very  complex  boundaries.  It  is 
well  known  that  the  FDTD  method  requires 
long  computation  times  for  simulating  the 
resonant  or  high-Q  structures.  The  reason 
for  this  is  because  the  algorithm  is  based 
on  the  leap-frog  formula.  The  quest  to  find 
good  predictor  for  enhancing  the  method  is 
very  interested  topic  in  EM  modelling. 

In  this  paper,  the  autoregressive  model 
(AR)  and  Backpropagation  (BP)  Neural 
Network  are  designed  as  predictors.  The 
Total  Least  Squares  (TLS)  method  is 
applied  to  obtain  AR  coefficients.  A 
waveguide  filter  is  used  as  an  example  and 
modeled  using  the  FDTD  method.  We 
demonstrate  that  a  short  segment  of  an 
FDTD  data  set  can  be  used  to  train  the 
predictors  and  the  predictors  can  predict 
later  information  very  well. 

II.  The  Brief  Background 

The  FDTD  algorithm  is  a  method  in  which 
the  central  difference  scheme  is  used  to 
discretize  Maxwell's  curl  equations  in  both 
time  and  space.  The  central  difference 
technique  can  contain  the  magnitude  of  the 
round-of  errors  so  that  second-order 
accuracy  is  achieved.  To  model  the 


electrical  and  magnetic  fields  in  space,  Yee 
introduced  his  cell  system  in  1966  [1].  The 
physical  basis  for  Yee's  cell  system  can  be 
easily  explained  using  Faraday's  and 
Ampere's  laws.  After  using  Yee's  cell 
system  to  describe  the  computational 
domain,  which  is  bounded  by  electric, 
magnetic  or  application  specific  absorbing 
walls.  Maxwell's  equations  are  essentially 
replaced  by  a  computer  which  calculates 
the  fields  at  the  grid  points  associated  with 
the  cells. 

In  this  paper,  a  waveguide  filter  which  is 
composed  of  a  segment  of  rectangular 
waveguide  and  two  metal  walls  with  cross¬ 
iris  (Fig.  1)  is  simulated  using  the  FDTD 
method.  The  cavity  between  the  two  metal 
walls  is  a  cross-iris  coupled  one-resonator 
filter.  After  a  pulse  is  launched  from  input 
port,  the  time-sampled  data  are  derived  both 
the  input  and  output  ports.  If  modelling 
accuracy  is  to  be  realized,  the  simulation 
can  not  be  stopped  until  the  incident  pulse 
totally  passes  the  sampling  points.  If  the 
device  is  a  high-Q  system,  the  sampling 
time  is  going  to  be  very  long.  Fig.  2  gives 
the  sampled  data  at  output  port.  It  can  be 
observed  that  after  60000  time  iterations, 
the  output  signal  has  still  does  not 
converged.  In  the  following  sections,  we 
will  demonstrate  how  AR  process  and  BP 
neural  networks  can  be  used  to  predict 
FDTD  data,  based  on  using  very  short  time 
samples. 
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III.  Autoregressive  Processes  and 
Backpropagation  Neural  Networks 

a.  Autoregressive  Processes  [2] 

An  AR  process  is  a  random  process,  which 
is  the  output  of  an  all-pole  filter,  when 
excited  by  white  noise.  In  the  time  domain 
it  can  be  described  by. 


norm  TLS  solution  is  given  by 


r+i 


r  =-J2L±___ 


i-p+l 


K 


ik=l 


(1) 


where  b;  is  the  ith  time  step  signal,  is  the 
AR  coefficients,  K  is  the  order  of  AR 
model  and  n;  is  white  noise.  Matrix  A  is 
covariance  matrix.  Equation  (1)  tells  us  that 
provided  is  suitably  small,  the  present 
value  of  output  y  can  be  predicted  from  a 
linear  combination  of  past  samples.  The 
Least  Squares  solution  can  provide  an 
estimate  of  best  solution  ,  Xls  ,  under  certain 
conditions. 

b.  Total  Least  Squares  (TLS)  [3] 

In  the  TLS  case,  there  is  error  in  both 
matrix  A  and  b.  Thus  the  regression 
equation  b  =  Ax  +n  may  be  written  in  the 
TLS  case  in  the  form 


c.  Multilayer  Perceptron  Neural 
Network  [4] 

Fig.3  depicts  a  portion  of  a  multilayer 
perceptron.  In  this  network,  two  kinds  of 
signals  are  identified.  First,  Function  Signal, 
which  is  an  input  signal  that  appears  at  the 
input  end  of  the  network  and  propagates 
forward  through  the  network.  Second,  Error 
Signal,  which  originates  at  output  neurons 
of  the  network,  and  propagate  backward 
through  the  network.  The  error  signal  is 
computed  by  each  neuron  of  the  network 
involves  an  error-dependent  function  in  one 
form  or  another.  Each  neuron  in  the 
network  can  be  described  by  following  pair 
of  equations: 


(A+E)x=b+r 


(2) 


where  E  is  the  error  in  A,  r  is  the  error  in 
b.We  wish  to  determine  value  for  E,  r  and 
value  XjLs  which  satisfy, 

ib+f)erangeiA+Ej^^’^^^P 

One  way  to  get  the  TLS  solution  is  to  carry 
out  a  SVD  of  [A  b]  and  assume  that  CTp  > 

=  ...  =  Ok+i  with  p<  K.  If  not  all  v^+i  j 
=  0,  i  =  p+1,  ...  K+1,  then  the  minimum 


where  k  is  the  kth  neuron  in  a  network; 

{Xj}  are  input  signals;  {w^j}  are  the  synaptic 
weights  for  neuron  k;  u,,  is  the  linear  output 
combiner;  0^  is  the  threshold;  <|)(...)  is  the 
activation  function,  which  can  be  linear  or 
nonlinear  functions;  and  b^  is  the  output 
signal  of  the  neuron.  The  first  hidden  layer 
outputs  are  the  inputs  of  the  second  layer, 
etc.  Basically,  the  BP  process  consists  of 
two  passes  through  the  different  layers  of  a 
network:  a  forwards  pass  and  a  backward 
pass.  In  the  forward  pass,  the  forward 


328 


signal  is  applied  to  the  nodes  of  the 
network,  and  its  effect  propagates  through 
the  network,  layer  by  layer.  Finally  a  set  of 
response  is  produced  at  the  output  layer 
.During  the  forward  pass,  the  weights  of  the 
network  are  all  fixed.  On  the  other  hand, 
during  the  backward  pass,  the  weights  are 
adjusted  in  accordance  with  the  error- 
correction  rule.  Specifically,  in  the  training 
procedure,  the  actual  response  of  the 
network  is  subtracted  from  the  desired 
response  to  produce  an  error  signal,  which 
propagates  backwards  through  the  network. 
The  synaptic  weights  are  adjusted  so  as  to 
make  the  actual  response  of  the  network 
move  closer  to  the  desired  response. 

IV.  Numerical  Results  and  Discussions 

In  this  section,  the  data  from  2500  to  5000 
iterations  in  Fig.2  is  used  to  train  AR 
coefficients  based  on  TLS,  because  before 
2500  iterations  the  system  is  in  a  period  of 
transition.  After  taking  the  SVD  of  [A  b]  ( 
K  =  2  ),  we  have  Z  =  diag  (0.01505, 
0.0014044,  0.0000013519).  Here  again  the 
order,  p,  of  the  TLS  can  be  taken  to  be 
equal  to  2.  From  Eq.  (4)  we  have  x.j-ls  ~ 
[1.9988,  -0.9999]^.  Fig.4  shows  a  detailed 
comparison  of  real  data  (from  56500  to 
57500  iterations)  and  predicted  data, 
obtained  using  the  AR  model.  The  AR 
coefficients  are  obtained  using  TLS  with  K 
=  2.  There  is  good  agreement. 

An  neural  network  is  designed  with  a  input 
layer,  2  hidden  layers  and  a  output  layer.  In 
the  network,  the  neurons  on  each  layer  are 
6,  3,  3,  1 .  The  activation  function  applied  to 
the  hidden  layers  is  a  sigmoidal  nonliearity 
defined  by  logistic  function: 


1  +exp(-Hj^+0j^) 

and  the  activation  function  on  the  output 
layer  is  linear  function.  Again,  the  data 
from  2500  to  5000  iterations  decimated  by 
the  factor  10  are  used  to  train  the  network. 
Fig.5  shows  the  comparison  between  56500 
to  57500  iterations.  Again  a  very  good 
agreement  is  observed.  More  details  will  be 
presented  on  conference. 

V.  Conclusions 

The  TLS  method  is  used  to  estimate  AR 
coefficients,  which  are  used  in  an  AR 
model.  A  multilayer  perceptron  unreal 
network  is  trained  by  the  Error- 
Backpropagation  Algorithm.  The  AR  model 
and  BP  neural  network  are  used  as 
predictors  and  trained  by  a  very  small 
portion  of  early  data  set  for  the  output  of  a 
waveguide.  The  results  show  that  both 
predictors  can  give  good  predictions. 
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Fig.l  WR  62  waveguide  housing. 
Resonator  length  12.155  mm. 
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Fig.2  60000  iterations  FDTD  data  sampled 
at  output  port  of  the  filter. 
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Fig.4  The  comparison  between  FDTD  and 
AR  model  predicted  data  (from 
10000  to  11000  iterations). 


Fig. 5  The  comparison  between  FDTD 
and  BP  network  predicted  data 
(from  59000  to  60000  iterations). 


Fig.3  A  multilayer  Perceptron  Network 
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1.  ABSTRACT 

Like  most  inverse  problems  the  complex  image  solu¬ 
tion  of  the  Green’s  function  of  multilayer  media  is  non¬ 
unique.  The  non-uniqueness  makes  the  variations  of 
the  image  amplitudes  and  locations  unsteady  with  fre¬ 
quency.  This  is  inconvenient  for  wideband,  such  as 
digital,  circuit  computations.  Nevertheless,  the  non¬ 
uniqueness  also  points  to  the  possibility  of  strong  steadi¬ 
ness  by  fixing  the  image  at  real  locations,  say  to  those 
of  the  static  images,  without  sacrificing  convergence. 
These  new  images  are  renamed  simulated,  they  are 
generated  easily  by  the  moment  method  in  the  spec¬ 
tral  space  without  the  Prony’s  method. 

2.  INTRODUCTION 

Replacing  the  tedious  Sommerfeld  integrals  in  construct¬ 
ing  the  spatial  Green’s  functions  of  a  multilayer  medium, 
Chow  et  al  [1],  [2]  developed  the  quasi-analytic  com¬ 
plex  image  technique.  Requiring  only  3  or  4  images 
with  complex  locations,  the  technique  is  accurate,  and 
indeed  rapidly  convergent,  and  also  general. 

The  underlying  mathematics  of  the  technique  is  the 
Prony’s  method  which  is  really  a  curve  fitting  proce¬ 
dure,  in  the  spectral  space  of  the  Green’s  function,  for 
the  images  akin  to  remote  sensing  (i.e.  inversion)  of 
sources.  The  nonuniqueness  of  a  inversion  solution  re¬ 
sults  in  certain  gives  between  image  locations  and  am¬ 
plitudes  and  means  their  unsteady  variations  over  a 
frequency  band.  Happily,  like  other  inversions,  all  vari¬ 
ations  give  the  correct  values  on  the  spatial  Green’s 
function  fields.  .  This  tolerance  is  convenient  for  field 
evaluations  at  a  single  frequency,  but  not  so  for  wide 
band,  such  that  one  can  quickly  and  accurately  inter¬ 
polate  the  images  at  intermediate  frequency  points  be¬ 


tween  salient  frequency  points. 

On  the  other  hand,  the  nonuniqueness  also  points 
to  the  possibility  of  having  only  a  few  simulated  images 
with  real  locations,  as  opposed  to  complex  images  with 
complex  locations,  but  still  obtaining  high  accuracy  in 
the  Green’s  functions.  To  satisfy  the  low  frequency 
asymptotes  the  real  locations  of  the  images  can  be  the 
first  4  of  the  classical  static  images.  With  the  locations 
fixed,  we  use  moment  method  in  the  spectral  space^  and 
not  Prony’s  method,  to  find  the  images  amplitudes. 
The  amplitudes  are  now  indeed  smooth  varying  func¬ 
tion  of  frequency. 

3.  NUMERICAL  RESULTS 

The  spatial  Green’s  functions,  from  4  simulated  images 
and  4  complex  images,  are  ceilculated  for  comparison. 
The  microstrip  structure  has  dielectric  height  of  1  mm 
with  Cr  =  12.6  . 

As  we  mentioned  before  the  complex  images  are 
nonunique,  both  in  amplitudes  and  in  image  locations. 
We  shall  show  only  the  image  locations  of  the  first  two 
out  of  four  as  a  function  of  frequency  in  Figure  1.  Since 
the  locations  are  complex,  we  shall  show  only  the  am¬ 
plitudes  of  the  locations  of  the  vector  potential 
along  the  dielectric  interface.  We  may  call  the  image 
farther  away  from  the  source  as  image  number  1  and 
the  closer  are  as  number  2.  We  see  that  the  amplitudes 
of  the  locations  indeed  are  unsteady.  In  fact  images 
number  1  and  number  2  appear  to  shift  positions  in 
Figure  1.  Similar  unsteadiness  is  observed  for  the  im¬ 
age  amplitudes,  for  all  four  images. 

Despite  such  unsteadiness,  the  Green’s  functions 
from  the  complex  images  remain  very  steady  and  well 
behaved  in  Figure  2  and  3.  The  results  of  complex 
images  in  Figure  2  and  3  are  circles,  the  results  of 
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simulated  images  are  dotted  lines.  We  see  that  their 
agreements  are  excellent  until  distance  kop  along  the 
dielectric  interface  becomes  unity. 

The  four  simulated  images  in  calculating  the  Green’s 
functions  in  Figure  2  and  the  four  in  calculating  in  Fig¬ 
ure  3  are  perfectly  steady.  Both  sets  of  the  image  loca¬ 
tions  are  chosen  to  be  those  of  electrostatic  images  and 
naturally  they  are  steady.  The  amplitudes  of  the  vec¬ 
tor  potential  and  scalar  potential  Gq  are  shown  in 
Figure  4  and  5  and  they  are  indeed  steady  and  smooth 
as  a  function  of  frequency. 

It  is  observed  that  beyond  6.7  GHz,  when  the  first 
simulated  image  is  one  radian  electrically  from  the  source, 
the  image  amplitudes  starts  to  vary  with  frequency.  Be¬ 
low  6.7  GHz,  the  amplitudes  of  the  simulated  images 
are  near  constant  but  are  not  equal  to  those  of  the 
quasi-dynamic  images  in  general,  since  for  good  accu¬ 
racy  the  latter  images  may  number  80  for  €r  =  12.6 
instead  of  just  the  first  4. 

4.  DISCUSSIONS 

Near  the  source  and  at  low  frequencies  ,  the  simulated 
images  are  just  as  fast  convergent  and  accurate  as  the 
complex  images.  At  higher  frequencies,  however,  since 
the  images  locations  are  fixed  and  real,  we  find  that 
simulated  images  are  not  as  accurate,  until  we  double 
the  image  number. 


This  may  not  be  a  drawback  for  wide  band  (e.g. 
digital)  signals  since  they  are  generally  in  a  lower  fre¬ 
quency  than  microwave.  Then  the  advantage  of  smooth 
varying  image  amplitudes  with  frequency  makes  the 
simulated  images  a  better  choice  for  interpolation  and 
rapid  convergence. 
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Figure  1:  Amplitude  of  locations  of  complex  images  for 
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Figure  3:  The  amplitude  of  the  scalar  potential  Gq. 


Figtiie  4:  The  relative  amplitudes  of  images  of 


Figure  5:  The  relative  amplitudes  of  images  of  Gq. 
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Abstract 

This  paper  shows  that  given  frequency  domain  infor¬ 
mation  over  a  limited  frequency  range,  it  is  possible 
to  extract  information  over  a  broad  band  and  ex¬ 
tract  a  real,  causal  time  domain  response.  This  is 
done  through  the  use  of  the  Hilbert  transform.  This 
technique  for  example  can  be  implemented  in  net¬ 
work  analyzers  and  waveform  processing  equipment. 

1.  Introduction 

System  measurements  may  be  performed  in  either 
the  time  domain  or  the  frequency  domain.  Time  do¬ 
main  measurements  are  easier  to  perform  since  the 
waveforms  of  interest  are  all  real.  However,  one  dis¬ 
advantage  is  the  limited  dynamic  range  of  available 
systems.  Frequency  domain  measurement  equip¬ 
ment  benefit  from  large  dynamic  range.  Further¬ 
more,  frequency  domain  measurements  maybe  car¬ 
ried  out  either  over  an  entire  range  of  frequencies  or 
selectively  over  a  band  of  frequencies.  Theoretically, 
it  is  possible  to  extract  a  time  domain  response  from 
these  measurements  by  an  inverse  Fourier  transform. 
But,  if  the  measurements  are  made  in  a  noisy  envi¬ 
ronment,  it  is  difficult  to  recover  the  true  time  do¬ 
main  response. 

AU  physical  time  domain  responses  are  causal,  in 
that  the  signal  is  non-zero  only  for  times  greater 
than  zero.  However,  since  band-limited  complex  fre¬ 
quency  domain  data  does  not  guarantee  causality  in 
the  time  domain,  nor  a  real  time  domain  response, 


measurements  carried  out  in  the  frequency  domain 
do  not  truly  represent  the  transient  response  of  the 
system.  Even  so,  we  establish  that  it  is  possible  to 
extract  a  causal  response  by  extrapolating  the  com¬ 
plex  frequency  domain  data  under  the  premise  that 
the  time  domain  signal  must  be  causal.  The  objec¬ 
tive  of  this  paper  is  to  extrapolate/interpolate  band- 
limited  frequency  domain  data  so  as  to  facilitate  the 
extraction  of  a  causal  time  domain  response. 

In  general,  the  real  and  imaginary  parts  of  the 
complex  frequency  domain  data  are  independent  of 
each  other.  However,  the  causality  of  the  time  do¬ 
main  signal,  denoted  as  /i(t),  assures  us  that  the  real 
and  imaginary  components  of  the  frequency  domain 
are  related  through  the  Hilbert  Transform  [1]. 

Since  we  process  discrete  frequency  domain  data, 
we  handle  frequency  and  time  domain  signals  in  the 
form  of  sequences.  A  numerical  example  proves  the 
usefulness  of  the  technique. 

2.  Transform  Relationships 

This  section  briefly  covers  some  of  the  properties 
of  sequences  and  their  Fourier  and  Hilbert  trans¬ 
forms.  Any  periodic,  real,  time  domain  sequence 
h[n]  can  be  expressed  as  a  sum  of  an  even  sequence 
h^e[a]  and  an  odd  sequence  /ipoM*  Also,  the  Fourier 
transform  of  the  even  and  odd  parts  of  a  sequence 
are  the  real  and  imaginary  parts  of  the  original  se¬ 
quence  hp[n\  [2]. 

Also,  for  real  which  is 

an  even  function,  and  which 

is  an  odd  function. 
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Consider  a  periodic,  real,  time  domain  sequence 
hp[n]  with  period  iV,  that  is  related  to  a  finite  length 
sequence  h[n]  of  length  N  by 

oo 

i——oo 

We  know 

and  similarly 

/ipo[^]  —  2^^pW  ~  ^p[~^l} 

If  we  have  N  =  2r  (where  r  is  a  positive  integer) 
then,  we  can  express  hp[n]  as 


the  real  and  imaginary  parts  of  the  Fourier  transform 
of  h[n]. 

'  W  H?it[Tn]VN[k  -  m] 

jH^k]  =  I  Q<k<N-l  (6) 

0  otherwise 

V 

Equation  (6)  is  the  Hilbert  transform  relationship 
between  H^[k]  and  H^[k].  Thus  with  a  knowledge 
of  one  the  other  can  be  evaluated.  This  procedure 
forms  the  basis  of  our  technique  for  the  extraction 
of  a  real,  causal  time  domain  response  from  band- 
limited  complex  frequency  domain  data.  The  theo¬ 
retical  development  assures  us  that  by  computing 
the  DFT’s  and  IDFT’s  the  original  real  time  se¬ 
quence  will  not  lose  its  causal  nature. 

3.  Extrapolation  Of  Frequency  Domain  Data 


hp[n]  =  hpe[n]upN[n] 

where 

'  2  n=  -1 

«pJvW  =  <  1  n  =  0,  Y  (2) 

0  n=  ^  +  -  1 

or 

UpN[n]  =  2u[n]  -  2u[n  “  y]  “  “  y]  (3) 

where  u{n]  is  the  unit  step  sequence,  and  ^[n]  is 
the  unit  sample  sequence  [2]. 

Also  the  odd  part  of  the  sequence  can  be  expressed 
as, 

r  hp,[n]  n=l,---f-l 

hpo[n]  =  •{  0  n  =  0,  f  (4) 

[  -hpe[7i]  n  =  ^ -I- -  1 

Using  some  derivations  and  the  definition  of  the 
Fourier  Transform,  we  have 

.  N-\ 

jHp^ik]  =  -^  Hp^VpN[k  -  m]  (5) 

This  is  the  Hilbert  transform  relationship  between 
the  real  and  imaginary  parts  of  the  Fourier  transform 
of  a  periodically  causal  sequence.  If  h[n]  =  0  for 
n  <  0  and  for  n  >  N/2  then  the  periodicity  maybe 
removed  and  we  have  the  same  relationships  between 


The  method  to  generate  a  causal,  finite  duration 
time  domain  sequence  h[i],{h[i]  =  0  for  i  <  0  and 
i  >  N)  given  a  band-limited  data  samples  of  its 
Fourier  transform  is  described.  The  iterative  tech¬ 
nique  discussed  in  the  subsequent  part  of  the  section 
describes  a  sequence  of  transformations  between  the 
time  and  frequency  domains.  To  describe  the  algo¬ 
rithm  utilized  in  our  technique  we  shaU  assume  that 
the  band-limited  frequency  domain  data  is  available 
between  frequencies  /I  and  /2  and  sampled  at  n  fre¬ 
quency  points  in  between  fl  and  /2  .  This  data  can 
be  expressed  as  a  row  vector  of  length  n,  represented 
as 

where  the  notation  H{1 :  n)  indicates  that  samples 
1  through  n  are  elements  of  the  row  vector  H.  In 
general  H{p  :  q)  indicates  samples  Hp  through  Hq. 

In  order  to  extract  the  real  and  causal  time  domain 
signal  for  which  we  have  the  complex  frequency  do¬ 
main  data  in  the  frequency  band  [/l,/2],  we  first 
extrapolate  the  available  frequency  domain  data  be¬ 
tween  frequencies  [0,  fl]  and  from  [/2,  /3],  where  /3 
is  a  frequency  chosen  until  which  extrapolation  is 
considered  necessary.  Since  the 

extrapolated  data  between  frequencies  [— /3,0]  wiU 
be  symmetrical  to  that  between  frequencies  [0,/3]. 
Therefore,  extrapolation  in  the  negative  frequency 
domain  will  not  require  any  additional  computation. 
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The  steps  involved  to  extract  the  time  domain  se¬ 
quence  from  a  band-limited  frequency  domain  data 
are  explained  below. 

1.  The  available  band-limited  frequency  domain 
data  H{1  :  n)  is  padded  with  zeros  to  the  extent  nec¬ 
essary.  To  extract  a  casual,  time  domain  sequence 
of  finite  length  n  samples  the  frequency  data  has  to 
be  padded  with  zeros  to  a  minimum  of  2N  points, 
providing  a  sequence  of  even  length.  Therefore  the 
available  frequency  domain  data  has  to  be  padded 
with  2N—n  zeros.  A  zero  padded  vector  HZ{1  :  2N) 
is  formed  from  if(l  :n)  as 

HZ{1 :  n)  =  [^1,  H2,  .  .Hn,  0,  0 . .  .0] 

2.  The  zero  padded  complex  frequency  domain 
data  is  now  split  into  corresponding  real  and  imagi¬ 
nary  parts. 

HZ^  =  Real(HZ) 

HZss  =  Imag{HZ) 

3.  An  inverse  discrete  Hilbert  transform  of  the 
real  part  of  the  frequency  data  is  performed  which, 
gives  us  the  imaginary  sequence 

4.  The  imaginary  part  of  the  band-limited  fre¬ 
quency  data  extracted  in  step  2  is  substituted  into 
the  data  generated  by  the  discrete  Fourier  transform 
of  the  odd  sequence  in  the  previous  step. 

H  ^^(suhs)  ~ 

[hZ^{1  ;  n),  +  1  :  2N)\ 

5.  The  new  imaginary  data  H  thus  formed 

is  subjected  to  a  discrete  Hilbert  transform,  which 
will  return  an  improved  version  of  the  real  sequence. 
The  even  sequence  of  samples  are  subjected  to  a  dis¬ 
crete  Fourier  transform  so  zis  to  recreate  the  real  part 
of  the  spectrum. 

6.  The  real  part  of  the  band-limited  frequency 
data  extracted  in  step  2  is  substituted  into  the  data 
generated  by  the  discrete  Fourier  transform  of  the 
even  sequence  in  the  previous  step. 

^  ^^{subs)  ~ 

:  n)H Z^(^new){'i^  +  1 :  2iV)j 


7.  Subsequent  processing  is  an  iteration  of  the 
steps  2-11. 

The  results  may  not  be  absolutely  convergent  if 
sufficient  amount  of  band-limited  data  is  not  avail¬ 
able,  however  this  also  depends  on  the  energy  con¬ 
tent  of  the  spectral  components  in  the  band-limited 
data. 

Subsequent  sections  illustrate  the  efficacy  of  this 
technique.  We  have  provided  several  simulation  re¬ 
sults.  We  have  also  implemented  this  technique  on 
actual  frequency  domain  measurements  performed 
on  a  Hewlett  Packard  network  analyzer  HP  8510B. 

4.  Simulation  Results 

The  transfer  characteristics  of  a  band  pass  filter 
are  measured  at  415  points  between  4.31  GHz  and 
7.415  GHz.  60  points  (between  points  200  and  260) 
were  discarded.  Using  the  Hilbert  transform  tech¬ 
nique,  the  missing  data  was  reconstructed.  Figure 
1  shows  the  comarision  of  the  reconstruction  of  the 
real  part  with  the  original  data.  Figure  2  shows  co¬ 
marision  of  the  reconstruction  of  the  imaginary  part 
with  the  original  data.  In  both  cases  the  reconstruc¬ 
tion  is  very  good. 

5.  CONCLUDING  REMARKS 

The  Hilbert  transform  relationship  between  the 
real  and  imaginary  parts  of  the  frequency  domain 
response  for  causal  time  domain  signals  proved  to 
be  a  sound  basis  for  the  extrapolation  of  the  fre¬ 
quency  domain  data  and  extraction  of  a  causal  and 
real  time  domain  response.  The  reconstruction  of 
the  time  domain  signal  is  excellent  given  a  sufficient 
number  of  iterations.  Specifically,  the  peak  value  of 
the  impulse  response  and  the  shape  of  the  time  do¬ 
main  response  are  accurately  represented.  The  pres¬ 
ence  of  an  imaginary  component  in  the  time  domain 
response  as  a  result  of  the  inverse  Fourier  transform 
of  band-limited  frequency  domain  data  is  nullified  as 
a  result  of  the  processing. 

Application  of  this  technique  in  the  case  of  noisy 
frequency  domain  data  has  been  discussed.  With  an 
approximate  knowledge  of  the  bands  of  frequency 
data  which  have  been  corrupted  by  noise  it  was 
shown  possible  to  extract  the  true  time  domain  re¬ 
sponse.  The  technique  was  applied  to  measurements 
performed  on  the  HP  8510B  vector  network  analyzer 
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Figure  1.  Real  Part 

for  a  Sliding  load  standard  and  the  results  have  been 
discussed. 
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Abstract 

The  essential  low  temperature  electrical  characteristics 
of  HEMTs  devices  are  presented.  The  emphasis  is  put  on 
practical  aspects  of  prime  importance  for  the  circuit 
designer  such  as  the  collapse  preventing  techniques  or 
the  noise  behaviour.  Some  examples  of  cryogenic  analog 
circuits  are  then  described  with  their  performances 
compared  to  their  ambient  temperature  counterparts. 


Introduction 

The  low  temperature  behavior  of  high  electron 
mobility  transistors  (HEMTs)  has  been  largely 
investigated  since  these  devices  have  been  introduced 
[1-4].  The  aim  of  these  studies  was  to  check  if  the 
enhanced  mobility  in  the  undopped  active  channel  of  the 
HEMT  could  result  in  very  high  device  performance. 
Even  if  the  cut-off  frequency,  the  gain  and  the  high 
frequency  noise  figure  have  shown  to  improve  largely  at 
liquid  nitrogen  temperature  with  some  devices,  others 
were  affected  by  parasitic  effects  such  as  the  «  collapse  » 
of  the  I-V  curves  at  low  drain  voltages  [3-5] . 
A  considerable  work  has  thus  been  carried  out  to 
understand  and  circumvent  this  anomalous  behavior 
[3-11].  Because  preventing  the  collapse  is  a  prerequisite 
for  a  HEMT  device  to  be  used  in  a  cryogenic  circuit,  we 
will  firstly  present  in  this  paper  some  possible  solutions 
to  circumvent  this  problem.  We  will  also  address  others 
specific  behaviors  of  cryogenically  cooled  HEMTs  such 
as  the  transconductance  and  the  noise  figure 
improvement  at  low  temperatures.  These  behaviors  will 
be  discussed  in  term  of  applications,  enlightening  the 
advantages  of  a  HEMT  based  cryoelectronics.  For 
example,  the  growing  field  of  hybrid  superconductor- 
semiconductor  circuits  needs  a  performant  and  reliable 
active  device  such  as  the  HEMT.  As  an  illustration,  low 
noise  HEMT  amplifiers  where  the  superconducting 
material  is  used  to  realize  low  loss  band-pass  filters,  or 
low  phase  noise  HEMT  oscillators  benefit  from  the  high 
quality  factor  of  superconducting  passive  elements. 


1.  The  problem  of  collapse 

Figure  1  illustrates  the  complexity  of  HEMTs  I-V 
curves  at  low  temperature.  It  represents  the  drain  current 
characteristics  of  an  AlGaAs/GaAs  HEMT  at  77K 
obtained  with  three  different  measurement  procedures  : 
DC  in  the  dark,  DC  under  light  and  pulsed  in  the  dark 
with  a  pulsed  width  of  500  ns  superimposed  on  a  DC 
bias  (Vjs  =  2  V,  Vg^  =  -0.1  V).  The  DC  curve  in  the 
dark  has  been  obtained  after  an  initial  plot  of  the 
Vgs  =  OV  curve.  This  initial  plot  has  induced  the 
collapse  phenomenon  which  consists  in  a  strong 
reduction  of  the  current  at  low  drain  voltages.  It  is 
caused  by  electron  trapping  on  deep  centers,  particularly 
in  the  dopped  AlGaAs  layer. 


Figure  1  :  HEMT  TO8902  I-V  curves  at  77  K 

The  same  experiment  with  other  device  structures 
such  as  pseudomorphic  AlGaAs/GalnAs  HEMTs 
(PHEMTs)  would  have  shown  different  results  with  in 
some  cases  no  collapse  at  all.  This  is  due  to  a  reduced 
trap  density  in  these  devices  and/or  better  carrier 
confining  in  the  channel  [8,12].  However,  even  in  these 
improved  devices,  the  collapse  may  occur  under  specific 
bias  conditions  [10],  As  an  example,  a  few  minutes 
stress  at  about  V^^  =  IV  and  Vg^  >  OV  will  cause 
carrier  delocalisation  and  trapping  ;  the  result  is  not 
always  full  collapse  but  a  largely  reduced  density  of  free 
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carriers  and  hence  of  the  drain  current.  Others 
technological  solutions  have  been  proposed  such  as 
superlattice  HEMTs  [9]  or  moving  the  gate  toward  the 
drain  electrode  [3,6].  However  preventing  any  carrier 
trapping  and  hence  collapse  whatever  the  bias  conditions 
is  not  an  easy  goal,  even  in  these  improved  structures, 
and  further  work  has  still  to  be  done. 

To  overcome  these  problems,  it  is  possible  to  operate 
the  device  under  light  illumination  [2],  Besides  the 
practical  difficulties  related  to  this  method,  the  noise 
properties  under  light  are  not  well  known.  This  is  the 
reason  why  we  have  tried  to  take  benefit  of  a  different 
collapse  recovery  mechanism  [10,11]  :  the  high  drain 
voltage  recovery  described  by  Hori  [7].  An  illustration 
of  this  recovery  is  provided  by  the  HEMT  pulsed  I-V 
characteristic  of  Figure  1 .  This  mechanism  has  also  been 
observed  in  PHEMTs  devices  and  occurs  in  these 
devices  at  lower  drain  voltages  than  in  HEMTs.  Figure  2 
shows  a  PHEMT  I-V  characteristics  measured  after 
biasing  the  device  five  minutes  at  Vds  =  1  V  and 
Vgs  =  0  V.  The  collapse  is  clearly  visible  on  the  first 
curve  (Vgs  =  OV)  but  has  disappeared  for  the  others 
gate  bias. 


0  0.5  1  1.5  2  2.5 

Drain  Voltage  Vds  (V) 

Figure  2  :  PHEMT  MGF4416  DC  I-V  curves  at  77  K 

The  collapse  suppression  occurs  since  the  drain  voltage 
has  been  raised  above  the  kink  (Vds  >1.5  V),  i.e.  in 
the  impact  ionization  regime,  during  the  initial  plot  at 
Vgs  =  OV  [7].  The  essential  practical  consequences  of 
this  behavior  is  that  the  HEMT  nonlinear  modeling  at 
low  temperature  has  to  be  carried  out  from  I-V  pulsed 
characteristics  with  a  superimposed  DC  bias  chosen  in 
the  collapse  recovery  zone,  i.e.  generally  above  1.5  V. 

2.  Others  specific  behaviors  at  low  temperature 

The  first  high  frequency  parameter  affected  by  the 
temperature  is  the  transconductance  g^  which  is  expected 
to  increase  while  lowering  the  temperature  because  of 
the  carrier  mobility  enhancement  in  the  undopped 


channel  of  the  HEMT.  It  has  been  said  that  lowering  the 
temperature  is  more  or  less  like  reducing  the  dimensions 
of  Ae  device.  However,  below  a  0.1  pm  gate  length, 
new  carrier  transport  phenomena  are  expected  and  this  is 
what  is  actually  observed  at  low  temperature  :  the 
improvement  depends  on  the  device  gate  length  [13]  and 
is  more  sensitive  (about  50%)  for  a  few  tenth  of  micron 
devices  than  for  0.1  pm  devices.  This  is  due  to  short 
channel  effects  such  as  self  heating  or  impact  ionization. 
However,  this  behavior  is  largely  dependent  on  the 
device  structure  [14]  and  an  improvement  of  g^  has 
nevertheless  been  observed  in  0.1  pm  InP  based  HEMTs 
at  low  temperature  [15].  Also  note  that  the  g^, 
enhancement  goes  with  an  enhancement  of  the  non-linear 
behavior  because  of  the  increase  of  the  threshold  voltage 
Vt  often  observed  at  low  temperature  (Figure  3). 


Figure  3  :  FHX04  DC  transconductance  at  Vds  =  2V 


The  second  parameter  affected  by  the  temperature  is 
the  output  conductance.  When  a  drain  control  is  induced 
by  the  short  channel  effect  or  when  the  carriers  are 
multiplied  by  impact  ionization,  the  output  conductance 
raises  and  this  may  be  a  serious  drawback,  specially 
when  power  output  is  needed. 

Finally,  the  more  interesting  features  at  low 
temperature  may  be  the  reduction  of  the  parasitic 
elements  of  the  transistor  [14,15],  including  the 
capacitances  [6,15].  This  results  in  very  high  cut-off 
frequencies  [14,15]  and  should  permit  the  development 
of  a  three  terminal  devices  cryoelectronics  above 
100  GHz. 

3.  Low  frequency  noise 

Low  frequency  (LF)  noise  is  an  important  parameter 
in  circuits  where  a  mixing  process  is  involved  such  as 
oscillators  or  mixers.  It  has  been  shown  to  be  strongly 
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temperature  dependent  but,  unlike  thermal  noise  which  is 
preeminant  at  high  frequencies,  LF  noise  is  not 
systematically  reduced  at  low  ternperamre.  Typical  input 
referred  LF  noise  spectra  at  ambient  temperature  show 
several  bulges  of  generation-recombination  (g-r)  noise 
superimposed  on  1/f  noise  [16].  Each  g-r  noise  bulge 
corresponds  to  a  trapping  phenomenon,  the  frequency  of 

the  bulge  being  equal  to  -J—  where  t  is  the  time 

2llT 

constant  of  the  trap.  When  cooling  the  device,  x 
increases  and  this  produces  a  bulge  shift  toward  the  low 
frequency  range.  This  phenomenon  is  clearly  shown  on 
Figure  4  where  a  reduction  of  the  noise  above  1  kHz  can 
be  noticed  at  110  K,  but  it  goes  with  an  increase  of  the 
noise  under  1  kHz. 


Frequency  (Hz) 

Figure  4  :  Input  LF  noise  spectral  density  for  a  PHEMT 
device  at  ambient  and  low  temperatures 

4.  High  frequency  noise 

The  noise  behavior  of  cryogenically  cooled  HEMT 
must  be  well  characterized  to  design  low-noise  cooled 
amplifiers  for  radioastronomy  or  communication  systems 
applications  [17,18].  Cooling  the  device  involves  a 
decrease  of  the  minimum  noise  figure  Fn,i„,  and  an 
increase  of  the  associated  gain  G,  [19-21],  mainly  related 
to  transconductance  enhancement. 

Figure  5  represents  the  variations  of  F^jn  against 
drain  source  voltage  V^s  of  a  commercial  AlGaAs/GaAs 
HEMT  (Toshiba  J8905)  at  18  GHz  and  =  OV.  The 
optimum  value  of  V^s  is  shifted  towards  higher  values  at 
77  K,  above  the  kink  observed  on  the  I-V  curves.  At 
both  294  K  and  77  K  the  increase  of  Fmi„  for  small 
values  of  is  attributed  to  a  decrease  of 

transconductance.  The  variations  of  F„i„  against 
normalized  drain  current,  (WW.  are  also  reported  on 
Figure  6  [22]  for  the  same  device  at  ¥<,5  =  2V  and 
f  =  18  GHz.  The  minimum  noise  figure  exhibits  a 


minimum,  both  at  room  and  cryogenic  temperature,  at 
about  Ids/Idss  =  0-3-  Fmin  decreases  from  1.3  dB  to 
0.4  dB  at  77  K.  For  higher  drain  current  values  the 
benefit  of  cooling  the  device  is  reduced.  This  behavior  is 
attributed  to  parallel  conduction  in  the  doped  AlGaAs 
layer  observed  in  DC  measurement  at  77  K. 


Drain  Voltage  Vds  (V) 


Figure  5  ;  JS8905  mimum  noise  figure  versus  drain 
voltage,  Vgs  =  OV,  f  =  18  GHz 


Normalized  Drain  Current  Ids/Idss 

Figure  6  :  JS8905  minimum  noise  figure  versus  drain 
current,  Vds  =  2  V,  f  =  18  GHz 

When  the  devices  are  correctly  biased,  ultra-low 
noise  cooled  amplifiers  have  been  realized  with  noise 
figure  less  than  0.7  dB  (at  T=12.5  K)  in  the  26-36  GHz 
frequency  range  [17].  More  recently,  improved 
performances  (noise  figure  less  than  0.3  dB)  have  been 
obtained  at  18  K  using  InP-based  HEMT's  in  the 
40-50  GHz  frequency  range  [23]. 

5.  Applications 

Very  low  noise  microwave  amplifiers  have  already 
been  presented.  Apart  from  this  obvious  application  of 
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cryogenically  cooled  HEMTs,  many  others  low 
temperature  electronics  systems  may  be  designed.  The 
followings  are  only  two  examples. 

5.1  Cryogenic  oscillators  : 

The  design  of  low  phase  noise  cryogenic  oscillators  is 
possible  thanks  to  the  availability  of  very  high  Q  factor 
resonators,  even  at  liquid  nitrogen  temperature  [24]. 
III-V  semiconductor  devices  are  well  suited  for  this 
application  because  silicon  BJTs  cannot  be  used  at  these 
temperatures.  However  the  noise  performance  of  the 
oscillator  will  be  directly  related  to  the  LF  noise  of  the 
transistor.  According  to  the  results  detailed  in  III,  good 
performance  may  be  expected  at  baseband  frequencies  in 
excess  of  1  kHz  but  obtaining  low  noise  very  close  from 
the  carrier  ( <  100  Hz)  will  need  a  careful  choice  of  the 
device  [25,26,27].  In  spite  of  this  problem,  cryogenic 
sapphire-superconductor  resonator  oscillators  are  today 
the  best  choice  for  extremely  high  spectral  purity 
microwave  sources  [28]. 

5.2  Liquid  helium  temperature  electronics  : 

III-V  devices  can  operate  at  4  K  (or  below)  without 
carrier  freeze  out.  One  of  the  application,  at  these 
temperatures,  is  the  amplification  of  infra-red  (IR) 
detectors  signals,  also  of  SIS  detectors  signals.  For  IR 
detectors,  the  signal  is  usually  at  audio  frequencies. 
Helium  cooled  MESFET  amplifiers  have  been  designed 
[29,30]  and  have  proven  to  be  competitive  with  50  K 
silicon  JFETs  amplifiers.  PHEMTs  devices  should  be 
used  in  that  field  in  a  near  future. 

Conclusion 

The  essential  features  of  cryogenically  cooled 
HEMT’s  have  been  described.  These  devices  are 
probably  the  best  suited  for  the  development  of  a  77  K 
electronics  that  should  constitute  an  alternative  to  a  high 
performance  4  K  electronics  which  has  always  been  too 
expensive  to  be  largely  used.  The  compared 
performances  with  respect  to  ambient  temperature 
counterparts  will  definitely  determines  the  interest  of  this 
field  of  investigations. 
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Abstract 

Novel  microwave  circuit  structures  have  been  de¬ 
signed  and  fabricated  for  application  at  high  operating 
temperaUires.  The  required  simulation  and  character¬ 
ization  of  MESFET  and  HBT  devices  at  high  ambient 
temperatures  have  been  carried  out.  Novel  physical  an¬ 
alytical  models  for  the  high  temperature  performance 
of  MESFET^s  and  HBT^s  were  developed.  As  an  ex¬ 
ample  a  fully  integrated  radar  front-end  operating  at 
temperatures  up  to  200°  C  is  given  in  the  paper. 

1  Introduction 

MMIC  design  and  fabrication  for  high  tempera¬ 
ture  applications  have  recently  received  increased  at¬ 
tention.  Most  MMIC  for  high  temperature  applica¬ 
tions  considered  up  to  date  have  been  realized  with 
MESFET  or  HEMT  technologies.  Also  HBT  devices 
have  been  proposed  for  high  temperature  applications 
[1.  2,  3,  4,  5], 

The  design  of  MMIC  for  high  temperatures  requires 
accurate  modeling  and  characterization  of  devices  at 
increased  operating  ambient  temperatures.  A  number 
of  physical  and  empirical  models  for  MESFET  devices 
have  been  presented  in  the  literature.  HBT  device 
modeling  has  not  yet  reached  this  maturity  and  new 
results  will  be  presented  in  this  paper. 

The  scope  of  this  contribution  is  to  demonstrate 
novel  MMIC  designs  intended  for  high  temperature 
application  and  new  results  for  the  characterization  of 
HBT  devices  for  a  temperature  range  up  to  250°C. 

2  MESFET  BASED  MMIC  design 

It  is  commonly  accepted  and  has  been  confirmed  by 
us  earlier  [1]  that  only  ^21  of  MESFET  devices  de¬ 
creases  by  approximately  a  factor  of  2  with  increasing 
temperature  from  room  temperature  to  200°C.  As  the 
other  scattering  parameters  remain  unchanged  up  to 
0-7803-2516-8/95  $4.00©  1995  IEEE 
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Figure  1:  High  temperature  VCO  and  measured  re¬ 
sults  for  the  frequency  of  oscillation  and  the  output 
power  as  a  function  of  ambient  temperature. 

reasoning  a  MMIC  VCO  has  been  designed  and  re¬ 
alized.  In  fig,  1  the  design  and  the  measured  per¬ 
formance  are  illustrated.  No  special  measures  have 
been  undertaken  in  this  circuit  to  stabilize  the  output 
power  versus  temperature  and  therefore  the  output 
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Figure  2:  Active  circulator  with  integrated  mixer. 

power  drops  at  temperatures  above  150°C. 

A  second  circuit  has  been  designed  which  comprises 
an  active  circulator  with  a  mixer  circuit  designed  to 
operate  up  to  200^C  ambient  temperature.  The  cir¬ 
cuit  is  shown  in  fig,  2.  The  circulator  utilizes  a 
common-source  configuration  as  an  out-of-phase  di¬ 
vider  and  a  common  gate  pair  as  a  mixer.  The  circu¬ 
lator  does  not  exhibit  a  return  path  and  could  there¬ 
fore  also  be  called  a  diplexer  because  of  its  ability  to 
deliver  the  signal  from  the  input  to  the  antenna  port 
and  from  the  antenna  port  to  the  mixer.  The  simu¬ 
lated  performance  of  the  circulator  is  summarized  in 
fig.  3a  and  fig.  3b  where  the  scattering  parameters  and 
the  output  IF  current  are  indicated. 

3  Modelling  and  Characterization 
OF  HBT  FOR  High  Temperatures 
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HBT  devices  offer  a  number  of  advantages  with 
respect  to  high  temperature  applications  when  com¬ 
pared  with  MESFET  devices.  Most  notably  the  verti¬ 
cal  current  flow  and  the  possibility  of  heterostructur¬ 
ing  yield  higher  flexibility  in  the  design  of  HBT  devices 
and  circuits.  Little  is  known  on  the  high  temperature 
microwave  performance  of  HBT  devices  and  there  are 
no  HBT  device  models  available  capable  of  accurately 
characterizing  the  HBT  high  temperature  behavior. 
We  have  developed  a  DC  and  RF  HBT  model  appli¬ 
cable  for  the  simulation  of  the  operation  at  increased 
temperatures  [2,  3,  4,  5,  6,  7].  A  comparison  between 
measured  and  calculated  DC  characteristics  at  room 
temperature  and  at  200^(7  are  given  fig.  4.  Very  good 


Figure  3:  Simulated  results  for  the  a)  scattering  pa¬ 
rameters  and  b)  for  the  IF  current  at  the  output  port 
of  the  circulator  with  integrated  mixer. 
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Figure  4:  Measured  and  Calculated  I/V  characteris¬ 
tics  at  a)  room  temperature  and  b)  200°C. 

agreement  between  measured  and  calculated  results 
can  be  achieved  with  this  physical  model.  It  will 
be  demonstrated  during  the  presentation  that  the  RF 
performance  of  HBT  resembles  the  high  temperature 
operating  conditions  of  MESFETs.  For  example  the 
main  deterioration  in  device  performance  is  observed 
for  the  simulated  as  well  as  measured  521  • 


—  simulated 

•  measured 

T  =  473K 

base  step 

r _ 

> 

4  Conclusions 
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ABSTRACT 

In  this  paper  the  basics  of  high  temperature  HBTs  are 
described.  Starting  from  the  wafer  structure,  and  ending 
with  obligatory  technological  requirements.  The  paper 
is  completed  by  a  comparison  between  two  very 
promising  HBT  material  systems  in  terms  of  high 
frequency  and  high  temperature  performance. 


1.  Introduction 

HBTs  are  very  promising  devices  for  power  applications 
in  the  GHz-region[  1,2,3].  The  advantage  of  the  high 
power  density  capability  in  comparison  to  field-effect- 
transistor  based  devices  is  at  the  same  time  a 
disadvantage  because  of  device  performance 
degradation  due  to  selfheating  effects.  Since  thermal 
resistances  of  conventional  high  frequency  HBTs  are  in 
the  order  of  some  hrmdreds  of  Kelvin  per  Watt,  an 
increase  of  intrinsic  device  temperature  starting  from 
100°C  is  realistic  for  power  applications. 

Beside  the  above  described  HBT-application  there  is  a 


market  for  sensor-systems  working  with  high  frequency 
measurement  methods  in  a  high  temperature 
environment.  HBT  based  oscillators  are  here  of  interest 
because  of  their  low  phase  noise  in  comparison  with 
MESFET  and  HEMT  based  ones  [4].  In  this  application 
it  is  the  ambient  temperature  that  influences  the  device 
performance. 

These  two  examples  should  demonstrate  the  necessity  of 
considering  the  high  temperature  aspects  in  design  and 
technology  of  HBTs  which  will  be  discussed  in  this 
paper.  Starting  from  the  choice  of  the  used  material  and 
the  design  of  the  wafer,  followed  by  some  remarks  upon 
the  influence  on  the  design  of  the  transistors  and  the 
used  technology,  and  ending  with  the  discussion  of 
high-temperature  microwave  measurements  of 
transistors  up  to  200°C  ambient  temperature. 


2.  Waferdesign 

Aiming  at  high  temperature  operation  AlGaAs/GaAs 
and  GaInP/GaAs  HBT  have  achieved  very  impressive 
results  in  the  last  years  [5].  The  standard  structure  for 
such  a  device  is  shown  in  fig.  1.  This  sketch  also 


Low-Bandgap  Emitter  Cap-Layer 
InGaAs 

Wide-Bandgap  Emitter 
AIGaAs  or  GalnP 

GaAs  Base-Layer  - 

Wide-Bandgap  Collector 
AIGaAs  or  GalnP 
GaAs  Sub.collector 


Emitter  Contact 
Ti/Pt/Au 

Base  Contact 
Ti/Pt/Au 

Collector  Contact 
NI/AuGe/Ni/WSi 


Fig.  1  Schematic  cross  section  of  a  high  temperature  HBT 
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represents  the  structure  of  the  fabricated 
AlGaAs/GaAs/GalnP  and  GaInP/GaAs/GAInP  HBTs. 
Beside  the  conventional  HBT-features  such  as  low- 
bandgap  emitter  caplayer  for  very  easy  ohmic  contact 
formation  and  wide-bandgap  emitter  for  high  base 
doping,  there  a  two  very  important  points  for  the  wafer 
design.: 

Thinking  of  high  temperature  operation  the  suppression 
of  thermally  generated  charge  carriers  in  the  base 
collector  space  charge  region  is  essential  for  a  high 
output  impedance.  For  temperatures  above  200°C  it  is 
therefore  recommendable  to  use  a  wide-bandgap 
collector  material  such  as  AlGaAs  or  GaInP.  With 
Al^jpGa^ggAs  flat  common-  emitter 
outputcharacteristics  have  been  achieved  up  to  400°C 
even  for  large  devices.  The  disadvantage  of  lower 
saturation  velocity  leading  to  increased  collector  delays 
of  these  materials  in  comparison  with  GaAs  is 
compensated  by  the  lower  ionization  coefScients,  which 
allows  to  shrink  the  collectorlength  with  no  change  in 
breakdown  voltage.  To  avoid  a  spike  in  the 
conductionband  between  base  and  wide-bandgap 
collector,  an  interfacial  layer  has  to  be  inserted 
generally. 

For  a  sufficient  current  gain  even  at  elevated 
temperatures  the  valenceband  offset  between  base  and 
emitter  has  to  be  large  enough.  This  condition  is 
automatically  fulfilled  for  a  GalnP  emitter.  For  a 
AlGaAs  emitter  the  A1  mole  fraction  has  to  be  adapted. 


3.  Technology 

Technology  of  the  devices  has  to  take  care  of  aspects  of 
device  reliability.  For  high  operation  temperatures 
degradation  effects  in  ohmic  contacts  because  of  Au 
interdiflusion  could  have  dramatic  influence  on  device 
performance.  Therefore  special  diffusion  barrier  layers 
in  the  ohmic  contacts  are  necessary.  In  fig.  1  Pt  and 
WSi  is  used  for  this  purpose.  For  Ti/Pt/Au  contact  and 
Ni/AuGe/NiAVSi  contacts  long  time  reliability  at 
elevated  temperatures  has  been  proved  already.  But 
there  are  also  some  other  concepts  for  n-ohmic  contacts 
which  avoid  the  use  of  Au  for  the  ohmic  contact  layer, 
such  as  the  Pd/ln-System  [6]  and  the  Pd/Ge/Si-System 
which  have  achieved  very  promising  results.  For  these 
concepts  the  use  of  a  drflusion  barrier  is  also  obligatory. 

Strongly  related  to  device  technology  and  very 
important  for  the  thermal  resistance  of  the  device  is  the 
kind  of  emitter  contacting  technique.  For  the  fabricated 
HBTs  with  4x20  pm^  emitterfingers  nearly  the  whole 
part  of  the  emittercontact  is  covered  with  a  2  pm  thick 
plated  gold  layer  used  for  the  intercormection 


metallization.  This  leads  to  the  thermal  resistance  of 
about  300  K/W  for  a  one  emitter  finger  device  for  the 
fabricated  transistors  and  a  very  good  thermal  coupling 
between  the  emitters  of  a  multifinger-structure.  This 
technique  is  very  difficult  for  emitter  widths  of  about  2 
pm  or  smaller.  For  these  transistors  the  contact  to  the 
plating  is  generally  at  the  broadened  end  of  an 
emitterfinger,  leading  to  worse  thermal  resistance  of 
one  finger  and  worse  coupling  between  emitterfingers. 


4.  Results 


With  the  described  technology  HBTs  with  1,  2  and  4 
emitterfingers  have  been  fabricated  on  an 
Alo,38Gao.62As/GaAs/GaInP  wafer  and  a 
GaInP/GaAs/GalnP  wafer  with  similar  doping  levels 
and  layer  thickness  (emitter:  Nd=  7*10'^  cm'^;  base: 
Na=  4*10'®  cm'^;  collector:  Nd=  5*10*®  cm'^) .  In  fig  2. 
the  S-parameters  of  a  GaInP/GaAs/GalnP  one 
emitterfinger  HBT  are  plotted  for  room  temperature 
(RT),  100°C  and  200'’C  from  45  MHz  to  10  GHz.  The 
bias  point  has  been  chosen  in  a  way  that  the  selfheating 
of  the  device  is  negligible  and  the  collector  current  is 
constant.  For  the  interpretation  of  the  data  a  T- 
equivalent  transistor  circuit  has  been  used.  This 
parameter  extraction  technique  has  been  performed  in  a 
similar  way  as  described  in  [7]. 


Fig.  2  Measured  S-parameters  of  a  one  finger 
GaInP/GaAs/GalnP  HBT  from  45  MHz  to  10  GHz  for 
RT,  100°C  and  200°C 
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Regarding  Sn  it  is  obvious  that  the  real  part  of  the  input 
impedance  is  decreasing  with  temperature.  From  the 
simple  approximation  for  the  dynamic  resistance  of  the 
base  emitter  diode 

reb  =  hj,*n*Vt*  IC' 

an  increase  of  this  impedance  would  be  expected  ( 
hj,=current  gain,  n=ideality  factor  of  base  emitter  diode; 
IC=DC-cotlector  current,  Vt=k*T*q  '  ).  Generally  there 
are  two  temperature  dependent  factors  which  mask  this 
effect.  The  current  gain  as  shown  in  fig.  3  and  the 
contact  resistance  of  base  contact.  The  decrease  in 
current  gain  could  be  stabilised  with  an  higher  Al-mole 
fi-action  in  the  emitter.  With  42%-Al  a  change  from  66 
at  RT  to  42  at  200°C  has  been  reported  by  [8].  The 
temperature  dependence  of  the  base  contact  resistance  is 
strongly  related  to  technology. 

Since  the  fabricated  AlGaAs  devices  have  larger 
variations  in  h2i  with  temperature  the  variations  in  Su 
are  more  profound. 

The  variations  in  S21  are  also  strongly  related  to  these 
two  temperature  dependent  factors.  Therefore  for  the 
AlGaAs  based  device  this  effect  is  again  much  more 
profound  than  for  the  GalnP  based  ones.  Fig.  4  shows 
the  normalized  S21  (T)  for  the  two  transistor  types.  The 
absolute  values  of  plotted  normalized  transistor 
parameters  at  RT  are  listed  in  tab.  1. 


Fig.  3  Temperature  dependence  of  normalized  small 
signal  current  gain  h2i  for  an  AlGaAs/GaAs-  and  a 
GaInP/GaAsHBT 

S22  is  dominated  by  the  intrinsic  and  the  extrinsic  base 
collector  junction  capacitance.  The  resulting  impedance 
is  nearly  temperature  independent  as  long  as  the 
thermally  generated  charge  carriers  in  the  space  charge 
region  of  the  base  collector  junction  are  negligible.  For 
S 12  it  is  basically  the  same  effect. 


Fig.  4  Normalized  |S2 1]  versus  temperature 

The  extracted  normalized  temperature  dependence  of  fi 
and  fmax  for  the  two  wafer  structures  are  plotted  in  fig.  5 
and  fig.  6  respectively.  Again  a  bias  point  has  been 
chosen  were  the  self  heating  of  the  device  is  negligible. 

The  temperature  dependence  is  quit  similar  for  both 
material  system.  The  transit  frequency  decreases  about 
72%  from  RT  to  200°C.  Since  the  parasitics  remain 
primarily  constant,  this  effect  originates  from  the 
increase  of  electron  transfer  time  in  the  device  and  the 
increase  of  the  differential  resistance  of  the  base  emitter 
diode.  Strongly  related  to  fj  is  the  decrease  in  f^ax  which 
is  also  in  the  same  order  in  this  temperature  range. 


Fig.  5  Extracted  normalized  temperature  dependence 
of  fr  for  an  AlGaAs/GaAs-  and  a  GaInP/GaAs  HBT 
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Fig.  6  Extracted  normalized  temperature  dependence 
of  fmax  for  an  AlGaAs/GaAs-  and  a  GaInP/GaAs  HBT 


Similar  measurements  for  2  and  4  emitter  finger  devices 
on  both  materials  exhibit  comparable  results  for  power 
levels  less  than  lOOmW  DC  power  Dissipation  per 
emitterfinger,  indicating  a  good  thermal  coupling 
between  different  fingers  based  on  the  already  described 
technology.  In  this  powerregion  the  transistors  are 
scaleable  in  terms  of  collector  current,  rf-gain,  etc.  Only 
for  high  power  level  the  thermal  runaway  of  single 
emitters  restrict  device  performance. 


Values 

atRT 

AlGaAS/GaAs-HBT 

GaInP/GaAs-HBT 

Ie 

4  kA/cm^ 

3  kA/cm^ 

h2i 

16.9 

18.7 

S21 

3.7 

2.4 

fr 

18.0  GHz 

17.0  Ghz 

fmax 

6.5  GHz 

9.0  GHz 

Tab.  1  Absolute  values  of  plotted  transistor  parameters 
at  room  temperature. 


5.  Conclusions 

In  this  paper  aspects  of  HBT  design  with  respect  to 
requirements  of  elevated  temperature  operating 
conditions  are  pointed  out.  Microwave  measurements 
from  RT  to  200°C  exhibit  similar  high  frequency 
properties  for  the  fabricated  AlGaAs/GaAs/GalnP  and 
GaInP/GaAs/GalnP  HBTs  with  only  one  difference. 
The  HBTs  with  GalnP  emitter  offer  nearly  temperature 
independent  gain  origin  from  the  weak  temperature 


dependence  of  small  signal  current  gain.  AlGaAs 
emitter  based  devices  with  an  Al-mole  fraction  of  more 
than  40%  are  also  expected  to  offer  this  feature. 
Therefore  comparable  results  concerning  electrical 
performance  should  be  achievable  with  both  material 
systems.  Similar  investigations  in  the  field  of  reliability 
may  favour  one  transistor  type. 
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ABSTRACT 
The  Heterojunction  Bipolar  Transistor  (HBT) 
commonly  used  in  GaAs  MMICs  for  microwave 
and  millimeter  wave  systems  can  perform  as  a 
high  speed  photodector  with  inherent  gain  and 
high  linearity.  The  use  of  the  phototransistor  will 
allow  circuit  designers  to  develop  a  new  class  of 
photonic/  millimeter  wave  circuits.  A  new  model 
for  the  HBT  has  been  developed  which  solves  for 
the  electrical  and  photogenerated  currents. 
Experimental  and  theoretical  curves  relating  to  the 
device  behavior  are  presented  and  compared. 

1.  INTRODUCTION 

Three-terminal  microwave  devices  for 
photodetection  have  been  receiving  increasing 
attention.  Combining  detection,  amplification,  and 
mixing  in  the  same  device  results  in  considerable 
simplification  of  the  receiver  design,  and  provide 
an  opportunity  for  new  applications.  The  first 
devices  investigated  as  photodetectors  were 
microwave  MESFETs  [1,3]  andHEMTs  [2].  They 
exhibited  high  optical  gain  but  poor  frequency 
response.  Further  investigations  [4,5]  revealed 
that  the  principal  mechanism  of  photodetection  is 
a  photovoltaic  effect.  This  process  provides  for 
large  gain,  but  it  is  inherently  slow. 

Heterojunction  bipolar  transistors  are  current- 
controlled-current-source  devices  where  the 
photogenerated  carriers  contribute  directly  to  the 
total  photocurrent  therefore  the  device  is 
intrinsically  fast.  Recent  studies  [6]  have  shown 
that  an  active  base  terminal  enhances  the  gain  and 
frequency  response  compared  to  the  floating  base 
configuration.  Furthermore  HBTs  provide  for 
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integration,  they  sustain  very  high  current  densities 
(106  A/cm2),  require  low  bias  voltages  and  a 
single  power  supply. 

2.  EXPERIMENTAL  INVESTIGATION 

The  device  under  test,  comprises  of  a  5000A 
subcollector  doped  at  2x1018  cm-3,  5000A 
collector  doped  at  1.5x1016  cm-3,  800A  base 
Carbon  doped  at  2x1019  cm-3,  300A  AlGaAs 
emitter  doped  at  6x1017  cm-3, 400A  AlGaAs  to 
GaAs  graded  doped  layer  at  2x1018  cm-3  and 
2500A  GaAs  emitter  cap  layer  doped  at  2x10 18cm- 
3.  The  base  was  exposed  for  illumination  by  etching 
the  emitter  cap  layer  and  emitter.  A  modulated 
optical  signal  from  a  semiconductor  laser  at  850nm 
illuminated  HBT's  base  region.  Figure  1  shows 
DC  output  characteristics.  The  photocurrent  at 
Vce=1.5V  is  about  300  mA.  Fig.  2  compares  the 
linearity  of  the  HBT  with  the  MESFET  and  shows 
that  at  optical  intensities  below  1  mW  the  MESFET 
is  highly  non-linear.  As  opposed  to  MESFETs  and 
HEMTs  [8]  the  HBT  shows  no  shift  in  tum-on  or 
saturation  voltages,  indicative  of  no  photovoltaic 
behavior.  This  forecast  that  the  photoresponse  of 
HBT  has  a  larger  bandwidth.  Experimental  results 
confirm  this  prediction. 

Figure  3  shows  a  comparison  for  two  bias  levels 
which  indicates  the  an  increase  in  the  gain  of  the 
device  with  bias.  The  photoresponse  shown  is 
limited  by  the  relaxation  oscillations  of  the  laser  ( 
at  7  GHz).  The  HBT  has  a  potential  response  in 
excess  of  10  GHz  limited  by  the  charging  time  of 
the  base-emitter  junction  (~1.6ps)  and  the  transit 
time  on  the  base-collector  space-charge  region 
(~4.2ps). 
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3.  THEORETICAL  MODEL 


The  transport  equation  to  be  solved  in  the  three 


regions  of  Ae  device:  ^ 
j^d^ni(x)  ni(x) 

‘  dx2  Xi 


+  Gi(x)  =  0 


(1) 


where  Di  is  the  diffusion  coefficient,  ni  is  the 
minority  carrier  concentration  (electrons  or  holes) 
and  ti  is  the  recombination  lifetime,  the  index  i 
refers  to  each  specific  region:  emitter(e),  base(b) 
and  collector(c).  The  last  term  is  the  wavelength 
dependent  generation  term,  which  is  assumed  zero 
in  the  large  band-gap  emitter  region.  The  transport 
equation  is  valid  for  neutral  regions  (zero  E-field) 
inside  the  device  whose  boundaries  are  determined 
by  Poisson's  equation. 


Figure  1.  Measured  DC  output  characteristics 
(dark  and  light)  with  optical  power  =  1  mW 


■  0.0  05  1.0  15 

Optical  Power  (mW) 


Figure  2.  Comparison  of  MESFET  and  HBT 
linearity  for  the  photoresponse 


Frequency  (MHz) 

Figure  3.  Microwave  link  gain  as  a  function  of 
bias. 

The  generation  terms  for  base  and  collector  region 
are  best  represented  by 

Gb(x)  =  Tiba(t>oe-«*  and 

Gc(x)  =  {T|be-“W'>+T|c)a<t>oe'“* 

where  by  replacing  back  at  equation  ( 1 )  allows  for 
the  determination  of  minority  carrier  distribution 
at  the  neutral  regions.  To  account  for  the  effects  of 
the  bands  discontinuities  we  use  the  approach  of 
Parikh  et  al.  [6]  that  defines  an  effective  interface 
velocity  for  the  carriers.  Basically  it  introduces  a 
correction  term  to  the  boundary  value  in  the 
homojunction  case  such  that : 

n(Xj,)  =  n(x„, )  exp  [(-qVj,i  +  AE^)  /  kT]  -  ^  (2) 

®EN 

where  FEN  is  the  electron  flux  and  SEN  is  the 
effective  velocity  across  the  junction.  In  the  case 
of  a  smooth  heteroj  unction  or  a  homoj  unction  DEc 
is  small,  which  makes  SEN  large,  consequently 
eq.(2)  reduces  to  the  homojunction  case.  Since 
there  is  no  spike  in  the  valence  band  the  boundary 
value  for  holes  is  the  same  as  in  a  homojunction 
case.  From  drift-diffusion  theory  the  total  current 
across  ajunction  is  calculated  based  on  the  minority 
carrier  currents  at  the  edges  of  the  depletion  region 
plus  any  contributions  from  inside  the  space-charge 
region. 

Only  the  base-collector  junction  is  shown  for  the 
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abstract.  The  evaluation  of  the  minority  currents 
results  in : 


“  qDbKi,ae‘®V  + 


qDcRic 


qDcBde^'-e^*«cosh  ^ 

+  '  I  '  '  ^  +  qDclCae“^*“  +  Jg.r 

Uinhjg 


+ 


(3) 


where  the  generation-recombination  current  Jg-r 
is  given  by 


Jg-r  - 


Gc(x)dx  =  qr|  (t)oe'“’‘i><{  l-e"®^*^] 


(4) 


In  the  case  where  the  photon  flux  is  zero,  the 
different  components  in  the  current  expressions 
reduce  to  the  well-known  dark  solution  equations. 
The  DC  characteristics  of  the  device  are  obtained 
by  replacing  the  voltage  dependent  carrier  densities 
and  space  charge  boundaries  in  the  current 
equations.  The  different  parameters  in  expressions 
(3)-(4)  are  described  in  appendix  A. 


above.  Good  agreement  is  noted  between 
experimental  and  theoretical  curves  for  moderate 
bias  conditions.  The  discrepancy  at  higher  bias 
voltages  (Vbe>1.2V)  is  due  to  high  injection,  not 
included  in  the  present  model.  For  lower  bias  the 
curves  diverge  because  of  the  non-ideal 
characteristics  of  the  junctions. 

4.  CONCLUSIONS 

A  new  analytical  model  is  under  development  for 
heterojunction  bipolar  transistors  under 
illumination.  Good  agreement  between  experiment 
and  theory  was  observed  within  the  ideal  regions 
of  the  junctions.  The  results  suggest  no  photovoltaic 
behavior  which  will  lead  to  a  faster  optical  response 
at  the  expense  of  gain.  The  high  frequency  response 
is  limited  by  the  transit  time  on  the  base-collector 
depletion  region  which  potentially  leads  to  results 
in  excess  of  10  GHz  limited  by  the  laser. 
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Ppe  —  Ilpe  +  Ki)C 
Ppc  =  Ilpc  ~ 

pTb 

1  - 

_  Jj  0^0  qTc 
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354 


MODELING  FOR  DISPERSION  OF  SfflELDED  NDCROSTRIP  LINE 

USING  SLR  TECHNIQUE 

A.K.  Verma  and  Raj  Kumar 


Department  of  Electronic  Science 
Delhi  University  of  South  Campus 
New  Delhi,  India 


ABSTRACT 

We  report  a  new  dispersion  model  for  shielded  microstrip 
line  in  the  range  1  ^  W/h,  ^  10,  2  ^  ^2/^1, 

0  ^  f  <  20GHz.  Model  has  maximum  deviation  within 
4%  against  the  result  of  SDA.  It  has  average  of  maximum 
deviation  uithin  3.23  % .  Method  is  based  upon  reduction  of 
shielded  nucrostrip  line  into  an  equivalent  open  microstrip 
line  with  MTtual  permittivity  which  is  a  function  of  width 
of  line,  shield  height  and  frequency.  Over  this  equivalent 
virtual  substrate  Kirchning-Jansen  dispersion  model  has 
been  used.  The  present  model  is  useful  for  fast  interactive 
CAD,  on  the  desk  top  computer 

1.  INTRODUCTION 

The  microstrip  line  based  microstrip  components  and 
circuits  are  required  to  be  placed  within  the  shielding 
enclosure  in  order  to  reduce  performance  degradation  from 
external  influence.  The  top  shield  cover  could  also  be  used 
to  some  extent  to  compensate  for  the  degrading  effect  of 
manufacturing  tolerances  {!].  The  microstrip  line  with  top 
shield  is  shown  in  Fig.  (1).  A  closed  form  expression  to 
account  affect  of  top  shield  on  the  static  effective 
permittivity’  and  characteristic  impedance  has  been  reported 
[2].  However,  no  closed  form  expression  suitable  for  the 
CAD  application  has  been  reported  for  frequency 
dependent  effective  pemuttivity  of  the  shielded 

microstrip  line.  The  top  shield  affects  the  disp)ersion 
behaviour  in  ways  which  is  difficult  to  be  modeled.  A 
unified  dispersion  model  for  multilayer  microstrip  line, 
including  top  shielded  microstrip  line  has  been  reported  by 
the  first  author! 3],  We  have  noticed  that  €cff{0  of  the 
shielded  microstrip  structure  determined  from  the  unified 
dispersion  model  degrades  very  much  for  the  close 
proximity  of  top  shield  to  the  microstrip  line.  This  paper 
improves  the  previous  model  by  incorporating  some 
empericaUy  determined  fiK^tors  in  the  single  layer  reduction 
(SLR)  technique  i^x>rted  earlier(3].  The  present  model 
provides  accuracy  for  e^f)  within  4%  against  the  results 
of  SDA  in  the  range  of  1.05  ^  ^  20,  i  ^  W/h|  ^  10, 

2  <  h^/h,  and  0  ^  f  ^  20GHz. 


2,  MODEL 

A  careful  examinatioii  of  behaviour  of  the  frequency 
dependent  effective  permittivity  of  shielded  microstrip  line 
reveals  the  following  characteristic  : 

*  Proximity'  of  the  top  shield  alters  dispersion  of 
microstrip  line  such  that  with  increase  in  frequency  the 
€cff{f)  moves  towards  a  virtual  permittivity  having  higher 
value  than  the  actual  permittivity  of  the  substrate.  This 
virtual  permittivity  of  the  substrate  increases  with 
nearness  of  the  top  shield  to  the  central  conductor. 

*  The  virtual  permittivity  of  the  substrate  is  also  a 
function  of  operating  frequency  beyond  one  third  of  the 
lower  cut  off  frequency  fp  of  the  first  higher  order  mode 
in  the  open  microstrip  line.  With  the  increase  of 
frequency  the  virtual  permittivity  should  move  towards 
the  real  permittivity  of  the  substrate. 

*  The  virtual  permittivity  of  the  equivalent  substiate  is 
also  a  function  of  W/hj  ratio  of  microstrip  line. 

To  Model  of  the  shielded  microstrip  line,  we  reduce 
the  shielded  microstrip  line  to  an  equivalent  open 
microstrip  line  with  virtual  permittivity  of  the  ‘:‘ibstrate.  On 
such  equivalent  open  microstrip  line  avail :ihle  dispersion 
formula  could  be  used.  The  virtual  permittivity  can  be 
written  as 

^rv  ^mid  (1) 

:  Virtual  permittivity  of  the  substrate 

:  Real  permittivity  of  the  substrate 

:  Additional  permittivity  which  is  function  of 

W/h|,  hj/h,  and  frequency.  should  meet  the 
following  requirements. 

(W/h,,  hj/h,,  f)  — ♦  0,  as  1^,  — ►  ot> 

(2) 

Cfwki  (W/h,,  h2/h,,  0  — ^  0,  as  f  — ►  oo 

In  the  single  layer  reduction  (SLR)  technique  the  shielded 
microstrip  line  is  reduced  to  an  equivalent  open  microstrip 
line  under  the  static  condition.  A  difference  in  permittivity 
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Ac,  of  the  substrate  in  presence  of  the  top  shield  is 
obtained.  Ac,  forms  the  basis  for  the  c^  which  is  to  be 
eo^^erically  modified  to  take  the  above  facts  into  account. 

Thus,  Ac  =  c,  -  c/  (3) 

Where  c,'  is  the  permittivity  of  the  equivalent  substrate  of 
the  open  microstrip  line  obtained  from  the  SLR  techique. 
However,  this  equivalent  substrate  is  not  our  present  virtual 
substrate  of  permittivity  c^.  We  can  write  an  expression  for 
^  follows 


6  =  A 

1.0+ o  (W/h,) 

It. 

K/K  J 

3f 

where 

A  =  ,  for  Ac,  <  1 

=  (A  ,  for  Ac,  ^  1 

y3f  =  1,  for  f  ^  y3 


(4) 


(5) 


(6) 


i.e.  the  frequency  dependent  term  is  neglected  for  the 
operating  frequency  f  ^  fp.Z^  is  the  characteristic 
impedance  of  the  shielded  microstrip  line  determined  from 
the  variational  technique.  Constants  a  and  p  have  been 
determined  emperically  from  comparision  of  c,^0 
determined  by  the  present  OKxiel  and  the  SDA.  Table-I 
gives  a  and  P  for  1  ^  W/h|  ^  10. 


The  Kirschining-Jansen  dispersion  model[4]  for  the  open 
microstrip  line  has  been  used  on  this  virtual  substrate  of 
equivalent  open  microstrip  line.  Any  other  dispersion 
model  could  be  used  in  this  virtual  equivalent  substrate. 


3.  RESULTS 


Fig.  (2a)  and  Fig.  (2b)  show  dispersion  results  by  the 
present  model  and  SDA  on  plastic  substrate  of  c,  ~  2.2. 
Average  maximum  deviation  in  the  range  1  ^  W/h,  ^  10 
for  h2/h|  =  2  for  frequency  upto  20GHz  is  1.1%. 
Maximum  deviation  is  -1.8%  at  W/h|  =:  3,  frequency 
14GHz.  For  lyhi  =  6,  average  maximum  deviation  in  the 
range  1  ^  W/h|  ^  10  is  1.4%  and  max.  deviation  is  2.2% 
at  W/h|  —  10,  frequency  12GHz,  Fig.  (3a)  and  Fig.  (3b) 
provide  dispersion  results  on  Alumina  substrate,  c,  —  9.8. 


For  lyh,  —  2«  mmx,  deviation  is  -4%  at  W/h,  -  10, 
frequency  8  GHr.  For  lyh,  —  2,  max.  deviation  is  -3.9% 
at  W/h,  —  6,  frequency  6GHz.  Fig  (4a)  and  Fig.  (4b) 
show  dispersioa  for  the  shielded  microstrip  line  on  c,  ~  20. 
For  bJb,  =  2,  max.  deviation  is  3.6%  at  W/h,  =  4, 
frequency  20GHz  and  the  average  max  deviation  is  2.32% 
in  the  range  1  <  W/h,  10.  For  hy/h^  —  6,  max. 
deviation  is  4.1%  at  W/h,  =  10  frequency  2GHz.  For 
W/h,  =  10,  v^uiaiion  in  the  static  c^O)  of  varational 
method  and  SDA  is  about  2%.  In  most  of  the  cases  more 
deviation  between  the  present  model  and  SDA  takes  place 
for  hJh,  =  3  and  in  the  middle  freqeuncy  range.  The 
average  max.  deviation  fore,  —  2.2,  12.95  and  20  in  the 
range  1  ^  W/li,  <  10  frequency  upto  20GHz  is  shown  in 
the  table-2.  It  is  possible  to  improve  further  accuracy  of  the 
model  in  this  range.  Range  of  W/h,,  6,  and  frequency  could 
also  be  enlarged. 
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Table  - 1 


W/h, 

a 

& 

1-3.5 

1.00 

1.0 

3.5-6.5 

0.30 

0.70 

6.5-8.5 

0.22 

0.62 

8.5-10 

0.20 

0.85 

Table  -  D 

Averge  of  max.  deviation  of  present  model  from  SDA 


Deviation  =  X  100 

^efT(^)sDA 


2.2 

12.95 

20 

hj/h, 

1 

2.0 

1.109^ 

2.68% 

2.32 

3.0 

1.43% 

2.59% 

3.23% 

4.0 

1.41% 

1.94% 

2.74% 

Frequency(QHz) 


Fig  '  •  r  isrersion  Behaviour  of  Shielded 

Microstrip  Line 
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Fig 33  '  . r ispers ion  Behaviour  of  Shielded 
Microstrip  Line 
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Fig . (2b  )  . Disrersion  Behaviour  of  Shielded 
Microstrip  Line 


Frequency(QHz) 


Fig 3b  ). Dispersion  Behaviour  of  Shielded 
Microstrip  Line 
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Fig . (4b > .Dispersion  Behaviour  of  Shielded 
Microstrip  Line 
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ADAPTIVE  ASYMPTOTIC  OPTIMAL  ALGORITHMS 
FOR  DETECTING  SIGNALS  IN  AUTOREGRESSIVE  NOISE 
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Technical  University  Sofia,1618  Sofia,  P.O.Box  104,  Bulgaria 
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Abstract:  Asymptotic  optimal  (AO)  algorithms  for  detection  of  signals  in  additive  autoregressive  noise  of  order  m  (m- 
dependent  Markov  noise)  are  synthesized.  The  algorithms  require  the  storage  of  m  past  data  samples  to  achieve  optimum 
performance.  It  is  AO  memory  discrete-time  detector  of  a  deterministic  or  quasideterministic  signal  in  autoregressive  noise. 
To  assure  the  change  of  the  detector’s  parameters  as  a  result  of  learning  the  AO  algirithm  was  modified  to  adaptive  one. 
Combining  AO  algorithm  with  adaptation  it  is  a  powerful  approach  to  overcome  a  priori  uncertainty  in  information  systems. 
The  investigations  are  carried  out  by  common  approach  with  many  simulation  results. 


IJntroduction 

In  many  works  of  Le  Cam,  Hajek,  Roussas  etc  it 
has  been  proved  that  many  important  properties  of  the 
statistical  criteria  result  from  asymptotic  normality  of  the 
logarithm  of  likelihood  ratio  for  close  hypotheses  (for 
close  values  of  the  parameter)  independently  of  the  way 
of  how  observations  are  connected. 

The  asymptotic  theory  of  statistical  decisions 
applied  to  the  problems  of  detection  and  estimation  of 
signals  on  a  background  of  noise  permits  to  overcome  the 
difficulties  of  interpreting  statistics  with  finite  measure  of 
samples  and  to  create  more  general  and  simpler  methods 
in  treatment  of  information. 


2.Problem  statement  and  assumptions 

We  consider  the  detection  of  signals  in  additive 
m-dependent  Markov  noise.  Let  us  have  a  sequence 
x={xi,  i=l,..,n}  of  real  observations  of  a  process 
X={x(ti)=Xi,  i=l,2,..}  and,  on  the  basis  of  x,  we  wish  to 
decide  between  the  following  pair  of  hypotheses 
concerning  X: 


H  :x;  =nL 


h,:xL  =  nu+As;:_„,, 


(1) 


/(”i|  “/-L)-  Here  s;._/ =  (5, is  a 
deterministic  periodic  signal.  The  parameter  X  is  an 
unknown  amplitude  of  the  signal.  Our  study  is  restricted 
to  the  asymptotic  (large-sample-size)  case  and,  to  avoid 
singularity,  we  will  consider  a  sequence  of  alternatives 

{\=Y  !  4n\n  =  1,2,..},  where  7>0. 

In  [2]  (see  also  [3,1] )  was  derived  an 
asymptotic  optimal  (AO)  statistics  for  the  problem  (1) 


r(x)=-^  ZsLVxL)  = 

Vw  i=m+\ 

1  «  0 

VW  i=m+l  j=-m 

where 

j=-m,..,0. 


(2) 


,  (3) 


=  0 


The  sufficient  statistics  (2)  is  asymptotic  normal 
both  on  the  hypothesis  Ho  and  the  contiguous  alternative 
Hi  with  parameters 


N(0,tr(SK))  and  N(7tr(SK),tr(SK)) 
respectively,  where 


where  =  ( V-_^,.., N,) ,  i=m+l,...  is  a  stationary  K-(kjv)-E{(pj9v  I  Ho),  j,v~-m,..,0;  detK<oo  (4) 

m-dependent  Markov  noise  with  an  initial  probability 
density  /(nf)  and  a  transition  probability  density 
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S=(Sjv)=Iim-  X-^i+j-^z+v  ,j,v=-m,..,0;detS«>o. 


n^~  n  ,=„+i 


a)(x)= 


AO  algirithm  can  be  presented  as  follows 

1  (there  is  a  dgid)  {f2Xx)>C 
P  (iJ^reisrit  asigid)  ^T(iO<C 


Taking  into  account  the  asymptotic  normality  of 
the  statistic  (2)  under  Ho  and  Hi  can  be  determined  the 
threshold  constant  C  and  the  performance  of  AO 
algorithm  for  given  y  and  n-^«>. 


3.Quasidetermiiiistic  signals 

Let  us  have  again  the  problem  (1),  but  useful 
signal  is  quasideterministic  one: 

K 

^(0  =  Sv,¥«(0=v^\|/(0  ;  (6) 

9=1 

s(ti)=Si=vV(ti)=  Vi ;  i=l,n  , 


parametric  estimation  of  the  basic  characteristic 
(see  (3))  is  derived 

=  gy(xL>  Y,^) .  (10) 

and  also  the  estimation  of  the  matrix  K  (see  (4))  and  the 

A,  I  A 

treshold  constant  C  =  tr(SK) .  Here  Za  is  the 
upper  a-th  quantile  of  F,  the  d.f.  of  the  standard  normal. 
It  can  be  proved  that  the  adaptive  algorithm  is  AO,  if  for 
any  j=  -m,..,0  the  convergence 

lim£{«,(x;:..,Y‘)-®  (x;_)’)  =  0  (H) 

is  fulfilled. 

The  additive  m-dependent  Markov  sequence 
{Nil  (see  problem(l))  satisfies  the  linear  autoregresion’s 
equation  with  unknown  coefficients 


=  rjj ,  i=l,..,n, 


where  V^=(Vi,...,Vk)  is  a  set  of  unknown  random 

parameters  and  V^=(Vi(t)>-'Vk(l)) 

functions.  Further  one  can  determine  respectively  (see 

[5]): 

-  likelihood  ratio  (LR) 

l(x;-^v^\|f.„,„)~  (7) 

Vn 


exp  % 

L  -y  /I  i=m+l 


(P(x'_„)-0.5yV  Bv 


-  k-dimensional  vector  statistic 


T(x)=-^  £¥,-..,9(xL)  e 

yTl  i=m+l 

A[T(x)]  =  J/(x^)/(v)tfv  , 


where  Uj  are  arbitrary  constants  and  T|i  are  i.i.d.  random 

variables  with  common  density  function  f^(z)  .  It  can  be 
proved  [4,5]  that  XpNi+X^Si  is  a  m-dependent  Markov 
sequence  with  density  function  of  a  transition  from 
to  Xi,  which  equals  of 


where  cIq=\.  Therefore,  logarithmic  derivative  (see  (3)) 
has  an  other  representation  as  well 


=  =  (14) 

M^=0 


-  by  using  decision  function 

1  (there  is  a  signal),  if  A[T(x)]^ 

<I>[T(x)]=i  (9) 

0  (there  isn’t  a  signal),  if  A[T(x)]<C  . 


,j=-m,..,0. 


4(x.+  LajX.^j) 


4»Adaptive  AO  algorithms  for  detection  of  signals 

Adaptive  algorithms  assure  the  change  of  the 
parameters  and  structure  of  the  system  as  a  result  of 
learning.  Let  Yj^  =  be  a  classified  sample 

belonging  to  the  distribution  of  the  m-dependent  Markov 
noise  (L>m).  With  the  help  of  a  learning  sample  Yj^  a 


If  we  don’t  know  the  coefficients  Uj,  it  can  be 

found  their  maximum  likelihood  estimates  using  a 
learning  sample  of  the  autoregressive  noise  {Ni}.  Finally 
it  can  be  found  out  the  parametric  estimator  of 

the  logarithmic  derivative. 
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S.Communication  systems 

The  basic  algorithm-(8)  is  interpreted,  when 
m=l -Markov  noise  with  unknown  densities,  and  the 
useful  signal  is  a  quasideterministic  one-a  narrow  band 
one  (with  an  initial  random  phase  cpo )  that  is  modulated 
by  amplitude  and  by  phase  (frequency): 

s(t)=a  (t)cos[cflbt+|i(t)-(Po]=Vi\|/i(t)+V2\|/2(t),  (15) 

where  Vi=cos(po;  V2=sin9o;  Vi(t)=  a(t)cos[cflbt+|a(t)], 
\|/2(t)=  a(t)sin[a]bt+|i(t)].  Here  the  envelope  a{i)  and  the 
phase  |x(t)  are  slowly  changing  deterministic  functions; 
the  random  phase  (po  is  uniformly  distributed  in  [0,27i]. 
The  following  algorithm  is  synthesized 

L/=2  7=5-1  J 


X/c 

Fig.2.Estimation  probability  of  correct  detection;  ARl, 
Const.signal,  ai=0.05,  a2=0.01,  a3=0.005,n=100. 


\  (16) 

n  (/=2  7=:-1  J 

and  the  structure  scheme  is  presented  as  well  (see  Fig.l). 

6.Simulation  results 

For  the  basic  algorithm  (2),  (5)  by  simulation 
technique,  were  derived  the  curves  of  probability  of 
correct  detection  P  in  dependence  of  signal  to  noise 
ratio,  when  m=l  and  m=3  and  a,  n  are  fixed  (see 
Fig.2-5). 


Fig.l  Structure  of  the  detector  for  a  narrow  band 

modulated  signal.  DL-delay  line;  SQ-squared;  TD- 
threshold  device. 
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Fig.3. Estimation  probability  of  correct  detection;  ARl, 
Pulse-signal,  ai=0.05,  a2=0.01,  a3=0.005,n=100. 
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Fig.4.Estimation  probability  of  correct  detection;  AR3, 
Const,  signal,  ai=0.05,  a2=0.01,  a3=0.005,n=100. 
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Noise  Noise 


0.001  0.002  0.003  0.004  0.005  0.006  0.007  O.OOi 

Fig.S.Estimation  probability  of  correct  detection;  AR3, 
Pulse-signal,  ai=0.05,  a2=0.01,  a3=0.005,n=100. 


index 


Fig.6.  ARl,  Ni+0.8Ni.,=Tli  ,  ~N(0,1) 


Index 

Fig.7.  AR3,  Ni+0.95Ni.i  +0.2996Ni.2 + 
+0.0314Ni.3=Tii ,  lli~N(0,l) 


7.  Radar  applications 

The  basic  algorithm  (8)  is  interpreted,  when 
m=0-{Ni}  is  nongaussian  noise  with  unknown  density 
function  and  independent  samples.  The  useful  signal  can 
be  written  as  follows; 

s(t)=s(t,v)=acos((0ot-9o)=>  (17) 

acos(cooti-9oj)=  ^7cos(0oticos(poj+^7sin(0otisin(poj= 

=Vi\i/i(ti)+V2\K2(ti),  j=l,...,M, 

where  U  is  an  unknown  amplitude  and  (po  is  uniformly 
distributed  in  [0,2jc].  Here  M  is  a  number  of  the  impulses 
in  radar’s  packet.  The  following  algorithm  is  syntesized 

T(x.)  =-  ^acost0of,.(p(x,..)  + 

^  nL,=i  J 

-  T 

-b-  sin O)of,.9(x,p  (18) 

^  L  1=1 

The  structure  scheme  is  presented  in  [4]. 
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ABSTRACT 

This  paper  shows  how  fuzzy  logic  can  be  used  to  im¬ 
prove  the  behaviour  of  phase-locked  loops.  In  partic¬ 
ular,  we  have  derived  a  fuzzy-rule-based  phase  error 
detector,  which  permits  a  decrease  in  the  steady-state 
phase  variance  to  be  achieved. 


1.  INTRODUCTION 

In  modern  digital  communications,  efficient  synchro¬ 
nization  techniques  are  required  at  the  receiver  side 
to  fully  exploit  the  potentiality  of  modulation/coding 
schemes  without  causing  a  bottleneck  in  the  overall  sys¬ 
tem  performance. 

One  of  the  fundamental  function  in  the  receiver  is 
the  phase  recovery  unit,  which  generally  consists  in  a 
phase-locked  loop  (PLL)  [1]  as  depicted  on  Figure  1. 
A  PLL  is  made  up  with  three  main  blocks:  a  phase 
error  detector  (PD),  a  low-pass  loop  filter  (LPF)  and 
a  voltage  controlled  oscillator  (VCO).  The  phase  de¬ 
tector  provides  an  error  signal  e*  whose  average  value 
is  proportional  to  the  phase  error  between  the  phase 
associated  to  the  received  signal  r(t)  and  the  VCO’s 
phase.  The  phase  error  ejb  is  then  low-pass  filtered  and 
the  obtained  signal  u(t)  is  used  to  drive  the  VCO  in 
order  to  correct  its  phase.  Usually,  the  performances 
of  a  phase  recovery  loop  are  given  in  terms  of  acquisi¬ 
tion  time  and  steady-state  phase  variance.  The  aim  of 
this  paper  is  to  derive  a  fuzzy-rule-based  PLL  controller 
able  to  improve  the  synchronization  loop  performance. 

The  paper  is  organized  as  follows:  Section  2  de¬ 
scribes  the  general  method  for  fuzzy-controlling  a  PLL. 
Section  3  develops  a  specific  example  of  a  fuzzy-rule- 
based  phase  detector  designed  to  minimize  the  PLL 
steady-state  phase  variance.  Finally,  Section  4  is  for 
our  conclusions. 


2.  FUZZY-CONTROLLED  PLL 

Figure  2  shows  how  a  fuzzy  controller  should  be  in¬ 
terfaced  with  a  PLL.  A  set  of  rules  describing  the  de¬ 
sired  behaviour  of  the  PLL  constitutes  the  Expert  Data 
Base.  Such  rules  should  be  expressed  as:  if  condi¬ 
tion  then  action,  where  the  condition  and  action  parts 
are  expressed  in  natural  language  and  therefore  con¬ 
tain  some  amount  of  uncertainty  or  inexactitude  which 
are  straightforwardly  represented  by  subjective  and  not 
sharply  defined  classes  of  objects.  For  example  the  PLL 
controller  could  make  use  of  the  information:  ”the  PLL 
is  almost  locked^ .  Within  fuzzy  set  theory  [2]  it  is  pos¬ 
sible  to  give  a  meaning  to  such  a  sentence  and  a  PLL 
can  be  in  its  lock-in  state  with  a  continuum  of  grades 
of  membership  (and  not  only  locked  or  unlocked). 

Once  the  rules  have  been  set,  the  PLL  state  vari¬ 
ables  related  to  the  condition  part  of  the  rules  and  the 
non-fuzzy  controls  dependent  on  the  action  part  of  the 
rules  should  be  identified.  Then  the  state  variables 
(resp.  non-fuzzy  controls)  should  be  mappped  to  ad- 
hoc  fuzzy  sets  describing  the  conditions  (resp.  actions). 
Every  possible  value  of  the  state  variables  or  controls 
should  belong  to  at  least  one  of  the  fuzzy  sets.  It  is  then 
possible  to  combine  all  the  different  rules  (depending 
on  the  PLL  state  variables)  using  a  classical  fuzzy  logic 
method  (Mamdani,  Larsen,  interpolation,  etc.  [3])  and 
obtain  for  each  non-fuzzy  control  a  value  which  is  the 
most  representative  of  the  combination  of  all  actions. 

3.  FUZZY-RULE-BASED  PHASE 

DETECTOR 

This  section  is  an  attempt  to  show  the  potentiality  of 
the  above  mentioned  method.  For  sake  of  simplicity 
we  will  only  focus  on  improving  the  steady-state  phase 
jitter  by  fuzzy-controlling  the  phase  detector  block. 

Figure  3  shows  the  Minimum  Mean-Square  Error 
(MMSE)  PD,  which  is  currently  known  in  literature 
[4].  The  error  signal  is  given  by  e*  =  lm{pkdl],  where 
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Pk  =  dkC^"^  +  njt  =  Oib  H-  jhk  is  the  optimum  sample 
of  the  signal  at  the  output  of  the  matched  filter  with 
dk  a  random  MxM-QAM  symbol  whose  in-phase  (I) 
and  quadrature  (Q)  parts  belong  to  the  set  {—{M  — 
l)d, . . . ,  d, . . . ,  (M  -  l)d},  (f)  the  unknown  phase  er¬ 
ror,  rik  the  noise  which  is  a  Gaussian  complex  random 
process  whose  variance  is  and  dk  the  decision  taken 
on  pk^ 

A  common  way  to  evaluate  the  performance  of  a  PD 
is  to  compare  the  steady-state  variance  of  the  phase 
measured  in  a  closed  loop  configuration  (for  a  given 
loop  bandwidth  Bl  and  at  a  given  Signal-to-Noise  Ra¬ 
tio,  SNR)  to  the  minimum  achievable  phase  jitter  in 
the  same  conditions.  Such  a  minimum  phase  jitter  is 
also  known  as  the  Cramer-Rao  Bound  (CRB)  [5]  and 
is  given  by 

2  _  2BlT 

SNR 

The  MMSE  phase  detector  gives  very  good  performance 
(very  close  to  the  CRB)  as  far  as  the  decision  dk  is 
correct,  i.e.  as  far  as  the  SNR  is  high.  However,  if 
the  number  of  errors  in  the  decisions  increases  (at  low 
SNR)  the  PD  performance  worsens.  The  idea  is  to  try 
to  take  into  account  only  the  decisions  which  are  more 
likely  to  be  correct  and  to  discard  the  others.  That  is 
to  say  we  try  to  use  the  following  empiric  rules: 

-  If  the  decision  is  correct  then  the  PD  should  cor¬ 
rect  in  the  usual  way  i.e.  using  the  MMSE  algorithm 
(which  is  optimum). 

-  If  the  decision  does  not  seem  to  be  correct  then 
it  is  better  not  to  correct  at  all  (or  at  least  less  then 
usually)  rather  than  correcting  in  the  wrong  direction. 
In  this  case  the  condition  is  the  ”  correctness  of  the  deci¬ 
sion”  and  the  action  can  be  identified  as  the  ”  strength 
of  the  PD  error  signal”.  We  arbitrarily  chose  to  relate 
the  ’’correctness  of  the  decision”  to  two  state  variables 
xi  and  X2i  which  are  the  distances  of  the  received  point 
Pk  from  the  closest  I  and  Q  decision  thresholds  respec¬ 
tively.  We  also  chose  to  relate  the  ’’strength  of  the 
phase  detector  error  signal”  to  one  control  7*,  which 
is  a  gain  factor  ranging  from  0  to  1.  The  new  PD  is 
shown  on  Figure  4  where  ejj.  =  jk^k  is  the  new  error 
signal. 

The  membership  functions  [2]  of  the  fuzzy  sets  cor¬ 
responding  to  the  ’’correctness  of  the  decision”  {pcDi  (iCi) 
and  PcDq{x2))  and  to  the  ’’strength  of  the  PD  error 
signal”  (psEilk))  are  shown  on  Figure  5  as  a  function 
of  a^i,  X2  and  jk  in  the  case  of  a  QPSK  constellation. 

The  membership  functions  pcDii^i)  and  ij>cdq{^2) 
depend  also  on  a  parameter  A  (see  Figure  5),  which 
has  to  be  chosen  carefully  in  order  to  optimize  the 
fuzzy-controller.  Such  a  parameter  defines  some  uncer¬ 
tainty  regions  in  the  complex  plane  as  shown  in  Figure 


6.  Since  the  membership  functions  characterising  the 
state  variables  are  monothonic,  we  used  the  interpo¬ 
lation  method  [3]  to  derive  the  non-fuzzy  control  7jt, 
i.e.  7*  =  li^E{mm\picDi{xi),ficDQ{x2)]).  The  em- 
piric  rules  are  implemented  in  the  Control  Logic  block 
whose  output  7fc  is  close  to  one  if  the  decision  is  likely 
to  be  correct  while  is  close  to  zero  if  the  decision  is 
likely  to  be  erroneous. 

The  slope  of  the  so  obtained  fuzzy-controlled  PD 
has  been  evaluated  by  simulation  and  is  plotted  on  Fig¬ 
ure  7  for  several  values  of  A  in  the  case  of  a  QPSK  con¬ 
stellation,  The  degradation  with  respect  to  the  CRB 
is  given  on  Figure  8  for  different  values  of  SNR  and  A. 
The  values  for  A  =  0  correspond  to  the  classical  MMSE 
PD.  The  optimum  value  of  A  is  the  one  which  minimizes 
the  degradation  and  we  can  see  that  for  this  value  the 
fuzzy-controlled  MMSE  PD  outperforms  the  classical 
one.  Predictibly,  the  improvement  is  large  at  low  SNR 
since  the  number  of  erroneous  decisions  is  high,  while  it 
is  small  at  high  SNR  because  the  decisions  are  almost 
completely  correct. 

4.  CONCLUSIONS 

In  this  paper  we  have  shown  the  potentiality  of  apply¬ 
ing  fuzzy  logic  to  improve  the  performance  of  tracking 
loops.  In  particular,  we  have  developed  a  specific  and 
very  simple  example  of  a  fuzzy-rule-based  phase  detec¬ 
tor.  Even  in  this  very  simple  case  (only  one  rule)  an 
improvement  in  the  steady-state  phase  variance  can  be 
achieved  by  chosing  an  appropriate  value  of  A.  Modi¬ 
fications  in  the  shape  of  the  membership  functions  or 
the  use  of  more  sophisticated  rules  taking  into  account 
other  conditions  (SNR  level,  speed  of  convergence,  loop 
bandwidth,  etc...)  are  expected  to  provide  further  im¬ 
provement. 
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Figure  5:  Membership  functions  of  the  fuzzy  sets  corre¬ 
sponding  to  the  ”correcteness  of  the  decision”  (a)  and 
(b)  and  to  the  ”  strength  of  the  PD  error  signal”  (c)  in 
the  case  of  a  QPSK  constellation. 


Figure  7:  Phase  Detector  slope. 


Figure  6:  Uncertainty  regions  for  a  QPSK  constella¬ 
tion. 


Figure  8:  Degradation  with  respect  to  the  Cramer-Rao 
Bound  as  a  function  of  A. 
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ABSTRACT 

The  last  few  decades  have  witnessed  tremendous 
progress  in  the  performance,  reliability,  and  wide-spread 
use  of  speech-processing  devices.  Using  mathematical 
models  of  human  speech  production  and  perception  has 
been  an  important  factor  in  the  improved  performance 
of  these  devices.  In  this  tutorial  paper,  we  review 
recent  advances  in  speech  production  and  perception 
modeling  and  summarize  the  challenges  that  lie  ahead 
in  developing  fully  parametric  models. 

1.  INTRODUCTION 

Quantitative  models  of  human  speech  production  and 
perception  provide  important  insights  into  our  speech 
production  and  perception  mechanisms  and  lead  to  high- 
quality  computer  synthesis  of  speech,  robust  automatic 
speech  recognition  (ASR),  and  efficient  speech  and  au¬ 
dio  coders.  These  issues  are  of  importance  in  the  devel¬ 
opment  of  effective  human-computer  communications 
through  the  medium  of  human  language. 

Simplified  linear  models  of  speech  production  form 
the  basis  of  several  speech  synthesizers  [Klatt,  1987] 
and  the  most  widely-used  speech  coder  today:  Code- 
excited-linear-prediction  (CELP)  [Schroeder  and  Atal, 
1985].  Simple  auditory  models  have  been  used  suc¬ 
cessfully  in  optimizing  the  performance  of  speech  and 
audio  coders  [Schroeder  et  al.,  1979;  Shen  et  al.,  1995; 
Johnston,  1988];  these  models  are  also  embedded  in 
the  MPEG  audio-coding  standards  [Brandenburg  and 
Stoll,  1992].  Using  auditory  models  as  preprocessors 
has  resulted  in  improving  the  performance  of  recogni¬ 
tion  systems  in  noise  [Ghitza,  1986]. 
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Recently,  with  rapid  advances  in  hearing  research, 
medical  imaging,  non-linear  analysis,  and  computer  tech¬ 
nology,  more  sophisticated  speech  production  and  per¬ 
ception  models  are  being  proposed. 

2.  SPEECH  PRODUCTION  MODELS 

Speech  production  research  is  directed  towards  better 
understanding  and  quantitative  characterization  of  the 
acoustics,  articulatory  dynamics,  and  cognition  of  both 
normal  and  pathological  human  speech.  Speech  pro¬ 
duction  modeling  efforts,  however,  face  at  least  two 
major  challenges:  (1)  the  lack  of  appropriate  physical 
and  physiological  data  which  are  crucial  for  developing 
quantitative  models,  and  (2)  the  presence  of  articula¬ 
tory  and  acoustic  variabilities,  both  within  and  across 
speakers. 

A  great  deal  of  attention  is,  hence,  given  to  the 
development  and  use  of  novel  measurement  and  instru¬ 
mentation  techniques  that  provide  better  insights  into 
speech  production  mechanisms.  Examples  of  such  tech¬ 
niques  include  the  use  of  magnetic  resonance  imaging 
(MRI)  to  study  the  3D  vocal-tract  geometry,  use  of  ul¬ 
trasound  [Stone,  1990],  x-ray  microbeam  and  electro¬ 
magnetic  midsagittal  articulometer  (EMMA)  [Perkell, 
1992]  to  study  tongue  dynamics,  measuring  linguopalatal 
interaction  using  electropalatography  (EPG)  [Hardcas- 
tle  et  al,  1989],  and  the  use  of  microsensor-based  aero¬ 
dynamic  and  acoustic  measurements  in  the  vocal  tract. 
Since  most  of  these  techniques  provide  information  about 
certain  specific  attributes  of  speech  production,  data 
are  typically  collected  using  several  of  these  techniques 
in  parallel,  if  not  simultaneously.  Novel  signal  process¬ 
ing  techniques  are  often  required  to  process,  visual¬ 
ize,  and  quantify  the  various  physical  and  physiological 
speech  production  data  thus  obtained. 

A  major  focus  of  speech  production  research  is  in 
modeling  articulatory-acoustic  relationships  of  speech 
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sounds.  Physically  and  physiologically-based  models 
for  speech  acoustics  are  particularly  important  for  de¬ 
veloping  high-quality  speech  synthesis  and  low  bit  rate 
(articulatory)  coding.  Significant  progress  has  been  re¬ 
cently  made  towards  developing  improved  articulatory- 
acoustic  models.  Extensive  data  from  human  subjects 
using  techniques  such  as  MRI  were  used  to  obtain  ac¬ 
curate  quantitative  and  qualitative  characterizations 
of  the  vocal-tract  geometry  for  vowels  and  consonants 
[Baer  et  al.,  1991;  Narayanan  et  al.,  1995].  The  infor¬ 
mation  obtained  from  these  experiments,  such  as  ac¬ 
curate  vocal-tract  dimensions,  were  then  employed  in 
modeling  the  acoustics  of  these  sounds.  For  example, 
physically- motivated  ID  acoustic  models  for  fricatives, 
showing  good  agreement  with  the  spectra  of  natural 
fricatives,  were  recently  reported  [Narayanan,  1995]. 

Numerical  simulation  of  the  acoustic  wave  propa¬ 
gation  in  actual  3D  vocal-tract  models,  particularly  for 
^static’  scenarios  corresponding  to  sustained  sounds, 
using  techniques  such  as  finite  element  and/or  finite 
time-difference  methods  has  received  wide  attention 
[Miki  et  al.,  1994;  Cummings  et  al.,  1995].  This  is 
largely  due  to  the  vast  improvements  in  computational 
capabilities  and  the  availability  of  more  realistic  vocal- 
tract  data.  An  alternative  to  numerical  simulations  for 
studying  flow  problems  in  the  3D  vocal  tract  is  to  use 
mechanical  analogs.  It  should,  however,  be  noted  that 
simulation  of  speech  dynamics  in  3D  models  is  an  ex¬ 
tremely  challenging  problem,  which  is  as  yet  largely 
untackled. 

Considerable  interest  has  also  been  devoted  to  the 
inverse  problem  in  speech  production  i.e,,  the  identi¬ 
fication  of  articulatory  parameters  from  the  acoustic 
speech  signal.  Solution  to  the  inverse  problem  has  po¬ 
tential  applications  in  low  bit  rate  coding  and  speech 
recognition  [Schroeter  and  Sondhi,  1992;  Deng  and  Sun, 
1994].  It  is,  however,  important  to  bear  in  mind  that 
the  unconstrained  inverse  problem  is  non-unique.  That 
is,  the  acoustic  to  articulatory  transformation  is  not 
one-to-one.  Identifying  appropriate  constraints  to  solve 
uniqueness  is  not  a  solved  problem.  Various  techniques 
such  as  neural  networks  [Shirai,  1993]  and  genetic  algo¬ 
rithms  [McGowan,  1994]  have  been  proposed  for  solv¬ 
ing  the  inverse  problem  although  their  viability  is  yet 
to  be  fully  demonstrated. 

Novel  approaches  such  as  those  drawn  from  non¬ 
linear  dynamical  systems  (chaos)  theory  have  recently 
found  applications  in  speech  analysis.  Such  analyses 
have  provided  further  insights  into  the  production  of 
pathological  speech  [Titze  et  al.,  1993]  and  consonant 


sounds  such  as  fricatives  which  are  characterized  by 
turbulence  formation  in  the  vocal  tract  [Narayanan  and 
Alwan,  1995]. 

Recently,  there  has  also  been  an  interest  in  under¬ 
standing  higher-level  processing  during  speech  produc¬ 
tion  by  monitoring  brain  activity  using  functional  MRI 
[Shaywitz  et  al.,  1995]  in  conjunction  with  other  meth¬ 
ods  such  as  PET. 

Considerable  efforts  directed  towards  modeling  speech 
production  are  hoped  to  provide  a  clearer  picture  of  the 
underlying  mechanisms  and  help  in  the  development  of 
better  engineering  applications.  The  significance  of  effi¬ 
cient  analysis  schemes  (identification  of  the  articulatory 
parameters  from  the  speech  waveform)  and  synthesis 
schemes  (generation  of  the  acoustic  speech  waveform 
from  the  articulatory  parameters)  extends  well  beyond 
the  realm  of  speech  production  modeling  with  clinical 
and  linguistic  implications. 

3.  SPEECH  PERCEPTION  MODELS 

Much  of  the  evidence  for  current  auditory  models  are 
based  on  acoustic  masking  experiments.  Static  (or  si¬ 
multaneous)  masking  experiments  have  led  to  the  defi¬ 
nition  of  the  critical  bandwidth,  and  a  non-uniform  fil¬ 
ter  bank  model  of  auditory  perception  [Fletcher,  1940; 
Zwicker  et  al,  1957;  Zwicker  and  Terhardt,  1980].  An¬ 
other  well-quantified  aspect  of  auditory  processing  is 
“loudness  equalization”  [Robinson  and  Dadson,  1956]. 
For  example,  the  perceived  loudness  of  two  tones  at  dif¬ 
ferent  frequencies  might  be  different  even  though  their 
intensity  levels  (in  dB  SPL)  are  the  same. 

Dynamic  (or  non-simultaneous,  forward)  masking 
experiments  [Plomb,  1964;  Jesteadt  et  al,  1982;  Moore 
and  Glasberg,  1983]  also  provide  significant  insight  for 
auditory  modeling  efforts.  Specifically,  forward  mask¬ 
ing  experiments  suggest  that  the  relative  levels  of  short- 
time  spectral  estimates  are  perceptually  significant. 

Further,  perceptual  experiments  with  spectrally  com¬ 
plex  speech-like  stimuli  indicate  that  human  audition  is 
particularly  sensitive  to  the  frequency  location  of  local 
spectral  peaks  [Klatt,  1982].  Measurements  of  the  tim¬ 
ing  detail  of  individual  inner  hair  cell  responses  reveal 
firing  responses  which  track  dominant  spectral  features 
[Delgutte  and  Kiang,  1984]  (instead  of  exclusively  rep¬ 
resenting  the  spectral  content  at  the  fiber’s  center  or 
best  frequency  as  a  filter  bank  model  might  predict) 
suggesting  a  possible  mechanism  for  this  spectral  peak 
sensitivity. 
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Our  understanding  of  auditory  processing  of  speech 
signals  is  at  an  early  stage;  one  of  the  major  challenges 
in  the  next  few  decades  is  unraveling  and  modeling  the 
neural  mechanisms  involved  in  speech  processing  [Pick¬ 
les,  1994]. 

Auditory  Models  and  Automatic  Speech  Recog¬ 
nition 

Early  efforts  to  build  automatic  speech  recognition 
machines  occurred  in  the  1950s  at  Bell  Laboratories 
[Davis  et  al,  1952].  These  systems  relied  on  identify¬ 
ing  the  spectral  resonances  of  vowels  within  isolated 
digits.  Despite  tremendous  research  focus,  and  many 
significant  technical  advances,  a  robust  speech  recogni¬ 
tion  system  that  approaches  human  performance  still 
does  not  exist.  Research,  therefore,  continues. 

Typical  speech  recognition  systems  involve  two  fun¬ 
damental  steps:  short-term  spectral  analysis,  followed 
by  pattern  comparison  with  representative  templates 
(or  statistical  models  of  templates).  In  the  1970s,  LPC- 
based  spectral  analysis  [Itakura,  1975],  and  the  appli¬ 
cation  of  dynamic  programming  techniques  to  pattern 
comparison  [Vintsyuk,  1968]  led  to  successful  isolated 
word  recognition  systems.  Today  many  systems  use 
Mel- Frequency  Cepstral  Coefficients  (and  their  tempo¬ 
ral  derivatives)  for  spectral  analysis,  and  Hidden  Markov 
Models  (HMM)  of  the  templates  for  pattern  compar¬ 
ison  [Rabiner,  1989].  Mel- Frequency  Cepstral  Coeffi¬ 
cients  (MFCC)  are  defined  as  the  DCT  of  log  spec¬ 
tral  estimation  obtained  with  a  critical  bandwidth-like 
non-uniform  filter  bank  model  [Davis  and  Mermelstein, 
1980].  The  DCT  provides  an  orthogonal  transforma¬ 
tion  to  a  vector  space  with  better  energy  compaction, 
which  therefore  requires  fewer  coefficients  per  acous¬ 
tic  vector.  Including  temporal  derivatives  of  the  cep¬ 
stral  coefficients  in  the  acoustic  observation  vector  is 
to  account  for  dynamic  sensitivity  [Rabiner  and  Juang, 
1993]. 

Providing  a  rigorous  stochastic  framework,  HMMs 
and  the  techniques  to  train  and  apply  them,  have  led  to 
successful  large- vocabulary  speaker-independent  systems 
[eg.,  Lee  et  al,  1991].  Unfortunately  the  performance 
of  these  systems  still  degrades  significantly  when  the 
acoustic  environment  (amount  and  type  of  background 
noise,  reverberation,  competing  sources,  etc.)  differs 
from  the  training  data. 

HMM  systems  [Rabiner,  1989]  rely  on  a  fundamen¬ 
tal  stochastic  identity  (Bayes  Theorem).  HMMs  pro¬ 
vide  the  probability  of  an  acoustic  observation  for  each 
specific  template  model.  We  then  choose  the  template 


model  which  generates  the  highest  probability  of  the 
acoustic  observation  occurring. 

If  we  are  trying  to  answer  the  question:  ^What  was 
most  likely  said?”  it  seems  reasonable  that  we  also 
want  to  characterize  statistically  the  acoustic  distinc¬ 
tions  between  sounds.  Although  recent  efforts  are  ad¬ 
dressing  this  issue,  the  formal  derivation  of  a  compu¬ 
tationally  tractable  training  procedure  is  much  more 
difficult,  and  remains  an  open  question. 

Using  MFCC  and  their  temporal  derivatives  to  rep¬ 
resent  the  acoustic  observation,  current  recognizers  in¬ 
corporate  a  suitable  model  of  frequency  selectivity  and 
a  rough  approximation  of  short-term  auditory  adap¬ 
tation.  A  number  of  researchers  have  proposed  mod¬ 
els  with  more  explicit  short-term  auditory  adaptation 
[Seneff,  1988;  Lyon  and  Mead  1988;  Hermansky  and 
Morgan  1994],  however  parameterizing  these  models  so 
that  they  are  quantitatively  consistent  with  measurable 
top-level  functionality  remains  a  consistent  challenge. 

The  recent  system  we  developed  includes  a  model  of 
short-term  auditory  adaptation  parameterized  through 
a  series  of  perceptual  forward-masking  experiments,  and 
a  novel  cepstral  processing  technique  to  isolate  local 
spectral  peaks  [Strope,  1995].  Figure  1  illustrates  dif¬ 
ferent  representations  for  the  digits  one,  three,  and 
nine,  spoken  by  a  male  speaker.  The  top  part  of  the 
figure  is  the  time  waveform,  the  middle  part,  a  spectro¬ 
gram  computed  using  cepstral  coefficients  derived  from 
a  linear  prediction  model,  and  the  third  is  a  “percep¬ 
tual”  spectrogram  computed  using  our  dynamic  per¬ 
ceptual  model.  Note  how  the  perceptual  representa¬ 
tion  highlights  onsets  and  spectral  transitions  which  are 
perceptually  important.  Using  a  dynamic-programming 
based  recognition  system,  this  representation  leads  to 
significantly  more  robust  recognition  performance  when 
compared  to  common  cepstral  and  cepstral  derivative 
representations.  Currently,  we  are  also  evaluating  this 
representation  with  an  HMM  system. 

Stochastic  modeling  provides  a  powerful  tool  to  train 
recognition  systems  based  on  a  sequence  of  acoustic 
observations.  Future  efforts  will  undoubtedly  continue 
to  improve  these  stochastic  models  and  training  tech¬ 
niques.  We  should  also  expect,  however,  increased  per¬ 
formance  by  incorporating  acoustic  observation  sequences 
that  are  more  consistent  with  human  perception. 

Auditory  Models  and  Speech  and  Audio  Coding 

Speech  coding  is  the  process  of  obtaining  a  compact 
representation  of  voice  signals  for  efficient  transmission 
and  storage.  Obtaining  an  efficient  method  of  transmit- 
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ting  speech  signals  over  band-limited  wired  and  wireless 
channels  drives  the  research  in  speech  coding.  Today, 
in  a  software- controlled  digital  format,  speech  coders 
have  become  an  essential  component  in  telecommunica¬ 
tions  and  in  the  multimedia  infrastructure  [Jayant  and 
Cox,  1993].  Commercial  systems  which  rely  heavily 
on  efficient  speech  coding  include  cellular  communica¬ 
tion,  video  conferencing,  digital  simultaneous  voice  and 
data  (DSVD),  as  well  as  numerous  PC-based  games 
and  multimedia  applications. 

To  measure  the  overall  effectiveness  of  digital  speech 
coders,  three  metrics  are  commonly  used:  complexity, 
perceptual  quality,  and  bit  rate.  The  complexity  of  a 
speech  coding  algorithm  is  reflected  in  the  number  of 
machine  instructions  executed  when  operating  in  a  real 
time  environment,  measured  in  millions  of  instructions 
per  second  (MIPS).  The  second  criterion  is  perceptual 
quality.  A  common  metric  used  to  measure  percep¬ 
tual  quality  is  the  Mean  Opinion  Score  (MOS).  This 
measure  is  an  average  score  derived  from  subjective 
listening  tests  using  a  five-point  scale.  A  score  of  one 
is  assigned  to  speech  with  ^poor’  quality,  while  a  score 
of  five  is  given  to  speech  with  ‘excellent^  quality.  The 
third  metric  is  the  operating  bit  rate  of  a  coder,  which  is 
the  rate  at  which  data  must  be  transmitted  or  stored 
in  order  to  effectively  reconstruct  the  original  speech 
signal  [Jayant  and  Noll,  1984]. 

Auditory  modeling  can  aid  in  the  design  of  speech 
and  audio  coders  in  two  important  ways:  1)  develop¬ 
ing  an  objective  measure  to  rapidly  and  reliably  as¬ 
sess  the  performance  of  a  coder  instead  of  conduct¬ 
ing  time-consuming  and  expensive  subjective  listening 
tests  [Wang  et  ah,  1992],  and  2)  optimizing  the  perfor¬ 
mance  of  perceptually-based  speech  and  audio  coding 
schemes  [Shen,  1994;  Shen  et  al,  1995]. 

Simplified  auditory  models  have  been  used  success¬ 
fully  in  both  areas  (developing  objective  measures  and 
designing  speech  and  audio  coders).  These  simplified 
models  view  speech  as  a  sequence  of  unrelated  static 
segments  and  exploit,  predominantly,  static  masking 
effects.  A  more  complete  auditory  model,  especially 
one  which  takes  into  account  dynamic  spectral  distor¬ 
tions,  will  undoubtedly  have  an  important  impact  in 
speech  coding  research. 

4.  SUMMARY  AND  CONCLUSION 

It  is  clear  that  our  understanding  of  speech  produc¬ 
tion  and  perception  mechanisms  has  improved  tremen¬ 
dously  in  the  past  few  decades.  Further  multidisci¬ 
plinary  research  in  signal  processing,  acoustics,  psy¬ 


choacoustics,  linguistics,  imaging,  and  auditory  physi¬ 
ology  are  needed  to  better  model  these  mechanisms. 
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Figure  1.  Different  representations  of  the  digits  “one,”  “three,”  and  “nine.”  The  top 
shows  the  time-domain  waveform,  the  middle  is  the  spectral  estimation  from  short- 
time  LP-Cepstral  analysis,  and  the  bottom  shows  the  output  of  our  dynamic 
perceptual  model.  The  “perceptual  spectrogram”  highlights  salient  onsets,  spectral 
transitions,  and  local  spectral  peaks,  providing  a  more  noise-robust  representation. 
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ABSTRACT 

Microphone  arrays  can  capture  desired  signals  and  at¬ 
tenuate  interfering  noise  and  reverberation  in  multipath 
environments.  Hands-free  speech  technologies  can  there¬ 
fore  be  supported  using  microphone  arrays.  This  report 
considers  delay-and-sum  beamformers,  advanced  array 
technology  using  matched-filters  and  hands-free  speaker 
identification  using  microphone  arrays. 

I.  Introduction 

Microphones  are  used  to  detect  and  transduce  sounds. 
However,  the  effectiveness  of  a  microphone  can  be  lim¬ 
ited.  Received  sound  quality  can  be  degraded  by  dele¬ 
terious  properties  of  the  acoustic  environment,  such  as 
multipath  distortion  (reverberation)  and  ambient  noise. 
In  a  teleconference,  a  microphone  is  able  to  pick  up 
sounds  of  a  talker  seated  close  to  the  microphone  bet¬ 
ter  than  one  seated  far  away.  Ideally,  the  teleconfer¬ 
ence  should  feel  as  natural  as  face-to-face  conversation. 
This  suggests  the  need  for  a  hands-free  sound  capturing 
system  that  has  neither  a  tether  nor  the  encumbrance 
of  hand-held  or  body-worn  sound  equipment.  Acoustic 
noise  and  multipath  distortion  make  hands-free  sound 
capture  particularly  difficult  to  achieve.  The  ideal  is 
to  approach  sound  pickup  comparable  to  that  from  a 
close-talking  microphone.  The  microphone  array  is  an 
attractive  approach. 

An  array  microphone  system  is  constructed  of  a  num¬ 
ber  of  microphones  distributed  in  space.  The  output  of 
each  individual  microphone  is  processed  and  all  outputs 
are  summed.  In  a  reverberant  environment,  the  micro¬ 
phone  array  improves  the  quality  of  sound  capture. 

Today,  advanced  technology  in  electrets  provides  low- 
cost,  high-performance  transducers.  In  addition,  the 
increased  computational  power  and  decreeised  cost  of 
digital  signal  processors  (DSPs)  make  computationally 
demanding  systems  implement  able.  By  utilizing  digi¬ 
tal  signal  processing  techniques,  it  is  practical  to  design 
large  scale  microphone  array  systems  for  high  quality 
sound  capture  for  group  conferences  in  large  enclosures. 

II.  Delay-and-sum  beamformer 

The  delay-and-sum  beamformer  delays  the  output  of 
each  transducer  to  coph2ise  the  sound  arriving  from 

*  This  research  is  supported  by  NSF  contracts  MIP-9121541 
and  MIP-9314625 
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Figure  1:  One  dimensional  microphone  array  of  (2NH-1) 
elements. 

a  given  direction.  This  technique  has  proved  very 
effective  for  hands-free  sound  capture  in  moderately- 
reverberant  auditoriaand  teleconference  rooms  [1,  3,  4]. 
Arrays  for  single  beamforming  may  be  one-,  two-  or 
three-dimensional  in  architecture  dependending  upon 
the  characteristics  of  the  desired  spatial  selectivity.  But, 
single  beamformers  are  adversely  affected  by  severe  mul¬ 
tipath  distortion,  and  function  best  if  the  enclosure  has 
at  least  a  modest  amount  of  sound  treatment. 

Fig.  (1)  depicts  a  line  array  of  (2N“fl)  omni¬ 
directional  microphones  spaced  equally  by  distance  d. 
All  microphones  are  identical  with  the  same  sensitiv¬ 
ity.  A  typical  omni-directional  microphone  is  sensitive 
to  pressure  fluctuations  of  an  acoustic  wave  with  a  flat 
frequency  response  over  the  audio  frequency  range  for  all 
incident  angles  of  the  arriving  signal.  For  a  plane  wave 
from  the  direction  of  angle  (f>,  the  time-domain  impulse 
response  of  a  uniformly  weighted  line  array  is  [3] 

N 

h{t)=  Ht  +  nT)  (1) 

n=-JV 

where  t  is  time,  6  is  the  Dirac  delta  function,  and  T 
is  the  time  difference  of  transit  delay  between  adjacent 
microphones.  The  frequency  response  of  Eq.  (1)  is  the 
transform 

eia.»T  _  8in[(2Jy  +  l)(a)T/2)]  ,  . 

H{ju)  -  ^  ^  -  sin(a,T/2) 
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SNR  for  a  randomly  distributed  matched-filler  array  with  1X  mics 


Figure  2:  Simulation  of  a  single  beam  formed  by  a  2- 
dimensional  rectangular  array  steered  to  a  source  in  a 
hard- walled  room. 

where  u  is  the  radian  frequency.  The  line  array  can  be 
steered  to  angle  if  the  output  of  the  microphone  is 
delayed  by  ndcos(^«)/c,  where  c  is  speed  of  sound.  The 
time  difference  between  adjacent  microphones  is  T  = 
{cos{(f>)  -  cos{<f>s))d/c.  Therefore,  the  spatial  response 
for  a  (2N-i-l)  element  line  array  steered  to  angle  is 

_  8in[(2JV  +  l)(dc.;[cos(^)  -  co8(^,)]/(2c))] 
sin[da)[cos(^)  —  cos((^,)]/(2c)] 

(3) 

The  delay-and-sum  system  can  direct  a  beam  of  sensi¬ 
tivity  along  the  direction  of  the  sound  source,  and  can  at¬ 
tenuate  sound  arriving  from  other  directions.  However, 
in  the  multipath  environment,  the  beam  will  also  pick  up 
all  reflected  signals  (images)  along  the  beam  direction. 
Fig.  (2)  shows  a  simulation  of  a  two-dimensional  rectan¬ 
gular  array,  which  is  single  beamformed  to  a  source  in 
a  hard-wall  room.  Rays  up  to  second  order  images  are 
plotted.  From  this  figure,  one  can  see  that  some  images 
are  along  the  “bore”  of  the  beam,  leading  to  degradation 
of  the  quality  of  sound  pickup. 

The  beamwidth  of  the  array  spatial  response  (Eq.  (3), 
Fig.  (2))  is  a  function  of  frequency.  To  provide  constant 
beam-width  over  a  wide  frequency  range,  harmonically 
nested  sub-bands  of  sensors  are  used  [4] .  Research  con¬ 
tinues  on  constant  beamwidth  for  delay-and-sum  beam- 
formers  [2,  9]. 

III.  Matched-Filter  Arrays 

Matched-filter  techniques  can  be  applied  to  microphone 
arrays  to  improve  noise  rejection  and  to  achieve  spatial 
selectivity  in  three  dimensions  [5,  6].  A  matched-filter 
is  a  causal  approximation  to  the  time  reverse  of  the  im¬ 
pulse  response  of  the  system  to  be  matched.  In  the  ar¬ 
ray,  a  matched-filter  is  dedicated  to  each  microphone. 
The  array  output  is  the  summation  of  outputs  from  all 
matched-filters.  Because  the  time-inverse  impulse  re¬ 
sponse,  h{—t)^  is  typically  non-causal,  truncation  and 
fixed  delays  are  required  to  realize  a  causal  filter  which 


xaxis  (m) 

Figure  3:  SNR  of  a  100  randomly-distributed  MFA  for 
sources  located  at  various  positions  on  the  X-Y  plane 
and  passing  through  the  focal  point. 

approximates  the  desired  response. 

The  impulse  response  from  the  desired  focal  point  to 
each  receiver  in  the  array  is  required  to  implement  the 
Matched-Filter  Array  (MFA).  This  response  can  either 
be  calculated  from  the  room  geometry  or  measured  in 
actual  rooms.  For  a  source  located  at  the  focal  point 
emitting  a  signal  of  ^(i),  the  temporal  output  of  the 
MFA  is 

N  N 

n=l  n=l 

(4) 

where  hnf{t)  is  the  impulse  response  from  the  focal  point 
to  the  nth  sensor,  N  is  the  total  number  of  sensors,  and 
*  denotes  convolution.  When  the  source  is  off  the  focal 
position  the  temporal  output  of  the  array  is 

N 

0{t)  =  s(t)  *  ^  ^  hnsi"^)  *  h.n/(”^)  (h) 

n  =  l 

where  hnsit)  is  the  impulse  response  from  the  source  to 
the  nth  sensor.  One  can  immediately  notice  that  the 
size  of  the  focal  volume  for  the  retrieval  of  low  distor¬ 
tion  signals  is  dependent  on  the  spatial  correlation  of  the 
impulse  responses  hnf{t)  and  h„,(t). 

The  configuration  of  the  matched-filter  array  is  of  im¬ 
portance.  A  newly  invented  randomly  distributed  array 
with  100  sensors  outperforms  a  31  x  31  uniformly  spaced 
2-D  array.  The  spatial  volume  selectivity  of  the  MFA 
can  be  demonstrated  using  a  ratio  of  undistorted  signal 
power  to  reverberant  (interfering)  noise  power  (SNR). 
Fig.  (3)  shows  the  SNR  for  a  100  element  randomly  dis¬ 
tributed  MFA  for  various  source  locations  on  the  X-Y 
plane.  The  SNR  is  displayed  in  decibels.  The  center  of 
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Figure  4:  Impulse  response  of  a  two  100  randomly- 
distributed  MFA  with  105  ms  truncation. 

the  array  is  located  at  (10,0,3)  m  in  a  20x16x5  m  room. 
The  wall  reflection  coefficient  is  0.9.  The  microphone 
spacing  is  8  cm  and  the  focal  point  of  the  MFA  is  fixed 
at  (14,9,1.7)m.  The  source  locations  are  sampled  by  an 
11x11  rectangular  grid,  using  the  focal  point  as  the  cen¬ 
ter  of  the  grid.  The  grid  size  is  0.5  m.  The  sampling  rate 
is  8  kHz  and  up  to  fifth  order  images  are  included  in  the 
simulation.  The  lower  graph  in  Fig.  (3)  is  a  contour  plot 
of  the  SNR  which  presents  equal  SNR’s.  The  peak  is 
clearly  located  at  the  focal  point.  The  SNR  values  drop 
dramatically  even  if  the  source  is  50  cm  away  from  the 
focus.  This  figure  shows  the  acuteness  of  the  spatial  vol¬ 
ume  of  the  MFA  which  implies  that  those  sources  which 
are  only  50  cm  away  or  more  from  the  focal  point  will 
be  rejected  by  the  array.  Signal  processing  techniques 
permit  control  of  the  size  of  the  focal  volume. 

Truncation  of  matched-filter  for  the  MFA  is  usually 
required  because  of  the  following  reasons.  If  the  full 
length  impulse  response  is  employed, 

(1)  the  system  delay  is  at  least  equal  to  reverberation 
time  which  is  usually  not  tolerable. 

(2)  large  amounts  of  computation  are  required  for  con¬ 
volution. 

(3)  the  sound  quality  is  diminished  by  the  long  “tails” 
of  the  impulse  response. 

Fig.  (4)  shows  the  simulation  results  of  the  truncated 
MFA.  The  matched-filter  is  truncated  at  105  ms.  Up 
to  fifth  order  images  are  included.  Two  100  randomly- 
distributed  arrays  are  mounted  on  two  orthogonal  walls. 
In  time  domain  impulse  responses,  the  magnitude  of  the 
major  impulse  is  slightly  smaller  than  the  untruncated 
MFA  and  the  tails  after  the  major  impulse  are  similar. 
However,  the  tg,ils  before  the  major  impulse  are  much 
shorter.  Therefore,  the  echoes  before  the  desired  signals 
are  significantly  reduced.  Because  the  backward  mask¬ 
ing  is  always  greater  than  forward  masking,  the  trun¬ 
cated  system  provides  a  much  better  sound  quality.  In 
addition,  the  computation  is  reduced  dramatically  and 


Figure  5:  Frequency  response  of  a  single  matched-filter 
and  a  MFA  with  46  sensors  in  a  real  room. 

the  delay  of  the  system  is  only  105ms.  Significant  im¬ 
provement  of  sound  capture  quality  is  shown  using  real 
room  data. 

An  experimental  array  with  46  channels  was  con¬ 
structed  in  a  real  room.  The  data  was  collected  in  a 
regular  bard- walled  laboratory  with  dimensions  6x6x3m. 
The  reverberation  time  is  approximately  0.5  sec.  The 
impulse  response  from  the  source  to  the  microphone  is 
measured  using  Maximum- Length  Pseudo  Random  se¬ 
quences.  During  the  data  collection,  several  computers 
generating  fan  noise  were  inside  the  room.  Therefore,  a 
high-pass  filter  with  cut  off  frequency  at  200  Hz  is  em¬ 
ployed.  Fig.  (5)  shows  the  frequency  responses  of  a  sin¬ 
gle  matched-filter  and  of  the  composite  of  46  matched- 
filters.  The  impulse  response  of  46  matched-filters  is 
almost  flat,  explaining  the  reason  for  high  quality  au¬ 
dio  output  from  the  MFA  in  a  multipath  environment. 
The  small  amplitude  in  the  low  frequency  range  (0  Hz  to 
200  Hz)  is  due  to  the  high-pass  filtering.  The  amplitude 
drops  around  4kHz  is  due  to  anti-aliasing  filter  of  the 
D/A  conversion. 

Research  continues  to  reduce  the  number  of  micro¬ 
phones  by  optimum  array  configurations  to  relieve  the 
computational  burden  and  to  make  hardware  implemen¬ 
tation  more  practical. 

IV.  Hands-free  speaker  identification 

In  this  section,  hands-free  speaker  identification  (speaker 
ID)  is  presented.  The  reasons  for  pursuing  this  study  are 
as  follows.  First,  both  spatial  selectivity  and  speaker  se¬ 
lectivity  are  often  required  in  a  teleconferencing  environ¬ 
ment.  Speaker  selectivity  enables  the  array  to  capture 
the  designated  speaker  in  the  enclosure.  The  speaker 
identification  system  aids  this  process.  Also,  in  multi¬ 
point  video  conferencing,  hands-free  speaker  ID  assists 
participants  in  identifying  remote  talkers.  This  talker 
identification  and  localization  contribute  to  realism. 

Secondly,  the  error  rate  measurement  for  speaker  ID 
can  be  used  as  an  alternate  index  to  demonstrate  the 
performance  of  the  array  in  comparison  to  a  single  micro¬ 
phone.  The  performance  of  the  array  is  tested  through 
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speaker  recognition  as  opposed  to  speech  recognition 
since  a  speaker  recognition  system  is  simpler  and  is  less 
likely  to  be  influenced  by  the  details  of  the  “back  end”, 
such  as  speaker  dependency  and  the  language  model. 
Further,  the  text  independent  speaker  identification  re¬ 
lies  exclusively  on  the  acoustic  information  carried  in 
individual  frames  of  the  time  waveform.  The  result  is 
more  useful  in  evaluating  speech  enhancement  of  the  mi¬ 
crophone  array  [7].  Additionally,  speaker  ID  permits  se¬ 
curity  control  for  data  access  which  is  often  desired  in  a 
multipoint  conference. 


Two  parts  are  included  in  the  design  of  the  speaker 
ID  system.  For  experimentation,  the  commonly  used 
LPC  derived  cepstrum  is  used  to  provide  measured  fea¬ 
tures  and  a  Vector  Quantizer  (VQ)  with  one  codebook 
for  each  speaker  is  used  as  the  clcissifier.  The  New  Eng¬ 
land  subset  of  the  TIMIT  database  is  used  as  the  speech 
database.  This  subset  is  comprised  of  24  males  and  14 
females.  For  each  of  the  38  speakers,  the  first  five  sen¬ 
tences  are  concatenated  and  used  as  the  training  data, 
and  another  five  sentences  are  used  as  the  testing  data. 
The  total  number  of  trials  is  190  (5  x  38).  The  TIMIT 
data,  originally  bandlimited  to  8  kHz  and  sampled  at 
16  kHz,  are  low-pass  filtered  to  4  kHz  and  down  sam¬ 
pled  to  8  kHz.  The  hands-free  speaker  identification 
system  is  evaluated  using  speech  signals  impaired  both 
by  room  reverberation  and  interfering  noise.  The  de¬ 
graded  speech  data  is  generated  by  computer  simulation 
and  the  previously  discussed  MFA  is  used  as  a  front-end 
sound  capture  device. 

The  dimensions  of  the  room  used  in  the  simulation 
are  20  x  16  x  5  m.  The  acoustic  absorption  coefficient 
for  all  of  the  walls  is  set  to  0.1,  representing  a  highly  re¬ 
verberant  enclosure.  Again,  images  up  to  the  fifth  order 
are  included  to  generate  the  reverberant  speech.  The 
reverberation  time  of  the  enclosure  is  approximately  1.6 
seconds.  The  speech  input  for  the  room  simulation  is  a 
signal  source  placed  at  coordinates  (14,9.5,1.7)  m.  The 
system  performance  is  evaluated  under  various  intensi¬ 
ties  of  interfering  noise  power.  The  noise  is  generated  by 
a  Gaussian  random  number  generator  and  is  located  at 
coordinates  (3. 0,5.0, 1.0)  m. 

Fig.  (6)  shows  the  scores  for  correct  identification  from 
a  single  microphone  and  from  two  31x31  MFA’s  placed 
on  orthogonal  walls  of  the  room.  Sensors  of  the  arrays 
are  uniformly  spaced  at  4  cm.  The  system  is  evaluated 
under  matched  condition  where  the  acoustic  conditions 
for  training  and  testing  section  are  identical.  Four  differ¬ 
ent  values  of  interfering  noise  SNR’S  are  used,  ranging 
from  20  dB  to  0  dB.  Fig.  (6)  shows  that  the  correct 
identification  score  of  the  single  microphone  system  de¬ 
creases  significantly  as  SNR  decreases,  while  the  perfor¬ 
mance  of  the  MFA  remains  nearly  unchanged,  showing 
that  the  MFA  can  be  used  as  an  effective  front-end  de¬ 
vice  for  hands-free  speaker  ID  systems. 


Figure  6:  Speaker  identification  under  a  matched  train¬ 
ing  and  testing  condition.  An  interfering  noise  is  pre¬ 
sented  with  varying  SNR. 

V.  Conclusion 

The  matched-filter  microphone  array  is  able  to  pro¬ 
vide  high-quality  hands-free  sound  capture  in  reverber¬ 
ant,  distance  talking  environments.  Conference  partici¬ 
pants  are  provided  an  environment  approaching  the  nat¬ 
uralness  of  face-to-face  conversation  without  tether  or 
encumbrance  by  hand-held  or  body-worn  microphones. 
Additionally,  MFA’s  can  be  used  as  a  front-end  device 
for  current  speech  technologies  to  provide  hands-free  op¬ 
eration  and  more  natural  human-machine-interfaces. 
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ABSTRACT 

Basic  researches  in  speech  communications  have  significantly 
progressed  in  the  past  four  decades.  Paradoxically,  the  most  of 
currently  developing  text-to-speech  systems  is  based  on  a  "cut- 
and-paste”  waveform  concatenation  method.  Its  main 
advantage  seems  to  come  from  the  fact  that  detail  knowledge 
about  speech  is  not  necessary.  The  "dumb"  but  efficient 
engineering  approach  is  actually  taking  over  a  more 
scientifically  sound  rule-based  synthesis  in  the  development 
and  application  of  a  high  quality  text-to-speech  system.  We 
overview  these  two  different  approaches  and  then  try  to 
demonstrate  the  feasibility  of  a  vocal-tract  synthesis  method 
which  is  nodiing  but  a  retake  of  an  old  speech  synthesis 
method  called  "vocal-tract  analog".  Some  examples  of  vocal- 
tract  synthesis  of  V|CV2  (vowel-consonant-vowel)  utterances 
using  an  extremely  simple  phoneme  concatenation  by 
interpolation  will  be  demonstrated. 

1.  INTRODUCTION:  TEXT-TO-SPEECH  SYSTEM 
Since  the  number  of  different  sentences  a  talker  produces  is 
practically  infinite,  any  text-to-speech  conversion  system  must 
produce  speech  concatenating  a  finite  number  of  some 
segmental  units.  Sounds  of  world  languages  can  be  transcribed 
by  only  about  100  phonemes  and  30  diacritics.  English  sounds 
can  be  transcribed  by  about  40  phonemes.  The  phoneme, 
therefore,  is  a  phonetically  well  motivated  unit.  However,  all 
efforts  to  directly  concatenate  phoneme-size  segments  of 
speech  have  badly  failed.  This  failure  was  explained  by  the  fact 
that  the  acoustic  manifestation  of  phonemes  highly  depends  on 
the  segmental  context,  i.e.,  the  well-known  coarticulatory 
effects.  The  high  context-dependent  variability  of  phonemes 
motivated  two  different  approaches  for  text-to-speech  systems, 
(1)  rule-based  systems  in  which  the  coarticulatory  effects  are 
handled  by  a  set  of  heuristic  rules  and  (2)  concatenation 
systems  using  a  unit  larger  than  the  phoneme  so  that  the  units 
are  concatenated  at  point  where  the  coarticulatory  effects  are 
relatively  small,  for  example  at  the  middle  of  a  vowel.  We 
shall  argue,  in  contrary  to  the  "common"  sense,  that  a  straight 
forward  phoneme  concatenation  might  be  possible  if 
concatenations  are  done  at  the  level  of  the  vocal  tract,  instead 
of  at  the  acoustic  level. 

The  concatenation  of  segments  is  only  one  of  components 
in  text-to-speech  systems.  A  correct  assigiunent  of  prosodic 
patterns,  such  as  voice  fundamental  frequency  (FO),  segmental 
duration  and  pauses,  is  as  important  as  segment  concatenation 
to  build  a  high  quality  system.  Moreover,  the  phoneme  is  a 


phonetically  well  motivated  unit,  yet  describing  sentences  by 
phonemes  is  hardly  practical.  Systems,  then,  need  a  grapheme- 
to-phoneme  converter  in  real  applications.  In  this  paper 
however,  we  describe  only  issues  in  segment  concatenation, 
because  it  characterizes  best  the  difference  in  the  state  of  the 
art  speech  synthesis  systems.  A  more  general,  comprehensive, 
and  historical  review  on  text-to-speech  systems  is  given  by 
Dennis  Klatt  [1]. 


Figure  1 .  Specification  of  formant  transitions  by  targets 
and  rules  formulated  by  Holmes  [2]  (after  [1]). 


2.  RULE-BASED  FORMANT  SYNTHESIS 
In  a  rule-based  system,  formant  transitions  observed  in  natural 
speech  are  simplified  and  stylized  and  then  specified  by 
heuristic  rules.  Let  us  describe  how  the  formant  transitions  of 
"God",  as  an  example,  are  determined  by  rules  due  to  Holmes 
et  al.  [2]  (described  in  [1]).  Probably  the  most  perceptually 
important  formant  transitions  occur  just  after  the  vowel  onset 
following  a  stop  consonant  and  just  before  the  offset  preceding 
a  stop.  These  transitions  are  important  because  they  are  one  of 
cues  signalling  place  of  articulation  and  thus  the  identity  of  the 
consonants.  First,  the  target  values  are  read  from  a  table:  The 
target  values  of  the  first  three  formants  of  /a/  are  shown  in 
horizontal  dotted  lines  in  Figure  1.  The  targets  for  the  stops  /g/ 
and  /d/  are  indicated  by  the  dots  with  dashed  lines,  which  are 
called  the  "loci".  Second,  using  these  targets  in  combination  of 
rules,  the  formant  transitions  indicated  by  the  thick  lines  are 
determined.  For  example,  the  value  of  F2  onset  frequency  is 
determined  by. 
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^^onset  ^(^^target  ’  ^^locus)  ^^locus  > 

which  is  a  locus  equation.  In  the  case  of  Holmes'  synthesis,  a 
constant  k  =  0.5  was  used.  The  value  of  k  can  be  optimized  to 
fit  better  with  observed  data.  The  specification  of  the 
transitions  by  loci  is  called  locus  theory  [3].  This  theory  states 
that  loci  (the  dots)  are  the  points  A^ere  the  vowel  formants 
emerge  and  converge,  if  the  voicing  were  extended  as  indicated 
by  the  dashed  lines,  and  they  have  fixed  values  for  a  given  stop 
independently  of  the  vowel  context.  However,  a  significant 
improvement  in  the  intelligibility  of  stops  is  gained  by  the 
context  dependent  loci.  Klatt  [4]  proposed  the  locus  values 
varying  depending  on  three  vowel  categories,  front  vowels, 
rounded  vowels,  and  else. 

Although  it  is  possible  to  achieve  a  highly  intelligible 
speech  by  concatenating  phonemes  with  this  kind  of  basic 
rules,  a  number  of  additional  rules  to  handle  phonetic  details 
such  as  vowel  reduction,  vowel  nasalization  due  to  the 
coarticulatory  effect  from  the  adjacent  nasal  consonant,  and  so 
on,  and  detailed  improvements  in  the  excitation  sources  [5]  are 
necessary  to  obtain  human  like  speech.  The  formulation  of 
concatenation  rules  requires  knowledge  and  experience  in 
acoustics  and  perception  of  speech.  Often  an  improvement  is 
gained  after  many  tedious  try-and-error  experimentation.  In 
short,  concatenation  of  phonemes  in  formant  synthesis  is 
difficult  to  justify,  unless  there  is  another  motivation  or 
objective,  beside  the  construction  of  a  high  quality  system.  The 
motivation  was  often  to  gain  deeper  understanding  about  which 
acoustic  cues  are  important  to  distinguish  a  phoneme  from 
another  in  speech.  This  is  probably  the  main  reason  why  rule- 
based  phoneme  concatenation  systems  using  a  formant 
synthesizer  have  been  and  are  developed  at  highly  regarded 
academic  institutes,  such  as  Haskins  Laboratories  in  the  earlier 
time,  the  MTT  speech  group,  and  speech  communication 
department  at  Royal  Institute  of  Technology  (Stockholm),  to 
mention  a  few. 

3.  WAVEFORM  CONCATENATION:  PSOLA 
An  engineering  oriented  method  is  to  concatenate  units  larger 
than  the  size  of  phonemes,  such  as  diphones  to  avoid  the 
problem  of  the  context  variability  altogether.  The  basic  idea  is 
that  the  transition  part  is  extracted  from  speech  continuum  by 
cutting  at  the  stable  parts,  for  example  at  the  middle  of  a  vowel 
and  stored  in  the  inventory.  The  concatenation,  therefore,  can 
be  done  at  the  relatively  stable  edge  of  the  transition.  The 
smallest  such  unit  is  the  diphone  (or  dyad),  which  is  a  segment 
corresponding  to  the  middle  part  of  a  phoneme  to  that  of  the 
following  phoneme.  For  example,  "Paris"  is  synthesized  by 
concatenating  six  diphones  as  <#p><pa><ar><ri><is><s#>, 
where  #  denotes  silence  occurring  at  word  boundaries.  In 
certain  cases,  it  is  difficult  to  find  a  stable  portion  in  a 
phoneme  especially  in  a  consonant,  like  /r/.  Then  we  may 


employ  even  longer  unit  and  concatenation  takes  place  only  at 
the  stable  part  of  vowels:  such  a  unit  is  called  the  "syllable 
dyad"  or  "triphone".  Then  "Paris"  can  be  obtained  by 
concatenating  three  triphones,  <#pa><ari><is#>.  Increasing  the 
size  of  concatenation  segments  might  improve  the  quality  of 
synthetic  speech,  the  number  of  necessary  segments  would 
increase  dramatically  however.  The  number  of  phonemes,  of 
diphones,  and  of  triphones  in  English  is,  respectively,  about 
40,  1600,  and  64000.  A  good  compromise  calls  for  mix  units 
having  different  length.  Using  a  relatively  large  speech 
database,  necessary  segments  with  different  length  are 
automatically  extracted  with  a  criterion  for  goodness  of 
concatenation  [6  and  7],  thus  keeping  the  niunber  of  segments 
manageable,  say,  in  the  order  of  1000. 

Earlier  attempts  in  this  kind  of  cut-and-paste  synthesis  did 
not  always  give  satisfactory  results  however.  A  possible  reason 
was  that  the  parametric  representation  of  speech  segments  did 
not  have  sufficient  quality.  The  LPC  encoded  speech,  for 
example,  were  stored  in  the  inventory.  For  the  prosodic 
control,  a  simplified  excitation  signal,  consisting  of  pulse  train 
for  voiced  soimds  and  white  noise  for  voiceless  consonants 
were  modified  to  control  prosody,  thus  degrading  the  quality.  It 
is  well  known  that  LPC  encoded  speech  can  have  a  high 
quality  only  when  the  LPC  filter  is  excited  with  the  residual 
signal.  An  improvement  was  reported  by  using  the  multi-pulse 
LPC  coding  [7]. 

If  one  could  modify  pitch  and  duration  of  speech  signals  by 
manipulating  the  waveform,  then  extracted  speech  chunks  are 
directly  used  as  concatenable  segments.  If  the  modifications  are 
perfect,  we  should  be  able  to  obtain  concatenated  speech 
whose  quality  is  as  high  as  the  original  human  speech,  at  least, 
for  the  segmental  characteristics. 

One  of  such  prosodic  modification  techniques  is  a  non- 
parametiric  Pitch-Synchronous  Overlap-Add  (PSOLA).  The 
development  of  PSOLA  is  traced  back  to  the  phase  vocoder  as 
a  short-time  Fourier  analysis/synthesis  scheme,  which  was  to 
uniformly  modify  time-scale  (speaking  faster  or  slower  with  a 
constant  rate)  [8]  or  pitch-scale  (speaking  with  a  higher  pitch 
or  lower  pitch)  [9].  PSOLA  is  intended  for  non-uniform  time¬ 
scaling  (duration)  and  pitch-scaling.  The  key  to  the  method  is 
to  use  detected  voice  excitation  points  as  analysis  time 
instants,  thus  it  can  be  used  for  prosodic  control  [10].  The 
method  requires  the  detection  of  "instants"  of  voice  excitation 
points.  This  is  not  really  a  handicap  for  a  text-to-speech 
application,  because  pitch  is  detected  and,  if  necessary, 
corrected  during  the  construction  of  the  waveform  segment 
inventory.  In  synthesis,  the  synthesis  time-instants  are 
determined  such  that  they  exhibit  the  desired  FO  and  duration 
patterns. 

How  PSOLA  modifies  pitch  is  illustrated  in  Figure  2.  A 
short  time  window  is  applied  to  the  original  speech  waveform 
at  every  analysis  time-instants  (pitch  periods).  Then  synthesis 
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is  done  by  simply  placing  these  windowed  waveforms  (let  us 
call  it  "wavelet")  at  desired  synthesis  time-instants  as  shown  in 
Figure  2.  Then,  the  wavelets  are  added  together  to  obtained 
pitch  modified  signal.  The  modification  of  duration  is  done  by 
duplicating  one  or  more  wavelets  to  increase  duration  or  by 
eliminating  (discarding)  one  or  more  wavelets  to  shorten  the 
duration.  It  may  be  noticed  that  this  method  operates  strictly  in 
the  time  domain  and  thus  called  TD-PSOLA.  Why  such  a 
simple  method  works  and  how  to  determine  the  synthesis  time- 
instants  to  derive  desired  prosodic  patterns  are  explained  in 
[10].  The  pitch  modification  range  of  TD-PSOLA  is  of  from  0.5 
to  2,  which  is  largely  sufficient  for  the  text-to-speech  purpose, 
since  the  normal  pitch  range  of  a  speaker  is  less  than  one 
octave. 


Figure  2.  Pitch  modification  with  TD-PSOLA  (after  [10]). 


PSOLA  technique  significantly  improved  the  quality  of 
text-to-speech  system.  This  is  surprising  from  the  scientific 
viewpoint,  since  the  concatenation  of  segments  from  different 
utterance  would  destroy  a  physical  coherence  the  stream  of 
speech  signal  at  every  concatenation  points.  The  quality  in 
general  is  comparable  to  commercially  available  top-of-line 
rule-based  systems.  However,  Klatt  [1]  stated  "...  the  two  types 
of  systems  seem  to  have  a  different  set  of  perceived 
deficiencies  in  naturalness",  which  we  think  is  still  valid. 
Perhaps,  a  disadvantage  of  waveform  concatenation  approach 
is  lack  of  flexibility,  in  contrary  to  the  rule-based  system.  The 
waveform  segment  inventory  is  build  from  speech  of  a 
particular  speaker.  If  we  want  a  different  voice,  we  have  to 
build  a  new  waveform  inventory  from  a  new  speaker.  Although 
some  attempts  to  overcome  this  problem  were  earned  out,  it 
seems  so  far  that  an  effective  method  is  yet  to  be  formulated 
however  (e.g.  [11]). 

4,  VOCAL-TRACT  SYNTHESI  BY  PHONEME 
CONCATENATION 

Speech  sounds  are  produced  either  by  the  vibration  of  the  vocal 
folds  for  voiced  source  or  by  the  turbulence  of  airflow  at  the 
downstream  of  the  constriction  for  noise  source.  These  sound 


sources  undergo  spectral  modification  by  the  resonance 
characteristics  of  the  vocal  tract.  Since,  the  human  vocal-tract 
is  a  narrow  tube,  the  major  mode  of  resonance  occurs  along  its 
length  from  the  glottis  to  the  lips  (and/or  to  the  nostrils  in  the 
case  of  nasal  sounds).  The  generation  and  propagation  inside 
the  tract,  therefore,  can  be  simulated  by  an  acoustic 
transmission  line  [12,  13,  14].  Values  of  the  acoustic  elements 
in  the  transmission  line  is  calculated  from  the  area  function 
that  describes  the  variation  of  cross-sectional  areas  along  the 
length  of  the  vocal  tract.  The  simulator  can  be  used  for  speech 
synthesis,  although  its  computational  load  is  much  higher  than 
acoustic  synthesizers  such  as  a  formant  synthesizer.  This  kind 
of  synthesizers  was  called  "vocal-tract  analog",  whereas  the 
formant  synthesizer  "terminal  analog"  [12],  because  it  specifies 
the  input-output  (source-radiated  soimd)  relations  which  is 
actually  the  transfer  function. 

In  a  study  of  the  acoustics  of  Arabic  fricatives  [15],  the 
vocal-tract  synthesizer  [16]  was  used.  In  order  to  judge  the 
perceptual  adequacy  of  synthesized  fricatives,  it  is  necessary  to 
set  them  in  vowel  contexts.  The  most  technically  simple  way  is 
to  concatenate  vowel-consonant-vowel  (VjCV2)  by  target 
interpolation  without  pay  any  attention  to  the  coarticulation 
phenomenon. 


Figure  3.  Interpolation  of  the  k-th  area  of  a  vocal-tract  area 
function  in  VjCV2.  The  circles  indicate  targets. 

In  our  synthesis,  each  phoneme  is  specified  by  a  fixed  area 
fimetion  as  a  target.  The  area  function  is  specified  by  a  fixed 
number  of  sections  in  which  the  k-th  section  is  defined  by  the 
cross-sectional  area,  A(k,  n),  and  the  length,  x(k,  n).  The  index 
n  denotes  a  discrete  time.  Obviously,  a  transition  period  from 
one  phoneme  to  another  is  needed  so  that  the  area  function 
varies  smoothly  between  phonemes.  The  interpolation  of  A  and 
x  is  done  section-by-section  independently.  The  interpolation 
of  A(k,  n)  is  schematically  illustrated  in  Figure  3.  The 
stationary  part  in  each  phoneme  is  specified  by  the  same  target 
value  at  the  onset  and  offset.  The  area  (and  length)  between 
the  offset  of  a  phoneme  and  the  onset  of  the  following  phoneme 
is  interpolated  by  a  cosine  function.  Since  the  k-th  section  in 
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this  example  corresponds  to  the  constriction  section  during  the 
thcative  consonant  /s/,  the  area  becomes  close  to  zero  (actually 
0.2  cm^).  This  is  all  we  need  to  concatenate  a  V1CV2 
sequence. 

It  may  be  integected  here  that  the  target  area  fimction  for 
/s/  is  a  uniform  tube  (corresponding  to  a  neutral  vowel)  having 
a  constriction  with  one  section  long  at  an  appropriate  place  for 
the  consonant.  More  importantly  perhaps,  the  advantage  of 
concatenation  in  the  area-function  domain  is  that  both  targets 
for  consonants  and  those  for  vowels  are  specified  by  the  area 
functions  having  only  different  values,  thus  the  interpolation 
can  be  explicitly  defined.  In  the  acoustic  domain  however,  oral 
vowels  are  characterized  by  only  poles  of  the  transfer  function, 
whereas  consonants  requires  poles  and  zeros  [12,  14].  The 
interpolation  in  the  acoustic  domain,  thus,  would  become 
complicated,  if  one  wishes  to  interpolate  the  transition  from  a 
consonant  to  a  vowel  accurately. 

The  interpolated  area  function  is  fed  into  the  vocal-tract 
synthesizer.  The  glottal  area  variation,  parameterized  glottal 
pulse  shape  representing  the  quasi-periodic  vocal-fold 
oscillation  for  voicing  and  a  slow  opening/closing  of  the  glottis 
during  consonant  to  supply  a  sufficient  airflow,  is  also 
calculated  by  the  target  interpolation  scheme,  and  then  fed  into 
the  synthesizer.  The  frication  noise  is  automatically  generated 
in  the  synthesizer.  Pink  noise,  actually  a  sequence  of  low-pass 
filtered  random  numbers,  is  injected  at  one  section  downstream 
from  the  constriction  section.  The  amplitude  of  the  noise  is 
modulated  by  a  function  of  the  cross  sectional  area  of  the 
constriction  and  the  simulated  airflow  using  either  the  square 
law  [14]  or  cubic  law  [17]. 

Using  the  concatenation  scheme  described,  various 
fricatives,  such  as  /f,  s,  J,  and  x/  and  unvoiced  stops,  such  as 
/p,  t,  and  k/  with  the  different  vowel  contexts  were  synthsized. 
We  did  not  perform  any  formal  perceptual  evaluation,  but  in 
casual  listening,  they  sounded  surprisingly  natural  and  their 
intelligibility  seems  to  be  quite  high.  It  is  somewhat  puzzling 
that  the  area-function  interpolation  in  the  earlier  attempt  by 
Kelly  and  Lochbaum  [18]  (described  in  [1])  resulted  in  a  poor 
speech  quality,  which  could  not  compete  with  a  formant  rule- 
based  system  in  the  same  period.  We  suspect  that  the  quality 
we  obtained  is  primarily  due  to  our  vocal-tract  synthesizer  that 
simulates  the  vocal-tract  acoustics  more  accurately  than  the 
Kelly-Lochbaum  model. 

5.  CONCLUDING  REMARKS 
Although  our  experiment  is  preliminary  concerning  only 
isolated  syllables  with  a  limited  number  of  consonants,  it  is 
tempted  to  state  that  the  straight  forward  concatenation  of 
phonemes  might  be  adequate  to  obtain  a  high  quality  speech.  If 
this  is  the  case,  it  may  possible  to  construct  a  text-to-speech 
synthesizer  with  some  flexibility.  For  example,  it  is  a  simple 


matter  to  convert  a  male  speech  to  a  female  speech,  and  vice 
versa,  with  the  vocal-tract  synthesis. 
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Sigfrid  D.  Soli 
House  Ear  Institute 
Los  Angeles,  California 

ABSTRACT^ 


For  a  hearing  aid  user  to  perforin  binaural  directional  hearing  in  noisy  environ¬ 
ments,  it  is  important  to  maintain  at  audible  levels  the  binaural  cues  (interaural  time 
and  level  differences)  that  would  be  present  if  the  hearing  aid  were  not  in  place.  We 
have  developed  a  wearable  prototype  digital  hearing  aid  and  personal  computer  (PC) 
based  methods  of  digital  filter  design  for  evaluation  of  the  effectiveness  of  this  method. 
The  body-worn  prototype  processor  utilizes  a  Motorola  56000-series  DSP  chip  to  im¬ 
plement  the  filtering  and  amplification  for  both  ears  via  cabled  ITE  ear  modules.  Up 
to  four  different  user-selectable  algorithms  can  be  programmed  in  the  processor. 


Binaural  fitting  of  the  processor  is  accomplished  in  two  steps:  Hearing  Aid  Equal¬ 
ization  (HAE)  and  Hearing  Loss  Compensation  (HLC).  HAE  is  achieved  with  an  FIR 
filter  which  equalizes  the  amplitude  and  phase  insertion  effects  of  the  ear  modules  and 
maintains  the  binaural  cues  with  the  modules  in  place.  The  HAE  filter  coefficients 
for  each  ear  are  obtained  from  in  situ  probe  tube  measurements  of  aided  and  unaided 
test  signals  using  optimal  filter  calculations  performed  by  the  PC.  HLC  for  each  ear 
is  also  achieved  with  an  FIR  filter  and  associated  gain.  HLC  filter  coefficients  are  ob¬ 
tained  with  the  PC  using  a  weighted  least-squares  filter  design  technique.  The  target 
response  for  each  HLC  filter  is  determined  from  measures  of  auditory  thresholds  and 
soundfield  reference  signals,  and  is  based  on  an  adaptation  of  Articulation  Theory  for 
hearing  aids.  The  HAE  and  HLC  filters  are  convolved  to  produce  a  single  filter.  A  de¬ 
tailed  description  of  the  processor,  fitting  algorithms,  methods  and  accuracy  of  fittings, 
as  determined  from  in  situ  probe  tube  measurements,  and  the  hardware  and  software 
comprising  the  binaural  hearing  aid  will  be  given.  Results  from  field  trials  with  the 
portable  processors  will  also  be  reported. 


^  Paper  will  be  avaihble  at  the  time  of  the  conference. 
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ABSTRACT 

In  this  paper  a  new  type  of  electromagnetic  field 
signals  is  considered.  The  signals  have  discretely 
modulated  field  stntetures  (topologies).  For  their 
processing  new  logical  passive  circuits  are 
proposed  and  results  of  numerical  modelling  are 
considered. 


1.  SIGNALS  AND  SWITCHES 


h 


■#  2 


H 


Fundamental  characteristics  of  signal  processing 
devices  are  defined  by  physics  of  nonlinear 
switching  phenomena  and  integrated  circuits  (ICs) 
technologies.  Changing  the  physical  effects  it  is 
possible  to  improve  dissipative  and  time 
characteristics  of  digital  circuits. 

One  of  the  approaches  of  electromagnetic  field 
signal  processing  consists  in  writing  digital 
information  into  a  spatial  structure  (topology)  of 
fields.  The  characteristic  was  named  as 
topological  chart  of  the  electromagnetic  field  and 
it  is  an  ordered  set  of  separatriccs  and  equilibrium 
points  of  field  force  lines.  The  topological  chart 
can  be  changed  disetretely  only.  A  particular  case 
of  the  signal  is  modes  of  coupled  strip 
transmission  lines,  which  have  different  spatial 
structures  of  the  electromagnetic  fields  [1]. 
According  to  the  results  of  topological  methods  of 
electromagnetic  fields  design  [2]  a  spatial 
switching  is  rcalisacd  with  passive  circuits,  which 
have  nonlinear  characteristics  relative  to  the  input 
field  topologies. 

Some  experimental  and  theoretical  results  have 
been  considered  in  [1],  which  touch  upon 
microwave  circuits  on  the  base  of  coupled  strip 
and  slot  transmission  lines.  For  super  high-speed 
applications  rectanguler  time  dependent  impulses 
are  preferable.  In  this  area  some  new  results  have 
h&tn  obtained  with  the  help  of  a  switch  which  is 
shown  in  Fig.  1. 

An  input  signal  is  a  series  of  impulses  of  even 


Fig,  1.  Spatial  switch  for  topologically  modulated 


Fig.  Z  Truth-table  of  the  spatial  switch. 


(logical  variable  ‘T)  and  odd  (logical  variable  '0‘) 
modes  in  coupled  strip  transmission  lines  1 
(Fig.  1).  The  modes  have  different  topologies  of 
electromagnetic  fields,  equal  phase  and  group 
velocities.  The  signals  are  switched  at  different 
outputs  2  and  3,  according  to  truth-table  (Fig.  2). 
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The  3witch  is  a  device  of  triple  reversible  logic.  Its 
switching  time  depends  on  linear  dimensions  and 
values  of  parasitic  reactivities  of  the  circuits. 


2.  TRANSIENT  ANALYSIS 

Simulation  of  transient  processes  with  the  help  of 
equivalent  circuits  has  shown  switch  delay  time 
t=0. 1-0.3  ps  for  linear  dimensions  of  the  switch 
about  4  pm  (Fig.  3  and  4),  if  input  signal  is  a 
rectangular  time  dependent  impulse  of  even  or  odd 
modes  with  its  duration  At=l  ps  and  its  amplitude 
U=1  V.  Power  losses  P  are  defined  by  resistors 
R=20  Ohm  and  the  ones  are  less  than  20  mW  for 
input  current  1=0. 1-1  mA  and  do  not  depend  on 
semiconductor  physics.  It  is  possible  to  reduce  P  to 
a  minimum,  if  to  drop  the  resitors.  In  this  case  the 
losses  are  equal  approximately  to  0.1  mW  for 
aluminia  conductors  which  have  a  volume 
V=100  pm^.  The  switching  effect  does  not  depend 
on  semiconductor  physics,  due  to  that  it  is  possible 
to  overcome  a  part  of  fundamental  limitations, 
which  are  typical  for  transistors. 

Exjxjrimental  studies  of  the  switch  have  shown  the 
outputs  2  and  3  isolation  about  3S-40dB  in  the 
area  of  microwave  frequencies.  The  switch  was 
produs«l  with  MIC  technology  on  AI2O3- 
substrate. 

On  the  base  of  the  switch  two  types  of  logical 
circuits  “AND",  “OR".  "NOT"  are  proposed.  The 
first  is  based  on  interference  effect  [1,  2,  3],  the 
second  is  hybrid  circuits  with  semiconductor 
elements  [3]. 


Fig.  4.  Output  signal  on  the  port  3. 

3.  CONCLUSIONS 

In  this  paper  a  new  picosecond  switch  has  been 
considered  for  electromagnetic  signals  which  field 
structures  are  modulated  discretely.  The  spatial 
switching  is  realised  by  a  passive  circuit.  Typical 
time  delay  of  the  switch  is  equal  to  0. 1-0.3  ps 
and  power  losses  are  less  then  20  mW.  A  field  of 
applications  of  the  new  approach  to  field 
information  processing  is  super  high-speed 
circuits. 
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ABSTRACT 

Due  to  the  increasing  interest  in  extending  the 
frequency  limits  of  silicon  devices  for 
telecommunication  applications,  we  have 
analyzed  the  performance  of  different  series  of 
advanced  polysilicon  bipolar  transistors. 

The  devices  were  grouped  according  to  their 
emitter  finger  number  and  were  characterized  over 
the  2-6  GHz  frequency  range  in  terms  of  noise, 
gain  and  scattering  parameters  at  different  bias 
conditions. 

A  bias-dependent  model  extraction  including 
noise  sources  has  been  subsequently  performed 
for  each  series  and  a  comparative  analysis  has 
been  carried  out.  The  results  of  this  analysis 
point  out  distinguishing  features  of  this  advanced 
bipolar  process  which  allows  for  realizing  highly 
competitive  bipolar  devices  for  use  in 
telecommunication  equipment  at  low  microwave 
frequencies. 

1.  INTRODUCTION 

In  portable  communication  systems  operating 
over  the  low  microwave  range,  devices  and 
circuits  must  be  designed  to  meet  their  best 
performance  at  very  low  current  and  voltage 
levels.  The  industry's  trend  is  for  developing 
advanced  silicon  bipolar  processes  to  realize 
lower  cost  components,  size  and  performance 
competitive  with  solution  using  GaAs.  Among 
the  most  advanced  bipolar  processes  for  the 
realization  of  high  speed  silicon  transistors, 
double-polysilicon  self-aligned  (PSA)  schemes 
have  definitively  emerged  as  the  most  effective 
technique. 

The  adoption  of  polycristalline  silicon  layers  upon 
the  base  and  the  emitter  single  crystal  zones 
allows  for  a  remarkable  increase  of  the  common 
emitter  current  gain  B  and  a  smaller  dependence 
of  B  itself  on  temperature  than  conventional 
BJT's.  In  addition,  the  self-alignment  of  the 


extrinsic  base  and  the  emitter  results  in  a 
substantial  reduction  of  the  base  resistance 
and  the  collector-base  capacitance  thus  allowing 
for  an  appreciable  improvement  in  the  gain- 
bandwidth  product,  maximum  oscillation 
frequency,  minimum  noise  figure,  power 
dissipation  and  component  integration  level  in 
VLSI  application  [1]. 

To  the  aim  of  exploring  the  features  and 
performance  of  a  commercially  competitive  PSA 
bipolar  process,  the  manufacturer  ( CoRiMMe,  an 
SGS-Thomson  Research  Center,  Catania,  Italy) 
has  supplied  us  with  several  devices  characterized 
by  different  emitter  configurations.  We  had 
previously  performed  a  complete  analysis  on  a 
preliminary  PSA  device  series  from  the  same 
manufacturer  aimed  at  the  extraction  of  an 
accurate  circuit  model  including  noise  sources 
[2]. 

We  have  recently  characterized  a  total  of  29  PSA 
transistors  supporting  5  different  chip  topologies 
to  gain  a  better  insight  on  those  effects  related  to 
emitter  geometry  which  affect  device  operation. 
The  features  of  the  measured  families  (named  Q  ) 
are  the  following: 

-  Q2  4  emitters,  5  base  contacts,  8  pim  emitter 
length,  with  4.6  jim  pitch 

-  Q3  8  emitters,  9  base  contacts,  8  /<m  emitter 
length,  with  4.6  pim  pitch 

-Q4  16  emitters,  17  base  contacts,  8 

emitter  length 

-Qfi  same  as  Q2,  with  3.8  ]4.m  pitch 
-Q7  same  as  Q3,  with  3.8 /^m  pitch 
The  emitter  finger  pitch  may  be  a  critical 
parameter  in  improving  the  RF  performance  of 
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BIT'S  since  the  transition  frequency  fr  can  be 
increased  by  reducing  its  size.  We  performed  the 
characterization  and  the  model  extraction  of  the 
transistor  series  in  terms  of  noise,  gain  and 
scattering  parameters  over  the  2-r6  GHz 
frequency  range  at  the  bias  conditions  suggested 
by  the  manufacturer  which  consisted  of  the  fixed 
low  voltage  of  2.8  V  and  two  Ic  current  values 
proportional  to  the  emitter  finger  number  Ue. 

As  a  general  result,  Fq  takes  on  very  low  values 
as  compared  to  conventional  bipolar  transistors: 
less  than  1  dB  at  2  GHz  for  Q3  and  around  2  dB 
up  to  5  GHz  for  all  families. 

The  noise  performance  at  the  lowest  frequencies 
improves  proportionally  to  He  (which  is  expected 
since  the  base  resistance  decreases  with 
increasing  finger  number)  and  deteriorates  with 
increasing  bias  current.  The  maximum  available 
gain  is  less  affected  by  either  He  and  the  bias 
current  level  with  typical  values  in  the  range  of  1 1 
to  14  dB.  Also,  this  bipolar  process  has  exhibited 
an  average  fp  around  10  GHz  and  no  performance 
variations  have  been  noticed  between  the  device 
series  having  different  emitter  ptch. 

2.  DEVICE  CHARACTERIZATION 

The  PSA  devices  were  encapsulated  in  a  standard 
70-mil  microstrip  package  which  fits  our 
measuring  text  fixture  (Maury  Microwave,  mod. 
MT950)  equipped  with  proper  insert  calibration 
elements  (planar  package-size  devices  supporting 
ihe  short  andt/irw  layouts). 

In  our  measurements,  we  refer  to  the 
representation  in  terms  of  the  noise  parameters 
which  appear  in  the  following  relationship 

IFs-rol  2 

F(ra)  =  Fo+  4  Fn  - 

11  +  ro  I  2(1-  lrsl2) 

where  F  and  Fs  are  the  noise  figure  and  the  input 
termination  reflection  coefficient  of  the  device 
under  test,  respectively,  and  the  four  noise 

parameters  areFo  (minimum  noise  f  igure),  Fq 

(value  of  the  complex  variable  Fs  at  which  the 
minimum  is  located)  and  rn  (equivalent  noise 
resistance  normalized  with  respect  to  50  Q).  A 
similar  relationship  holds  for  the  recipro^l  of  the 
gain  parameters  which  defines  the  conditions  for 
the  maximum  value  of  the  available  gain  (Gaopt) 

vs.  Fs. 

By  means  of  an  automatic  system  whose 
measuring  procedure  has  been  developed  in  our 


Lab,  we  perform  measurements  of  the  noise 
figure  at  the  system  output  for  either  some 

properly  selected  values  of  Fs  and  ,  at  each  Fs 
value,  for  different  values  of  system  losses  as 
introduced  by  a  high  repeatability  step  attenuator. 
From  these  noise  data  we  derive  both  the  noise 
and  the  gain  parameter  sets  of  the  DUT  by  an 
accurate  deembedding  of  the  various  stage 
contributions  and  by  applying  appropriate  data 
processing  techniques.  The  scattering  parameters 
are  then  calculated  by  use  of  well-known 
relationships  employing  the  gain  parameters  and, 
therefore,  are  not  measured  directly  by  a  network 
analyzer  (see  [3]  and  references  therein). 

The  transistor  series  were  tested  at  the  following 
bias  conditions: 

-Q2  @  Vce  =  2.8  V,  lc=2and8mA; 

-Q3  @Vce=2.8V,Ic=4  andl6mA; 

-Q4  @Vce  =  2.8V,  Ic  =  16  and64mA; 

-Qfi  same  as  Q2; 

-07  same  as  (^. 

Since  all  the  measured  data  for  each  group 
exhibited  reduced  spread,  a  Qi  typical  device 
(bold  line  in  the  reported  diagrams)  to  be 
employed  during  the  modeling  step  has  been 
determined  by  simple  statistic  functions  for 
representing  each  transistor  series.  As  an 
example,  we  report  in  Fig.  1  the  four  noise 
parameters  of  the  Q3  series  at  the  above  given 
bias  conditions. 

3.  CIRCUIT  MODEL  EXTRACTION 
WITH  NOISE  SOURCES 

The  transistor  chip  model  is  a  standard  hybrid-jc 
structure  where  the  distributed  base  resistance  and 
capacitance  between  intrinsic  base  and  collector 
region  have  been  introduced  for  an  accurate 
representation  of  the  transistor  performance  at 
microwave  frequencies. 

By  deembedding  the  effects  of  the  package  from 
the  measured  parameter  values,  we  obtained 
scattering  and  noise  parameter  sets  which  refer 
directly  to  the  chip  device  performance.  The 
optimization  procedure  has  then  been  applied  to 
the  chip  model  network  to  determine  the  circuit 
element  values  as  well  as  the  noise  sources  by 
minimizing  the  scattering  and  the  noise  parameter 
error  functions.  The  noise  generators  are 
represented  by  two  correlated  noise  current 
sources  at  the  input  and  the  output  port  plus  the 
thermal  contribution  arising  from  each  of  the 
physical  resistors.  The  noisy  model  structure  is 
shown  in  Fig.  2;  the  relevant  element  values, 
including  noise  generators,  are  reported  in  Tab.  1. 
The  model  effectiveness  has  been  verified  by 
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deriving  the  element  value  variations  at  the 
different  bias  conditions  for  each  series. 

All  the  model  elements  exhibit  value  trends  in 
very  good  accordance  with  the  physics-based 
predictions.  In  addition,  numerical  evaluation  of 
Fo  and  rn  by  use  of  well-assessed  expressions 
has  matched  quite  satisfactorily  the  measured 
values  showing  how  the  noise  resistance  is 
heavily  affected  by  Ue  as  reported  in  Fig.3  at  low 
current  bias  values,  [4]. 

4.  COMMENTS  ON  THE  DEVICE 
PERFORMANCE 

Some  interesting  remarks  can  be  made  with 
reference  to  the  measured  noise  parameters: 

-  Fq  increases  and  IFol  decreases  with  this  four¬ 
fold  increment  of  the  bias  current,  whereas 

Rn  (Q)  =50  rn  and  /Fq  are  affected  by  Ic  to  a 
very  reduced  extent; 

-  Fo,  Rn  and  IFol  decrease  markedly  with 
increasing  Be  ;  therefore,  the  Q2  series  exhibit 
values  of  the  noise  figure  F50  measured  in  input 

matched  conditions  (i.e.,  @  Fs  =0)  much  higher 
than  Fo-  The  quantity  AF=  F50  -  Fo  reduces  at 
higher  current  values; 

-  the  slope  of  Fo  vs.  frequency  increases 
proportionally  to  Be  .  As  a  result,  Fo  approaches 
values  around  2  dB  for  all  Qi  families  @  5  GHz 
(lower  bias  current) ; 

-  the  slope  of  Rn  vs.  frequency  vanishes  with 
increasing  b®  ,  thus  becoming  flat  for  higher  n®. 
The  available  gain  Ga  is  less  affected  by  either  n® 
and  the  bias  current  level  than  the  corresponding 
noise  figure  F50. 

By  summarizing  the  best  results,  we  measured 
typical  values  of  Fo  even  lower  than  1  dB 
and  available  gain  values  as  high  as  14  dB  @  2 
GHz. 


4.  CONCLUSIONS 

We  here  present  the  reasults  of  an  extensive 
characterization  activity  performed  on  several 
PSA  bipolar  transistors. 

The  devices  were  grouped  according  to  their 
emitter  finger  number  and  were  tested  over  the 
2-6  GHz  frequency  range  in  terms  of  noise,  gain 
and  scattering  parameters  at  different  bias 
conditions.A  bias-dependent  model  extraction 
including  the  noise  performance  has  been 
subsequently  accomplished  and  a  comparative 
analysis  among  the  measured  families  has  been 
carried  out.  Outstanding  performances  in  terms 
of  minimum  noise  figure  and  available  gain  have 


been  brought  out  for  this  advanced  bipolar 
process  for  telecommunication  applications  up  to 


4-6  GHz. 
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Fig.  1  -  Noise  parameters  Fq,  IFqI,  ITq  and  Rn 
of  the  03  series  at  the  bias  conditions:  a)  Vce= 
2.8  V,  Ic=4  mA;  b)  Vce=  2.8  V,  Ic=  16  mA. 
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Fig.2  -  Circuit  model  including  noise  sources. 
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Rg.  3  -  Values  of  Rn  vs.  frequency  for  the  Q2 
Q3  and  Q4  series  at  lower  bias  current. 
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ABSTRACT 

This  paper  presents  the  recent 
developments  in  the  time  domain 
simulation  of  microwave  circuits.  This 
approach  is  gaining  attention  mainly  due 
to  its  ability  for  modeling  re^stic 
microwave  structures  which  can  include 
active  and  nonlinear  devices.  In 
particular,  an  ovCTview  of  the  Finite- 
Difference  Time-Domain  (FDTD)  method 
with  its  recent  additions  such  as 
Diakoptics  and  System  Identification,  and 
nonlinear  circuit  capabilities  will  be 
presented. 


I.  INTRODUCTION 

Modeling  of  microwave  structure  has 
undergone  a  rapid  growth  in  the  past  decade. 
The  driving  force  for  this  growth  has  been  a 
combination  of  factors  ranging  from  the 
increase  in  the  operating  firequencies  and 
integrating  a  complete  system  on  a  single 
module  where  the  proximity  coupling 
becomes  important,  to  the  need  for  reducing 
the  design  cycle  whae  a  design  is  expected  to 
operate  properly  wifoout  further 
modifications  after  tire  initial  fabrication.  The 
accompanying  increase  in  the  available 
computation  power  has  also  provided  a 
synergy  for  this  growth.  At  this  point  a 
wealth  of  methodologies  are  available  to 
address  various  modeling  needs  for 
microwave  structures.  These  methodologies 
can  be  classified  broadly  as  ficquency  and 
time  domain  methods.  While  the  frequency 
domain  techniques  have  been  under 
development  for  the  past  four  decades,  the 


time  domain  methods  have  recently  been 
gaining  attention  mainly  due  its  ability  for 
modelmg  realistic  microwave  structures 
which  can  include  active  and  nonlinear 
devices.  This  method  is  currently  known  as 
die  Finite-Difference  Time-Domain  (FDTD). 
Reforences  [1]  and  [2]  provide  a  gener^ 
comparison  of  the  numerical  properties  of  the 
EDTO  and  other  popular  numCTical  methods. 
It  is  important  to  note  that  this  algorithm 
provides  both  spatial  and  temporal  interaction 
of  the  electromagnetic  fields  with  the 
noicrowave  stractures.  This  information 
provides  insights  into  microwave  circuit 
opCTation  This  algorithm  has  been 
successfully  used  for  characterizing  the 
performance  of  a  broad  class  of  microwave 
structures  such  as  waveguides,  multi-layer 
structures,  discontinuities,  transitions,  filters, 
couplers  and  antennas.  This  paper  provides 
an  overview  of  this  technique,  and  highlights 
some  of  the  recent  developments. 


n.  FDTD  FORMULATION  AND 
ALGORITHM 

The  FDTD  algorithm  is  based  on  the 
discretiztion  of  the  Maxwell's  equations 
in  time  and  space.  A  number  of 
differencing  schemes  are  available  for 
discretization  of  the  Maxwell's  equations. 
A  numerically  stable  method  is  the  central 
differencing  approach.  Using  a  Taylor 
series  expansion  of  the  field  values,  it  can 
be  shown  that  the  inttoduced  errors  are  of 
second  order  with  respect  to  discretization 
size  in  time  and  space.  These  difference 
equations  are  obtained  directly  from 
Maxwell's  equations.  A  closer  look  at 
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these  equations  show  that  the  E-  and  El- 
fields  are  updated  on  a  dual  grid  which 
are  displaced  in  time  and  space  by  half  of 
the  discretization  length.  In  addition,  the 
integral  form  of  Faraday's  and  ampere's 
laws  are  satisfied  on  the  E  and  H  field 
cells,  respectively.  As  a  cons^uence, 
inhomogeneous  media  can  be  included 
with  minor  modifications  to  the  updating 
algorithm. 

These  equations  are  suitable  for  direct 
implementation  on  a  digital  computer. 
The  algorithm  updates  the  field  values 
throughout  the  ^screte  space  for  each 
time  step.  As  a  consequence  of  this 
approach,  an  iterative  scheme  is 
implemented  which  eliminates  the  need  to 
store  large  matrices  which  results  in  large 
savings  in  computation  requirements.  M 
additional  advantage  of  this  algorithm  is 
its  ability  to  efficiently  utilize  the  parallel 
computer  architecture.  This  is  a 
consequence  of  the  Huygen's  principle 
which  requires  the  knowledge  of  field 
distribution  over  a  surface  for  updating 
the  fields  within  the  volume  enclosed  by 
the  surface.  As  a  consequence,  the 
communication  among  processors  are 
limited  to  exchanging  the  surface 
information,  while  each  processor 
updates  the  fields  within  a  volume. 


m.  INFORMATION  GENERATED 
BY  THE  FDTD  ALGORITHM 

This  method  of  solution  to  the 
electromagnetic  field  equations  provides 
both  temporal  and  spatial  field  values 
throughout  the  computation  domain. 
This  information  can  be  exploited  both 
for  extracting  useful  information 
regarding  the  circuit  operation  such  as  the 
S-parameters  and  resonance  frequencies, 
and  reducing  the  computation  time  and 
memory  requirements.  The  spatial 
distribution  is  used  for  extracting  the 
modes  of  the  structure,  and  proximity 
coupling.  The  temporal  variations  is  used 
for  extracting  the  frequency  response  of 
the  stmcture  for  a  range  of  frequencies. 


Reduction  in  computation  requirements 
are  achieved  by  invoking  a  number  of  the 
Linear  Network  concepts  to  the  FDTD 
algorithm.  The  memory  requirements  is 
reduced  by  representing  portions  of  the 
microwave  structure  by  their  respective 
impulse  response  matrices.  The  impulse 
response  matrix  is  in  effect  the  numerical 
Green's  function  and  is  computed  in 
advance  using  the  FDTD  algorithm  and 
an  impulsive  excitation.  The  details  of 
this  implementation  is  presented  in  [3]. 
This  approach  is  particularly  useful  in 
optimization  process  where  for  portions 
of  the  microwave  stmcture  remain 
unchanged  through  the  iterations  and  can 
be  replaced  by  their  pre-computed 
impulse  response  matrices.  The 
reduction  in  computation  time  is  obtained 
by  applying  the  System  Identification 
meth^s  to  the  temporal  variation  of  the 
field  values  at  the  locations  of  interest. 
Using  this  approach,  an  Auto-Regressive 
Moving  Average  (ARMA)  model  is  used 
to  describe  the  circuit  behavior.  The 
parameters  of  the  ARMA  model  are 
computed  using  the  FDTD  generated  field 
values.  The  ftequency  response  of  the 
system  is  computed  directly  from  the 
ARMA  parameters  thus  eliminating  the 
need  for  the  Fourier  Transformation.  It  is 
important  to  note  that  the  time  interval  for 
computation  of  the  ARMA  model  can  be 
much  smaller  than  the  time  interval 
required  for  the  Fourier  Transformation 
due  to  spectral  resolution  and  truncation 
effects  [4]. 

IV.  APPLICATION  TO  ACTIVE 
DEVICES 

An  important  utility  of  the  FDTD 
algorithm  has  been  its  ability  to 
characterize  microwave  soiictures  which 
include  active  and  nonlinear  devices  [5, 
6].  Many  practical  microwave  circuits  are 
embedded  with  components  which  are 
both  active  and  nonlinear  (e.g., 
semiconductor  devices),  and  re^stic 
modeling  of  these  circuits  requires  the 
inclusion  of  electromagnetic  fields 
interaction  with  the  passive  circuit 
structure  as  well  as  devices.  The  device 
behavior  is  represented  by  its  port 
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characteristics  and  is  included  in  the 
FDTD  algorithm  via  the  Ampere’s  and 
Faraday's  Laws.  Figure  1  shows  the 
layout  of  a  typical  microwave  amplifier. 
The  system  under  consideration  is 
composed  of  distributed  passive 
stmctures  such  as  transmission  line, 
matching  stubs  and  DC  biasing  circuit, 
lumped  passive  devices  such  as 
capacitors  and  resistors,  and  an  active 
device  (GaAs  MESFET).  The  lumped 
equivalent  circuit  small  signal  model  of 
the  active  device  is  shown  in  Figure  2. 
Figure  3  shows  the  field  distribution  a 
plane  under  the  metalization  for  a 
particular  time.  It  is  important  to  note 
that  this  methodology  is  applicable  to 
stmctures  which  contain  multiple  devices 
as  well  as  multi-layer  metalization  and 
dielectric  discontinuities.  This  approach 
has  also  been  applied  to  an  oscillator 
circuit  where  the  large  signal 
characteristics  of  the  active  device  is 
included.  The  stracture  is  consist  of  two 
coupled  patch  antenna.  Each  patch  is 
excited  by  a  Gunn  Diode  which  are 
coupled  strongly  through  a  transmission 
line.  The  FDTD  simulation  of  the  entire 
stmcture  produced  excellent  agreement 
with  the  measurements  [6]. 

V.  SUMMARY 

The  time  domain  simulation  of 
microwave  stmctures  using  the  FDTD 
has  been  successful  for  a  wide  class  of 
microwave  stmctures.  This  algorithm 
provides  the  spatial  and  temporal 
variations  of  the  electromagnetic  fields 
which  can  be  used  to  gain  additional 
insights  into  tire  circuit  operations.  The 
method  also  lends  itself  to  a  number  of 
linear  system  techniques  such  as 
Diakoptics  and  System  Identification 
which  results  in  an  efficient  use  of  the 
computational  resources  for 
electromagnetic  circuit  simulation.  This 
algorithm  has  also  been  extended  to 
include  active  and  nonlinear  device 
models .  which  provides  the  ability  to 
simulate  realistic  microwave  circuits. 
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Power  pad 


Figure  1:  The  layout  of  the  microwave  ampUfier,  which  is  designed  at  6  GHz  with 
9  dB  gain. 


Lgg  Lt  R<  Ctd  Rd  U  Ldd 


Figure  2:  Lumped  equivalent  circuits  of  the  GaAs  MESFET. 


Figure  3:  A  snapshot  of  the  field  distribution  of  Ey  component  beneath  the  air- 
dielectric  interface  during  the  simulation. 
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Abstract 

A  new  numerical  solution  technique  to  electro¬ 
magnetic  (EM)  Held  problems  is  presented  in  this 
paper.  It  is  based  on  Complex-R’equency  Hop- 
ping  (CFH),  which  is  an  expanded  asymptotic 
waveform  evaluation  approach  recently  proposed 
in  the  circuit  simulation  area  with  great  sucess  in 
solving  large  linear  lumped  and  distributed  cir¬ 
cuits.  The  Helmholtz  equations  are  formulated 
into  a  set  of  linear  ordinary  differential  equations 
which  are  solved  by  asymptotic  waveform  evalua¬ 
tion.  The  technique  offers  speed  or  stability  ad¬ 
vantage  over,  e.g.,  the  Finite  Difference  J^’equency 
Domain  (FDFD)  and  the  Finite  Difference  Time 
Domain  (FDTD)  approaches  for  comparable  accu¬ 
racy.  An  example  of  waveguide  frequency-domain 
analysis  is  provided. 


1.  INTRODUCTION 

Modeling  and  simulation  based  on  EM  field  formu¬ 
lations  are  important  for  accurate  analysis  and  design 
of  today’s  high-speed  and  high-frequency  circuits  and 
systems.  Much  research  in  efficient  numerical  tech¬ 
niques  for  EM  problems  has  been  conducted  in  the 
past  [1].  However  the  task  of  numerically  solving  EM 
field  problems  is  generally  computationally  intensive. 
For  example,  in  the  FDTD  approach,  the  computa¬ 
tion  has  to  be  done  at  a  large  number  of  time  jjoints 
to  ensure  stability  [2][3].  In  FDFD  [4],  it  is  required  to 
solve  a  large  set  of  linear  equations  at  a  large  number 
of  frequency  points.  For  practical  systems  involving 
the  modeling  of  many  components  and  iterative  opti- 
mizaiton,  the  computation  cost  could  be  prohibitive. 

Recently,  Complex  Frequency  Hopping  (CFH)  [5]- 
[7])  which  is  an  expansion  of  a.symptotic  waveform 
evaluation  [8],  has  been  developed  in  the  circuit  simu¬ 
lation  area,  achieving  CPU  speedup  of  factor  of  hun¬ 


dreds  or  even  thousands  over  conventional  circuit  sim¬ 
ulators.  The  technique  received  attention  from  the  cir¬ 
cuit  simulation  community  and  has  been  extended  to 
solutions  of  static  fields  in  VLSI  interconnects  [9],  in 
groud/power  planesflO]  and  thermal  fields  [11].  CFH 
techniques  are  particularly  suitable  for  solving  large 
sets  of  linear  differential  equations.  However  its  poten¬ 
tial  to  be  a  technique  for  solving  nonstatic  EM  equa¬ 
tions  has  not  been  addressed  in  the  literature.  This 
paper  bridges  this  technical  gap  with  a  specific  appli¬ 
cation  of  CFH  techniques  in  solving  the  wave  equa¬ 
tions  derived  from  Maxwell  equations.  A  waveguide 
example  based  on  the  finite  difference  approach  is  used 
to  illustrate  the  technique  and  to  compare  with  con¬ 
ventional  methods. 


2.  CFH  FORMULATION  OF 
MAXWELL’S  EQUATIONS 


From  the  source-free  Maxwell’s  equation,  one  can 
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obtain  the  vector  wave  equations: 

vxvx[^]-/^^|2[^]=0  (1) 

Consider  a  two-dimension  EM  problem  shown  in 
Fig.l.  For  TM  wave  in  a  rectangular  waveguide,  the 
wave  equation  becomes 

d^E^{x,y,t)  _  ^  (d'^E^{x,y,i)  d^-E,{x,y,i)\ 

dii  ^  \  5x2  ay2 

where  c  is  the  velocity  of  light. 

Discretizing  space  and  taking  the  Laplace  trans¬ 
form,  we  get 

^s^Et{i,j,s)  +  F{i,j,s)  =  -^sEi.ii,j,i)\t=o  (3) 

c 


where 


■F(»,i.s)  = 

4.Ez(,i>j>s)  —  Ez{i  +  l,j,s)  —  Ez{i  -  l>i,s) 

and  EziiJ,s)  is  the  Laplace  transform  of  Ez{iJ,t)  , 
Ezii,j,i)  =  Ez{x  =  ih,  y  =  jh,i),  and  h  =  Ax  =  Ay. 

Suppose  n*  and  Uy  a.re  the  number  of  grids  m  the  x 
and  y  directions  respectively,  and  N  =  (n*  ~  1)  x  (”y  ~ 
1).  Let  JC(s)  be  a  AT-vector  containing  Ez{i,j,s) 
for  all  t,  j',  i  =  1,2,  ....,nx  —  1)  i  —  ~ 

Equation(3)  can  be  rewritten  in  the  form 

[s2A-hG]X(s)  =  B(s)  (4) 

where  A  and  G  are  //  x  N  real  matrices  containing 
constants  (  such  as  I’s,  O’s,  4’s,  )  which  are  derived 

from  (3).  In  general,  B(s)  is  a  function  of  the  initial 
conditions  and  external  sources.  For  example,  if  an 
impulse  input  is  added  at  grid  point  (*,i),  then  the 
element  of  B{s)  in  the  [(i - 1)  *  (uy  -  1)  +i]th  position 
is  equal  to  ^s,  and  all  other  elements  are  equal  to 
zero. 

Equation  (4)  is  usually  very  large.  CFH  uses  a  lower 
order  Fade  approximation  to  the  solution  of  (4)  which 
can  be  obtained  by  solving  (4)  at  only  a  few  selected 
frequency  points  [6],  [7]. 

To  use  Fade  approximation  ,  -X^(s)  is  expanded  into 
Taylor’s  series  at  frequency  point  s  =  sq 

oo 

^(«)  = 

n=0 

where  M„  is  the  nth  moment  vector  of  X(s). 


When  the  expansion  point  is  on  the  imaginary  axis. 
So  =  jwo,  then  the  moments  M„  can  be  solved  recur¬ 
sively  from 

[-WgA  +  G]  M'„  = 

A[(-ir+‘2u;oM;_i-M'„_2], 

n  >  2  (6) 


with  , 

[— u)oA  -f  G]  Af  0  =  wqB  (7) 

[-i^oA  +  G\  M'l  =  2ivoAMo  +  B  (8) 
where  B(s)  =  sB;  M„  =  M^,  if  n  is  odd;  and  M„  = 
jM'n,  if  is  even. 

For  each  expansion  point,  the  moments  mn  — 
[M„](i);  n  =  0,1,2,...,2(/ -  1  of  an  output  i  are 
matched  to  a  lower  order  frequency  domain  function 
in  the  form  of  network  transfer  function 


H{s)  =  AC[i](s)  = 


y^-j— n 


(9) 


For  given  I  and  q,  the  coefficients  of  the  numerator 
and  denominator  are  related  to  the  moments  by 


nil 

bo  =  1  (11) 

ar  =  '^rn.r-jbj  r  =  0, 1,...,/.  (12) 

j=0 

where  mj  =  0  if  j  <  0  and  I  =  q-l.  Equation  (10)  is 
known  as  a  Fade  approximation  which  is  the  ba.se  for 
asymptotic  waveform  evaluation. 

For  accurating  solutions  of  Af(5)  of  EM  problems, 
it  is  necessary  to  perform  the  Fade  approximation  at 
several  expansion  points,  i.e.,  complex  frequency  hop¬ 
ping.  Solutions  from  CFH  is  always  stable  and  the  ac¬ 
curacy  can  be  easily  controlled  by  hopping.  Selection 
of  the  expansion  points  is  described  in  details  in  [6],  [7]. 
The  computation  cost  of  the  moments  defined  by  (6) 
is  one  LU  factorization  and  2q  forward-backward  sub¬ 
stitutions  of  a  REAL  matrix  for  each  expansian  point. 
Therefore,  significant  CFU  speed  up  can  be  achieved 
over  conventional  techniques  such  a.s  FDFD  where  (4) 
is  solved  at  a  large  number  of  frequency  points. 
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3.  NUMERICAL  RESULTS 


We  use  three  difTerent  methods  to  calculate  the  fre¬ 
quency  spectrum  of  a  rectangular  waveguide  for  the 
same  problem  given  in  Fig.l  .  The  three  methods  are 
the  CFH  technique,  the  FDTD  and  the  FDFD  ap¬ 
proaches  respectively. 

We  use  Gaussian  distribution  pulses  as  the  in¬ 
put  source[2],  i.e.,  when  ^  =  0,  Ez{i^j)  = 

—  2.0,  (io^jo)  is  the  cen¬ 
ter  input  point  and  io  =  nx/4\  jo  =  ny/2.  We  chose 

for  obtaining  good  stability 
and  accuracy [1],  where,  /  is  the  highest  frequency  of 
the  system.  In  the  CFH  technique,  g  =  8  and  the 
number  of  expansion  points  is  26. 

To  verify  the  proposed  method,  the  cutoff  frequency 
of  the  rectangular  waveguide  is  also  calculated  by  the 
analytical  method  [12]  and  compared  with  that  from 
CFH  in  Table  1. 

Fig.  2  shows  the  frequency  responses  of  the  waveg¬ 
uide  for  the  different  methods.  Table  2  shows  the  CPU 
time  for  the  different  methods  for  different  cases  . 


Table  1:  Cutoff  frequency 


analytical 

fcut(GHz) 

CFH 

FDTD 

FDFD 

21.2281 

21.2281 

21.2217 

21.2201 

26.8526 

26.8495 

26.8494 

26.8502 

34.2310 

34.2120 

34.2231 

34.2202 

51.1253 

51.1047 

51.1094 

51.1011 

57.7478 

57.7160 

57.7080 

57.7011 

60.0444 

59.9967 

59.9887 

59.9610 

63.6859 

63.6477 

63.6477 

63.6522 

69.1162 

69.0701 

69.0542 

68.9522 

.74.1487 

74.1137 

74.1153 

73.9026 

80.5579 

80.5261 

80.5340 

80.4930 

87.5289 

87.4000 

87.3506 

87.2033 

95.4118 

95.3132 

95.1715 

95.0530 

96.8188 

96.7440 

96.5816 

96.1536 

99.1185 

98.9531 

98.8973 

98.8536 

Figure  2:  Frequency  response 


4.  CONCLUSION 


Table  2:  CPU  Time  Consumption 


parameter 

method 

CPU  time(sec.) 

CPU  time 
normalized 
to  CFH 

r2a:  =  100  7?y=50 

h=158.0/i.m 

/3=0.6 

CFH 

765.98 

1 

FDTD 

1986.67 

2.6 

FDFD 

25950.0 

34 

na:=60  ny=30 

h=263.3//m 

/9=0.7 

CFH 

102.48 

1 

FDTD 

440.64 

4.3 

FDFD 

17127.0 

167 

n3:=60  ny=30 
h=263.3/im 

p=0.6 

CFH 

102.48 

1 

FDTD 

489.32 

4.8 

FDFD 

17127.0 

167 

n*=60  ny=30 

h=263.3pm 

p=0.3 

CFH 

102.48 

1 

FDTD 

979.14 

9.6 

FDFD 

17127.0 

167 

The  concept  of  Complex  Frequency  Hopping  has 
been  used  to  derive  an  solution  procedure  to  EM  wave 
equations.  The  technique  provides  potential  speed  or 
stability  advantages  over  conventional  finite  difference 
based  techniques. 
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ABSTRACT 

A  microwave  filter  has  been 
designed.  The  designed  filter  has  been 
simulated  by  using  an  efficient  general 
purpose  analysis  program  based  on  state 
space  approach.  The  practical  design  has 
been  built  by  using  microstrip  technology, 
The  results  of  practical  measurements  and 
the  simulated  results  are  with  good 
agreement. 

1-  Introduction 

Microwave  filters  have  many 
applications  in  radars,  guidance, 
satellite  communication  and  electronic 
warfare.  The  microstrip  filters  are  very 
suitable  f  or  these  applications  because 
the  microstrip  has  the  advantage  of  small 
size,  light  weight,  low  cost  and  easy 
fabrication.  Various  kinds  of  filters  can 
be  realized  by  using  microstrip  type 
structures!!]. 

The  objective  of  this  paper  is  to 
introduce  the  analytical  and  practical 
design  of  microwave  filter.  A  program 
based  on  state_space  approach  has  been 
applied  for  the  analysis  of  the  designed 
circuits  in  both  time  and  frequency 
domains.  Practical  design  and  simulation 
results  are  compared. 

2-  Analytical  Design 

The  design  of  microwave  low  pass 
filter  with  cutoff  frequency  of  2Ghz 
has  been  developed.  Microstrip  filters 
may  be  designed  with  maximally  flat 

response  or  Chebyshev  response  in  the  pass 
band.  The  design  is  based  upon  the 

normalized  g-values.  These  values  must 
be  converted  into  inductances  and 

capacitances.  The  elements  values 

,g  and  g  ,  of  the  low  pass 
^n  "^n+l  ^ 


prototype  filters  for  both  maximally  flat 
(Butter  worth)  and  Chebyshev  are  given  in 
[2].  In  printed  form,  these  types  of 
filters  will  be  realized  as  a  series 

of  high  impedance  and  .  low  impedance 
sections  as  shown  in  Fig.l.  The  length 

of  the  high  impedance  inductive  sections 
may  be  calculated  by[2]. 


Fig.l  Microstrip  L.P.F 
^  -1 

I  (k)= - Sin  (g(k).Z  /Z  )  (1) 

1  wo  0  L 

where  >u..is  the  velocity  of  propagation  in 


microstrip 


eff 

C  =  3E+8  m/sec 

€  ...effective  dielectric  constant 

eff 

Z^  is  the  characteristic  impedance  of 
the  inductive  section. 

z  ....  is  the  characteristic  of  the 
0 

terminating  resistance. 

The  length  of  the  capacitive  section  is 
given  by  [2] 

£  (k)  =  X  (k)  — .  (3) 

c  cc  Wo 
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cc  Zo 


X  (k-l)+X  (k+1) 

cn  C1Z 


wo  I  ( k) 

X  . corrected  value  of  capacitance 

cc 

susceptance. 

3-  Simulation  program 

A  general  computer  program  for  the 
analysis  of  microwave  circuits  has  been 
applied.  The  program  is  based  upon  the 
formulation  of  the  state  and  output 
equations  using  an  efficient  topological 
method  with  no  restriction  on  the 
topology.  The  state  and  output  equations 
for  any  network  can  be  represented  as  the 
f  ollowing. 

(3a)  The  state  and  output  equations 

It  has  been  shown  [9]  that 
the  state  variables  of  a  distributed 
network  are  the  reflected  parameters  at 
all  the  transmission  iine  terminals  or 
ports.  The  state  variables  of  a  lumped/ 
distributed  network  are  the  voltages  on 
all  the  independent  capacitors,  the 
currents  in  all  the  independent  inductors 
and  the  reflected  parameters  (  or  the 
reflected  voltageslat  all  the  transmission 
-line  ports. 

The  state  equation  is  the  differential- 
difference  equation  of  the  form; 


X  (t)  A 
1  1 1 


X  (t+T)  A  . 
2  2l 


A  X  (t)  B 

_i2  _i —  ^  (6) 


A,,  X  (t)  B, 
22  2  2 


where  X^(t)  and  ^2^^^ 

state  vectors  of  the  lumped  and 
distributed  elements  respectively  and 
u(t)  is  the  input  vector.  The 
output  equation  is  given  by 


■  i  irx,(t)' 

y(t)=  CliC2  - - +E  u(t)  (7) 

Equations  (6)  and  (7)  are  derived 
using  topological  methods  with  no 


restriction  on  the  network  topology. 
Where  A,  B,  C  and  E  are  the  matrices  of 
the  state  and  output  equation  of  the 
systems. 

When  the  Laplace  transform  is 
applied  to  equations  (6)  and  (7)  we 
obtain  the  frequency-domain  equations. 


/l2 

-1 

^^^2n"  ^22- 

-®2- 

Si  -  A,, 
m  1 1 


i  +eI  U(s)  (9) 


where  2  =  exp  sT  ,  I  is  the  identity 
matrix,  m,n  are  the  number  of  lumped-state 
elements  and  transmission  lines, 
respectively. 

(3b)  Network  Analysis  Using  the  State  Equatioi 

The  previous  results  are  used  to 

analyze  lumped/  distributed  networks  in 
either  the  frequency  or  time  domain  . 
(l)Frequency  domain  analysis  :  is  Based  on 
the  expansion  of  the  equation  (9) 

to  obtain  the  transfer  function  of  the 
form. 

2n  m 

i 

i=0  j=0 

F(s,z)= — — — - —  (10) 

y2n  ym  g 
i=o^j=o 

This  expansion  is  achieved  by 

using  a  modified  faddeeva  algorithm[7] 
for  the  inversion  of  the  two  variable 
matrix  in  equation  (8). 

(2)Time  domain  analysis  :  The  output 
in  the  time  domain  is  calculate  d  from 
equation  (7)  _  when  the  state  vector 
[X  (t)i  X  (t)r  is  known.  The  state 

Jl  W 

vector  is  first  determined  from 
the  solution  of  (6).  Equation(6)  represents 
two  simultaneous  differential  and 
difference  equation  : 


398 


and 

X2(t)=A2iXi(t-T)+A22X2(t-T)+B2u(t-T)  (12) 

The  state  vector  [Xl(t)  i  X2(T)]''^  is 

assumed  to  be  zero  at  t<O.The  vector 

X2(t)  is  calculated  by  substituting  the 

values  of  XI  (t-T),  X2  (t-T)  and  u(t-T) 
in  equation(12).  The  vector  Xl(t)is 
then  calculated  by  solving  the 
differential  equation  (ll)numerically  in 
each  time  period  kt  <  t  <  (k+l)T,  where  k 
=  0,1,2 . 

The  state  and  output  vectors  have 
discontinues  at  t=  KT  and  the  initial 
values  of  the  state  vector  at  the 

start  of  each  time  period  are 

obtained  from  the  condition  that  the 
integral  of  the  state  vector  is 

continuous. 

The  insertion  loss  predicated  by 
simulation  program  is  shown  in  Fig.l 


Fig.l  Insertion  loss  versus  frequency(Ghz) 
of  an  L.P.F 

4-  Practical  Design 

The  practical  design  of  the 

microwave  filter  with  cutoff  frequency  of 
2Ghz  has  been  developed  in  a  stepped 
impedance  form  by  using  microstrip 

technology  according  to  the  following 

considerations. 

1-The  choice  of  the  substrate  material 
depends  on  the  frequency  of 
operation,  cost,  and  applications. 
High  dielectric  constant  substrates 


result  in  small  size  circuits.  RT- 
Duriod  5880  (Teflon)  of  er  =  2.33 
and  h  =  .787  mm  is  chosen.  The 

substrate  with  copper  clad  with 

is  widely  used  in  microwave 

integrated  circuits. 

2- A  conventional  microstrip  synthesis 

technique  described  in  [3]is  carried 
out  to  calculate  the  dimension  of 
microstrip  circuits.  The  layout  of 
the  circuit  is  carefully  prepared 
keeping  in  mind  the  microstrip 

discontinuity  at  step  in  width  the 
mask  layout  is  shown  in  Fig.  2. 

3-  Fabrication  process  of  thin  film 
microwave  integrated  circuits  using 
photolithographic  technique  is 
applied. 

4- The  circuits  is  measured  by  using 

direct  measurement  technique.  An 
input  signal  ranging  from  0.8  to  2.4 
Ghz  with  varying  amplitude  is 
applied  to  the  input  port  of  the 

filter.  The  power  of  the  output 

signal  and  spectrum  of  the 
fundamental  and  other  harmonics  are 
measured  at  the  output  port. 

The  insertion  loss  is  given  by: 

IL(dB)  =  P  TdBm)  -  P  (dBm) 
out  in 

The  insertion  loss  versus  the 

frequency(Ghz)  is  shown  in  Fig.3. 


Fig. 2.  The  layout  of  the  L.P.F 


Fig.3.  The  insertion  loss  versus  the 
frequency(Ghz) 
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5"Conclusion 

The  design  procedure  of  microstrip 
low-pass  filter  is  presented.  The  computer 
aided  analysis  adopted  in  this  paper  is 
with  good  stability  and  suitable 
computation  time.  The  results  show  good 
agreements  between  simulation  and 

measurements.  The  deviation  between  the 
measured  and  predicted  simulated  values 
was  due  to  imperfection  associated  with 
the  connectors  used  and  the  fabrication 
tolerances. 
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ABSTRACT 

The  paper  describes  a  new  implementation  of  testing- 
algorithm  model  for  analogue  circuits.  It  is  based  on 
the  possibilities  of  HSPICE  and  MATLAB  to  manage 
whole  test  simulation  including  the  simulations  of 
faulty  or  fault  free  circuits  as  well  as  post-processing  of 
them.  The  approach  takes  account  to  tolerances 
deviations  on  parameters.  Both  the  low-level  and  high- 
level  modelling  are  assumed.  Aim  of  this  project  is  to 
model  test  techniques  in  order  to  classify  their 
effectiveness  in  the  sense  of  fault  coverage,  namely  for 
test  techniques  using  supply  current  monitoring. 

1.  mTRODUCTION 

In  the  last  few  years,  analogue  and  mixed-signal 
integrated  circuits  have  grown  in  importance.  Several 
new  test  ideas  were  introduces  that  provides  a  number 
of  benefits  in  comparison  with  traditional  analog  test 
approaches.  But  at  present  there  are  still  only  few 
techniques  available  that  can  adequately  test  these 
devices,  although  the  area  has  attracted  considerable 
interest  in  recent  years.  There  are  three  major  problems 
in  analog  testability:  the  problem  of  modelling  of  faulty 
components,  the  problem  of  tolerances  and  the  problem 
of  limited  number  of  test  points  (accessible  nodes). 

The  approach,  which  is,  at  present,  attracting  a 
great  deal  of  interest,  is  to  monitor  a  supply  current 
rather  than  functional  outputs  in  order  to  observe 
faults.  This  approach  covers  fault  detection  in  both  the 
analogue  and  digital  parts  of  the  device.  In  this 
approach  the  current  passing  through  the  terminal  is 
monitored  during  the  application  of  test  vectors. 
Originally  these  methods  were  proposed  for  digital 
testing  and  are  widely  worked  out,  providing  effective 
solutions  to  increasingly  difficult  test  problems.  There 
are  also  several  described  techniques  for  analogue 
testing  which  also  offer  the  advantage  of  direct 
detection  of  the  defects  obviating  the  necessity  to 
propagate  a  defective  voltage  to  an  output  port.  It  is 
also  able  to  detect  certain  defects  missed  by  traditional 
voltage  test  approaches  based  on  the  stuck-at  fault 
model.  Unfortunately,  these  methods  are  not  worked 
out  enough  and  we  are  also  required  to  eliminate  the 


effect  of  tolerances  of  nonfaulty  components  and 
measurement  errors,  those  problems  have  absolutely  no 
equivalent  in  the  digital  circuits  testing.  Monte  Carlo 
technique  has  been  proved  to  be  a  powerful  tool  which 
is  expected  hopefully  to  reach  the  above  goal. 

2.  PROBLEM  OF  TOLERANCES 

In  the  testability  area  we  deal  with  faulty  models 
of  devices  for  catastrophic  and  soft  faults,  but 
nevertheless,  the  effect  of  tolerances  must  be  taken  into 
account.  Calculations  only  with  the  nominal  values  of 
nonfaulty  elements  parameters  would  be  an 
oversimplification.  The  values  of  these  parameters  may 
lie  anywhere  inside  of  prescribed  tolerance  band.  As  a 
result  of  this,  the  faulty  model  of  device  will  not  be 
characterized  by  single  voltage/current  at  accessible 
nodes  (test  points).  Instead,  the  model  will  be 
characterized  by  allowed  bands  of  each  response,  which 
can  be  considered  as  mapping  of  tolerance  bands  of 
nonfaulty  components  [2].  These  bands  are  called  the 
tolerance  envelopes.  The  method  of  testing  is  said  to  be 
robust,  if  it  is  reliable  also  in  the  case,  when  the 
parameters  have  not  nominal  values,  but  lie  in  the 
prescribed  tolerance  bands. 

The  tolerance  envelope  (tolerance  band)  of  supply 
current  for  fault  free  analogue  circuit  can  be  quite  wide 
and  it  is  difficult  to  establish  border  between  fault  free 
and  faulty  circuits  (go/nogo  test).  A  worst-case 
tolerance  analysis  is  the  identification  of  the  extreme 
(i.e.  worst)  values  of  supply  current  (generally  a 
performance)  resulting  from  the  variations  in 
component  values.  The  effect  of  tolerances  can 
completely  dominate  the  performance  of  a  device  under 
test.  There  is  the  lack  of  published  data  on  Idd  test 
limits.  Some  values  are  based  on  an  engineering 
judgement  and  others  are  guessed.  Unfortunately,  the 
ideal  situation  with  the  defined  gap  (Fig.  6)  or 
similarly  with  the  frequency  signature  was  not  proved, 
esp.  for  soft  faults.  Of  course,  Idd  should  be  tested  at 
the  highest  possible  voltage  and  after  bum-in  process 
(activation  of  hidden  defects).  Initial  proposals,  using 
off-chip  current  testing,  were  replaced  for  large  circuits 
by  on-chip  testing  (also  with  several  sensors  to  get  over 
the  limited  resolution). 
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Fig.  1.  HSPICE  based  flowchard  of  the  proposed  system 


Analogue  circuits  can  and  do  have  a  wide  range 
of  element  values  and  topology.  Direct  simulation 
seems  to  be  only  way  to  obtain  an  exact  behaviour  of 
supply  current  or  other  responses.  The  only  method  we 
can  use  is  the  Monte  Carlo  analysis.  This  method  is 
characterized  by  the  use  of  pseudo-random  numbers 
within  statistic^  sampling  experiments.  Modelling  for 
Monte  Carlo  analysis  requires  a  statistical  model 
capable  of  generating  individual  transistor  models 
which  are  then  used  in  an  analysis  programme.  The 
statistical  modelling  problem  considered  below  is  how 
to  construct  a  suitable  statistical  model  which  can 
accurately  approximate  both  the  spreads  and 
correlations  of  parameters.  An  assumption  is  made  that 
the  parameters  distribution  functions  (pdf)  are  well 
modeled  by  Gaussian  distributions  including  inter¬ 
parameter  correlations.  The  problem  was  approached 


successfully  for  the  cases  to  be  described  by  a 
combination  of  theoretical  considerations  and  the 
maximum  practical  use  of  experimental  data.  We  have 
been  handicapped  by  a  lack  of  statistics  about  circuits. 

The  pdf  ^p)with  expectations  p.  and 
covariance  matrix  C,  can  be  written 
as; 

Independent  random  parameters  of  statistical 
device  models  are  often  Gaussian.  If  their  mapping  to 
performance  characteristic  (inc.  supply  current)  can  be 
approximated  by  linear  functions,  then  the 
performance  characteristics  will  have  Gaussian  pdf. 
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The  computer  generation  of  the  n-th  random 
vectors  p“  with  the  above  pdf  is  performed  as  follows: 
first  a  normal  vector  x"  with  a  distribution 

whereE  is  the  unit  matrix,  is  generated 
and  then  it  is  transformed  to  p*  as  follows 

where  L  is  the  lower  triangular  matrix  resulting  from 
the  Cholesky  decomposition  of  C,  namely  C~L-lF . 
In  practice  it  turns  out  that  it  is  easier  to  use  the 
formula  for  the  bootstraped  pdf  (tracked  pdf) 

<? 

where  ai  and  X,  are  random  numbers  from  closed 
interval  (-1,  l)  both  with  Gaussian  pdf.  Coefficients 
kj  (equals  zero  for  imcorrelated  parameters)  express  the 
tracking  by  variables  X,.  See  Fig.  4. 

3.  METHOD  DESCRIPTION 

On  fig.  1  we  can  see  the  flowchart  of  the 
simulation  system.  Only  MOS  circuits  are  simulated. 
Each  transistor  is  replaced  by  its  model  for  shorts  and 
opens  [IJ.  For  circuit  simulation  the  HSPICE 
programme  is  used.  As  outputs  with  benefit  we  can 
assume  four  different  characteristics:  supply  current  in 
time  and  frequency  domains  as  well  as  output  voltage 
in  time  and  frequency  domain.  It  has  been  noticed  that 
processing  of  supply  current  characteristic  by  Fourier 
transform  could  bring  interesting  results  (frequency 
domain  approach)  •  a  signature  of  the  current  status. 
But  there  is  a  problem  about  the  best  input  stimuli  for 
testing  purposes.  Some  presumptions  has  existed  and  it 
could  be  possible  to  verify  them.  Our  choice  was  a  sine 
wave,  a  square  wave,  and  a  sawtooth  wave. 


Fig.  2.  Two  dimension  Gaussian  distribution 


First  TR  analysis  runs  for  fault  free  circuit  with 
parametr  variation  using  Monte  Carlo  analysis  for  both 
supply  current  and  output  voltage.  Then  we  can 


estimate  tolerance  intervals  for  supply  currents  and 
frequency  signatures.  Now  TR  an^ysis  runs  for  all 
possible  (or  supposed)  faults.  By  comparing  faulty 
characteristics  with  tolerance  intervals  and  frequency 
signatures  we  obtain  the  final  result  •  detectability  for 
each  fault  (faults  coverage). 


A  larger  circuit  can  be  modelled  on  a  high  level 
as  a  mathematical  description  of  its  behaviour.  The 
fault  free  circuit  and  its  tolerance  envelope  are 
transferred  to  the  high  level  as  a  point  with  a  sphere 
around  (Figure  5).  The  idea  was  to  do  the  same 
procedure  for  a  fyulty  circuit  to  model  a  faulty  circuit 
by  use  of  the  fault>firee  model.  On  the  transistor  level 
the  faulty  and  fault-free  circuits  are  represented  by 


Fig.  4.  Linearly  correlated  parameters 


different  coimections  (added  or  missing).  This  circuitry 
difference  will  be  shifted  to  the  high  level  as  a 
parameter  difference.  The  faulty  behaviour  can  be 
placed  to  the  same  board  as  the  fault  free  behaviour 
(Figure  5).  Each  fault  is  displayed  as  a  point. 
Manufacturing  parameter  variation  of  the  faulty  circuit 
can  be  transferred  and  placed  around  the  fault  as  a 
probability  density  cloud  (the  tolerance  band  is  not 
defined  for  the  faulty  circuit).  If  the  fault  free  envelope 
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(tolerance  band)  and  faults  are  put  together,  a  simple 
detection  strategy  can  be  found.  The  faults  which  are 
situated  inside  the  envelope  are  “potentially”  non- 

m 


Fig,5.  Tolerance  envelope  for  a  fault  free  (FF)  circuit 
and  a  scatter  of  high-level  pareameters  for  a 
faulty  (FI)  circuit 

etectable.  Faults  that  are  placed  outside  are 
“potentially”  detectable.  This  simple  comparison  does 
not  demonstrate  degree  of  detectability  or  respective 
non-detectability.  To  add  this  scaling,  the  variation  of 
manufacturing  parameters  for  faulty  devices  has  to  be 
taken  in  to  account.  The  variation  for  a  faulty  device 
follows  a  Gaussian  distribution.  This  enables  us  to 
calculate  the  probability  of  in-  or  out-  envelope 
occurrence  for  each  fault  separately.  The  probability  of 
occurrence,  defined  as  an  integral  of  probability  density 
function  apart  from  the  envelope,  is  equal  to  the 


10 


Fig.  6.  Three  dimensional  map  of  faults  with 

probability  density  values  on  the  z-axis 


potential  detectability  of  the  fault.  By  using  the 
principles  above  the  faults  were  transferred  one  by  one 
to  high-level  space  with  their  parameter  variation.  The 
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Fig.  7.  Fault  coverage  for  the  supply  current  in  time 
domain  (input  signal  =  sawtooth  wave) 


results  are  placed  on  to  a  two  dimensional  map  as 
probability  density  clouds,  and  on  a  three  dimensional 
map  (Figure  6)  with  probability  density  on  the  z-axis. 
The  potential  detectability  is  calculated  by  a  Monte- 
Carlo  integration,  the  integral  is  a  ratio  of  the  points  in 
the  envelope  to  the  total  number  of  points.  TTie  map 
can  be  made  only  for  two  parameters,  but  the 
calculation  is  n-dimensional  generally. 

These  topics  will  be  discuss  in  more  detail  in  the 
presentation  as  well  as  the  results  achieved.  Examples 
will  be  given  of  three  circuits,  an  invertor,  an 
operational  amplifier  (Fig.  7),  and  a  2-bit  flash  ADC. 
liiese  applications  are  representative  of  practical 
problems. 


4.  CONCLUSION 

Testing  is  an  essential  part  of  each  manufacturing 
process.  Test  methods  for  analogue  circuits  are  not  yet 
fully  established.  In  this  work,  the  inclusion  of  the 
effects  of  manufacturing  tolerances  of  analogue  and 
mixed-mode  circuits  to  fault  detectability  has  been 
presented.  Here  the  test-process  modelling  system  was 
described.  It  solves  needs  for  wide  flexibility  during  the 
test  process  modelling.  Different  fault  models,  input 
stimuli  but  mainly  different  analyses  can  be  easily 
processed.  Novel  fault-processing  algorithm  with  wide 
reprogramming  features  was  designed.  Both  the  low- 
level  and  high-level  modelling  are  assumed.  The 
importance  of  a  supply  ciurent  testing  method  was 
justified.  This  method  is  recommended  in  mass 
production  of  integrated  circuits  for  inexpensive 
screening  out  of  faulty  circuits  in  the  initial  stages  of 
testing. 

The  author  would  like  to  thank  to  Mr  J. 
Povazanec  from  The  Leeds  Metropolitan  University  for 
his  helpful  discussions  and  suggestions. 
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Abstract  -  A  Simulated  Annealing  algorithm  is 
presented  for  topological  optimization  of 
communication  networks.  An  extensive  testing  over 
networks  of  various  sizes  and  configurations  is  made 
in  order  to  choose  the  best  values  of  the  parameters 
and  to  adapt  the  Simulated  Annealing 
implementation  to  the  specific  problem.  The 
performances  of  the  algorithm  are  compared  to  that 
of  other  traditional  techniques. 

1.  Introduction 

Large  attention  has  been  recently  devoted  to  various 
problems  related  to  the  optimization  of  network 
architecture,  due  to  the  impressive  growth  of 
communication  services  and  to  the  need  of  larger 
frequency  bandwidth  and  availability  of  new 
transmission  media. 

In  particular,  networks  based  on  a  unique  exchange  or 
hub  facility  connected  to  the  users  trough  multiplexed 
links  seem  to  provide,  in  many  cases,  installation 
flexibility  and  costs  suitable  to  fill  the  operational 
requirements  of  the  so-called  B-ISDN,  Large 
Bandwidth  Integrated  Service  Digital  Network.  The 
same  topology  is  used  in  die  computer  networks  using 
concentrators.  Users  are  distributed  over  local  or 
metropolitan  areas,  and  are  represented  in  the  following 
by  network  access  points  at  a  level  of  about  2  Mbit/s. 

To  represent  the  topology  of  these  multiplexer  or  traffic 
concentrators  networks  it  is  convenient  to  introduce  a 
graph  model.  Let  G(N,  A)  be  an  undirected  graph  (the 
communication  networks  at  hand  allow  simultaneous 
transmission  in  both  the  directions),  where  N={  1,  2, ..., 
n}  is  the  set  of  nodes  representing  locations  of  boUi 
existing  or  possible  multiplexers,  and  locations  of 
users,  and  A={(i,  j)}  is  the  set  of  arcs  of  G.  These 
represent  the  possible  communication  links  that  may  be 
used  to  connect  the  users,  the  multiplexers  and  the 
exchange  among  them  (cable  conduit  graph).  This 
graph  contains  all  the  information  about  die  particular 
area  in  which  die  net  must  be  built,  and  can  be  derived 
examining  the  road  map,  the  existing  telephone 
network  and  any  geographical  constraint. 

The  design  problem  consists  of  choosing  a  spanning 
tree  T  of  G  connecting  all  the  users  to  die  multiplexers 
dirough  the  distribudon  network,  and  die  multiplexers 


to  the  exchange  through  the  transport  network,  allowing 
the  overall  cost  of  the  plant  be  minimised. 

Considering  the  higher  cost  of  the  fibber-opdc  cables, 
compared  with  the  costs  of  the  equipment,  it  seems 
reasonable  to  realise  a  minimum  path  spanning  tree. 
The  opdmizadon  problem  is  so  reduced  to  choose  a 
sub-set  M  of  N  in  which  muldplexer  centres  have  to  be 
implemented.  This  fundamental  scheme  has  been 
already  proven  to  be  suitable  to  use  various  objective 
functions,  considering  different  terms  of  cost,  and  to 
investigate  connectivity  and  reliability  problems  [1]. 
When  the  exchange  site  is  selected  and  a  number  of 
multiplexers  are  made  active,  bodi  the  transport  and  the 
distribution  networks  can  be  obtained  by  means  of  the 
well-known  minimum  distance  Dijkstra  algorithm  [2]. 
This  leads  immediately  to  model  the  network  by  a 
binary  string,  in  which  ones  and  zeroes  indicate 
activation  or  no-activation  of  multiplexers  in  die  n 
nodes. 

2.  Simulated  Annealing 

The  objective  function,  representing  the  cost  of  the 
network,  can  be  described  as  the  sum  of  four 
components:  the  cost  of  the  distribution  network,  the 
cost  of  the  transport  network,  the  cost  of  the 
multiplexers  and  the  cost  of  the  exchange.  This 
function  takes  discrete  values  according  to  feasible 
configurations  of  the  net  because  the  multiplexers  have 
a  finite  number  of  inputs  and  one  output  and  are 
available  in  standard  hierarchies,  as  the  fibber  optic 
cables.  In  addition,  the  topological  variables  are  of 
binary  nature.  Summing  up,  the  variables  to  be 
optimized  are  discrete  and  the  objective  function  is  not 
linear. 

The  resulting  problem  is  a  typical  large  size 
combinatorial  optimization  problem,  falling  in  die  class 
of  the  NP-hard  problems,  for  which  there  is  the  need 
for  heuristic  algorithms.  Among  them,  large  attention 
has  been  recently  devoted  to  the  Simulated  Annealing 
(S.A.)  approach  [3],  based  on  an  iterative  improvement 
technique,  suitably  corrected  introducing  random 
moves. 

To  define  such  an  algorithm,  a  generation  and  an 
acceptance  procedure  for  the  different  network 
configurations  must  be  introduced,  or,  better,  a 
perturbation  method  has  to  be  defined  allowing  to  pass 
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from  one  configuration  to  another.  The  acceptance 
criterion  is  governed  by  a  random  number  generator 
and  a  control  parameter,  called  temperature. 

Starting  from  an  admissible  solution  of  the  problem,  the 
search  strategy  in  the  neighbourhood  of  such  a  solution 
will  be  more  intensive  in  the  more  promising  regions, 
penalising  the  searches  that  move  far  from  these 
regions  but  accepting  with  certain  probability,  also 
searches  that  worsen  the  solution.  The  temperature, 
slowly  modifying  its  value,  drives  the  system  toward 
the  final  solution,  which  corresponds  to  a  local 
minimum  of  the  objective  function.  In  Fig.  1  a  typical 
implementation  of  the  algorithm  is  shown. 


1.  Choose  an  initial  solution  S 

2.  Sett=to 

3.  Repeat  until  system  is  frozen. 

3.1  Do  the  following  cycle  /  times. 

3.1.1  Make  a  perturbation  in  S  and  generate 
S'eRs 

3.1.2  Set  AC=C(S')-C(S) 

3.1.3  If  ziC<0,tlienset5=5' 

3.1.4  If  4C>0set  S  =  S'  with  probability 
exp( -  AC/ 1) 

3.2  Set  r  =  r  t. 

4.  Show  S. _ 

Fig.  1  Typical  implementation  of  Simulated  Annealing 

The  core  of  this  procedure  is  in  step  3.1,  known  as 
Metropolis  algorithm.  It  must  be  noted  that  exp(- 
AC/t)  is  a  number  in  [0,1)  when  AC  and  t  are 
positive,  which  may  be  correctly  understood  as  a 
probability  dependent  on  ^  C  and  r. 

In  the  above  implementation,  it  is  of  fundamental 
importance  to  introduce  two  additional  parameters:  the 
cooling  ratio  r  and  an  integer  number  /  that  represent 
the  number  of  iteration  executed  for  each  value  of  the 
temperature  [4],  i.e.,  the  length  of  tlie  Markov  chain  in 
the  Metropolis  algorithm. 


3.  Cooling  schedule 

To  apply  tlie  S.A.  algorithm  to  the  problem  at  hand,  we 
focus  our  attention  on  three  main  aspects:  tlie  goodness 
of  die  solutions,  die  computational  time  and  die 
capability  of  the  algorithm  to  cope  widi  networks  of 
different  dimensions  and  topology. 

As  previously  mentioned,  die  implementation  requires 
to  set  various  parameter  values,  following 
considerations  of  experimental  nature. 

To  ensure  die  convergence  of  the  algoridim,  it  is 
necessary  to  define  the  dimension  of  the  neighbourhood 
of  a  configuration.  A  large  number  of  experiments 
showed  that  the  dimension  of  the  neighbourhood 
affects  only  computational  time,  but  not  die  final  value 
of  the  objective  function.  As  previously  mentioned  any 
feasible  configuration  has  been  represented  widi  a 


binary  string  of  n  elements:  a  T’  in  a  certain  position  of 
the  string  means  that  multiplexer  has  been  activated  in 
the  corresponding  node;  '0*  means  that  the  node  does 
not  contain  a  multiplexer  center.  The  neighbourhood  is 
generated  randomly  choosing  k  elements  of  the  string 
and  changing  ’I’s  with  ’O's  and  'O's  with  *1'$.  The 
dimension  of  the  neighbourhood  is  stricdy  linked  with 
the  value  of  k.  Many  trials  showed  that  this  value  has  to 
be  changed  with  the  number  of  nodes.  In  many  cases  k 
can  be  changed  during  the  evolution  of  the  algorithm, 
so  that  in  the  beginning  large  neighbourhood  is  visited, 
while  in  the  proximity  of  frozen  point  neighbourhoods 
are  restricted  because  only  with  small  perturbations 
better  solutions  can  be  obtained. 

One  of  the  most  important  problem  involved  in  the  S.A. 
algorithm  implementation  is  the  definition  of  a  proper 
cooling  schedule,  which  is  based  on  the  choice  of  the 
following  parameters:  starting  temperature,  final 
temperature,  length  of  Markov  chains,  the  way  of 
decreasing  temperature.  A  correct  choice  of  these 
parameters  is  crucial  because  the  performances  of  the 
algorithm  strongly  depend  on  it.  In  literature  many 
cooling  schedule  are  mentioned  [5].  In  this  paper, 
parameters  have  been  chosen  after  a  large  number  of 
tests. 

Tlie  starting  value  of  t,  to,  has  been  chosen  according  to 
[5].  Let  X  be  the  acceptation  ratio,  i.e.  the  ratio  between 
the  number  of  accepted  tiansitions  and  the  number  of 
generated  transitions,  the  value  of  to  is  doubled  until  % 
becomes  higher  than  a  fixed  value  Xo-  Starting  with 
to-0.5  and  [5],  the  program  has  been  optimized 

(in  terms  of  the  goodness  of  the  solutions  and  of  the 
computational  time)  witli  to=0.4  and 
Anotlier  important  problem,  involved  in  the  Metropolis 
criterion,  consists  in  introducing  a  suitable 
normalisation  of  the  costs.  In  fact,  they  have  to  be 
compared  with  the  control  parameter  t.  As  can  be  seen 
in  step  3.1.4  in  Fig.  1,  the  values  exp(-AC/t )  must  be 
uniformly  distributed  in  the  interval  [0,1).  This  can  be 
achieved  assuming  a  different  normalisation  parameter 
in  each  step  of  the  procedure,  varying  according  to  tlie 
variation  of  the  costs  of  the  solutions  explored  during 
the  previous  Markov  chain. 

The  second  parameter  to  be  set  is  the  length  of  Markov 
chains.  It  is  polynomially  linked  to  the  problem'  size 
[5],  i.e.  to  the  number  n  of  nodes  of  the  network.  In 
particular  the  maximum  number  of  accepted 
configurations  have  been  set  to  10*n  while  the 
maximum  number  of  generated  configurations  has  been 
set  to  50*n. 

One  of  tlie  most  used  strategy  for  varying  t  during  the 
search  is  [5]:  t^^+i;  =  ^  with  0.5 <r< 0.99.  The 

best  choice  for  r  has  been  made  considering  the 
goodness  of  solutions  and  the  amount  of  computational 
time  needed.  Best  results  have  been  obtained  in  the 
field  of  0.95  <r<  0.97.  In  fact,  it  has  been  noted  that, 
assuming  lower  values  it  is  possible  to  be  trapped  in 
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local  minima,  otherwise,  with  higher  values  a  large 
amount  of  computational  time  is  needed. 

In  Fig.  2  the  results  of  three  tests  are  shown:  one  with 
r=0.99,  one  with  r^O.95  and  another  with  r=0.9.  As  can 
be  noted,  in  the  first  case  a  large  amount  of 
computational  time  is  needed;  in  the  second  and  in  the 
third  case  the  computational  time  is  reduced:  In 
particular  setting  r=0.95  results  very  close  to  optimum 
are  found. 


Fig,  2  Costs  vs.  computation  time  for  different  clioices  of  r. 

The  final  value  of  t  is  strictly  linked  to  stop  conditions. 
To  be  sure  that  no  improvement  is  possible  and  tliat  the 
system  is  frozen,  two  conditions  must  be  verified 
together.  Firstly  t  is  decreased  until  it  is  less  or  equal  to 
a  fixed  value  iy.  Moreover,  improvements  in  the 
objective  function  must  be  negligible. 

The  evolution  of  the  solution  values  during  the 
annealing  processes  shown  in  Fig.  3,  for  a  network  of 
large  size  (120  nodes),  and  for  parameter  values  chosen 
widi  the  previous  criteria. 


Fig,  3  Algoritlim  evolution  for  a  120  nodes  network. 

4.  Results 

Results  of  extensive  testing  over  networks  exhibiting 
different  sizes  and  different  topologies  of  tlie  cable 


conduit  graph  are  reported.  In  particular,  two  topologies 
of  random  and  geometrical  graphs  [4]  with  small  (about 
50  nodes),  medium  (about  100  nodes),  and  large  (about 
150  nodes)  size  are  considered. 

As  a  result,  we  have  settled  the  optimum  values  of  the 
parameters  involved  in  the  algorithm. 

In  'Tab.  I,  we  report  the  values  of  the  optimised 
parameters. 


Table  I  -  Optimal  cooling  schedule 


PARAMETERS 

VALUES 

Number  of  Changes  k 

1+5 

Initial  Temperature 

0.4 

Final  Temperature 

0.14 

Length  of  Maricov 
Chain 

50*n 

Cooling  Ratio 

0.95 

As  an  example,  we  report  the  results  obtained  for  a  real 
network  of  large  size  (160  nodes),  whose  data  have 
been  supplied  by  the  Italian  Public  Telephone 
Company.  The  cable  conduit  graph  has  been  derived 
examining  the  existing  telephone  network  and 
geographical  constraints. 

In  Fig.  4,  the  results  corresponding  to  200  tests  are 
reported.  As  can  be  noted,  the  larger  part  of  the  results 
falls  in  a  range  of  +/-  1%  around  the  minimum  cost 
(44.724  MLit).  The  minimum  was  obtained  with  a 
frequency  of  17%.  In  Fig.  4  we  also  report  the  result 
obtained  with  a  local  optimization  algorithm  (L.O.A.) 
[6],  showing  tlie  improvement  obtained  with  the  S.A. 
approach. 


central 

value 


Difference  wi«h  Ihe  optimal  solution  % 

Fig.  4  Frequency  occurrence  of  the  solutionf?  versus  percent 
difference  with  respect  to  tlie  optimal  solution. 

The  computation  time  of  the  more  frequent  solutions 
range  from  15  to  17  minutes  on  a  HP/715  workstation. 
The  results  obtained  applying  the  S.A,  algorithm  to  a 
network  of  58  nodes,  corresponding  to  tlie  telephone 
district  of  Oristano,  Sardinia,  are  reported  in  Tab.  II,  for 
different  choices  of  the  location  of  the  exchange, 
starting  from  random  configurations  or  from  the  local 
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optimization  algorithm  result.  The  S.A.  results  are 
compared  with  those  obtained  with  the  L.O.A.. 


Table  II  -  Comparison  between  S.A.  and  L.O.A.  results  for  tlie 
Oristano  network. 


Exchange 

location 

Starting 

configurati 

on 

Candidate 
sites  for  tlie 
multiplexers 

Links 

layout 

Cost 

improvement 
with  respect 
to  L.O.A. 

fixed 

(Oristano) 

result  of 
L.O.A. 

15 

optimal 

fixed 

(Oristano) 

random 

15 

optimal 

Wo 

fixed 

(Oristano) 

random 

58 

random 

0.1% 

fixed 

(Oristano) 

result  of 
L.O.A. 

58 

optimal 

1.3% 

random 

random 

58 

complete 

graph 

7% 

In  Tab.  Ill,  all  the  statistics  over  the  numerous  networks 
considered  are  reported,  together  with  comparison  with 
tliose  obtain  witli  other  optimization  methods  such  as 
heuristic  [6]  and  genetic  [7],  both  in  terms  of  cost  and 
computation  time. 
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Table  III  -  Results  of  21  test  cases  (r  random  networks,  fieometric  networks). 
a:. . . . . I  . . '  I 


Heunsiic 

Genetic 

Vnnealing 

Networks 

Cost 

[MUt] 

Cost 

[MUt] 

Tinie[sJ 

Cost 

[MLitJ 

Time  [s] 

82725.9 

31306.70 

115.17 

31306.70 

35.79 

45  nodes  r 

31566.30 

19440.79 

58.75 

19.92 

50  nodes  r 

30217.76 

20083.49 

0.19 

19956.80 

19956.80 

21.7 

54  nodes  r 

42621.92 

26731.53 

0.37 

26057.90 

58.75 

26049.71 

18.89 

96  nodes  r 

79007.85 

39748.11 

1.84 

39272.08 

573.38 

39163.73 

172-7 

100  nodes  r 

114315.5 

51072.22 

2.25 

50240.43 

851.74 

50065.82 

105  nodes  r 

230950.7 

75139.37 

2.99 

72368.35 

1130.33 

72365.66 

148  nodes  r 

174828 

65912.76 

11.33 

64929.02 

2673.84 

63526.07 

150  nodes  r 

166723.60 

70526.54 

7 

69483.10 

67726.28 

689.83 

152  nodes  r 

284270.30 

95534.58 

15.43 

96495.11 

3165.80 

95170.32 

1008.27 

48  nodes  g 

20630.60 

12866.69 

0.21 

12853.94 

12853.94 

10.03 

50  nodes  g 

27250.14 

15807.78 

11.75 

14891.19 

11286.37 

162 

52983.24 

0.63 

24996.41 

■EEiilrM 

24600.07 

83.38 

47029.95 

28330.15 

27548.35 

114.38 

102  nodes  g 

26276.03 

16954.45 

712.29 

16928.37 

123.59 

120  nodes 

142813.7 

62 

51919.23 

425 

147  nodes  g 

50542.34 

28121.80 

2.01 

27469.46 

1858.41 

27068.75 

485.28 

150  nodes  g 

53957.48 

31550.54 

1.91 

31460.94 

1580.45 

153  nodes  g 

62696.83 

32389.70 

1.97 

31748.29 

1720.48 

31321.24 

600.99 

160  nodes 

176382 

44920.50 

15.34 

44904.60 

5700 

44724.50 

955.98 

5.  Conclusions 

A  Simulated  Annealing  implementation  for  the 
topological  optimization  of  communication  networks 
design  has  been  presented.  Results  over  several  test 
cases  showed  the  crucial  importance  of  a  correct  choice 
of  the  parameter  values  to  obtain  good  solutions  in  a 
reasonable  amount  of  time. 

Furthermore,  comparisons  with  other  optimization 
techniques  showed  the  suitability  of  die  proposed 
algoridim  to  solve  such  problem. 
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ABSTRACT 

This  paper  considers  various  wideband  signal  .model  opti¬ 
mization  techniques  and  associated  performance  results  us¬ 
ing  steerable  fixed  beam  microphone  array  processing  for 
hearing  aid  applications  in  free-space  and  reverberant  con¬ 
ditions.  We  first  review  and  compare  various  conventional 
broadband  and  narrowband  array  optimization  techniques. 
Then  new  results  on  maximum  energy  criterion  broadband 
array  optimization  formulated  for  sub-band  processing  are 
presented.  The  uniforrrily  spaced  sub-band  and  the  non- 
uniformly  spaced  sub-band  using  quadrature  mirror  filter 
approaches  are  treated.  Then  algorithms  for  implementing 
the  hearing  aid’s  search  mode  base  on  direction-of-arrival 
(DOA)  estimation' techniques  for  acoustic  signals  using  sub¬ 
space  methods  are  discussed.  Finally,  various  simulation  re¬ 
sults  under  the  maximum  energy  criterion  and  DOA  estima¬ 
tion  for  free-space  and  reverberant  conditions  are  presented 
to  demonstrate  the  feasibility  of  the  proposed  techniques. 

1.  INTRODUCTION 

Speech  conversations  in  a  room  may  be  difficult  due  to  sev¬ 
eral  nearby  simultaneous  speakers,  a  number  of  indepen¬ 
dent  noise  sources  in  the  background,  and  reverberation  of 
all  these  sounds.  For  the  hearing  impaired,  due  to  reduced 
dynamic  range  and  sensitivity  in  amplitude  and  frequency, 
this  problem  is  particularly  severe.  Direct  amplification  in 
a  standard  hearing  aid  with  a  single  microphone  is  useful 
only  in  limited  situations.  It  is  known  that  increasing  the 
desired  speaker’s  SNR  relative  to  all  other  received  sound  is 
a  meaningful  way  to  increase  speech  intelligibility.  Indeed, 
in  many  critical  cases  near  thresholds  of  intelligibility,  a  4-5 
dB  SNR  improvement  may  lead  up  to  50%  improvement  in 
intelligibility. 

A  microphone  array,  with  the  capability  of  forming  a  fo¬ 
cused  beam  toward  the  desired  speaker  and  possibly  reject¬ 
ing  discrete  spatial  sources  and  attenuating  general  back¬ 
ground  reverberant  noises,  is  a  practical  and  effective  method 
for  enhancing  the  desired  SNR.  Besides  the  hearing  aid 
problem,  the  desired  feature  of  increasing  the  effective  SNR 
of  the  speaker  in  an  enclosed  environment  is  also  useful  to 
the  automatic  speech/speaker  recognition,  the  automobile 
hand-free  telephone,  and  the  teleconferencing  problems.  An 
array  of  sensors  can  be  used  to  spatially  and  temporally 
sample  an  incoming  field  in  order  to  synthesize  an  opti¬ 
mum  beam  pattern.  This  technique  has  been  used  exten- 
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sively  and  successfully  in  many  aerospace/avionic  direction 
finding  and  jammer/interference  rejection  applications  [l] 
[2]  [3].  These  problems  are  modeled  by  microwave  narrow¬ 
band  signals  propagating  in  free-space.  In  contrast,  the 
speech  waveform  must  be  modeled  as  a  wideband  signal, 
considerable  amount  of  reverberation  may  exist  in  a  typi¬ 
cal  indoor  environment,  and  near-held  effects  must  be  con¬ 
sidered.  Clearly,  microphone  array  processing  based  on  a. 
narrowband  signal  model  is  inadequate.  Similarly,  a  di¬ 
rect  application  of  the  simple  LMS  type  adaptive  algorithm, 
known  to  be  quite  successful  for  microwave  free-space  beam- 
formation  applications,  is  also  known  to  be  inadequate  for 
correlated  reflections  in  reverberant  room  scenarios. 

In  this  paper,  vve  consider  and  compare  various  wide¬ 
band  signal  model  optimization  techniques  and  associated 
performance  results  for  steerable  fixed  beam  microphone 
array  processing.  In  Section  2,  we  hrst  review  and  compare 
various  conventional  broadband  and  narrowband  array  op¬ 
timization  techniques.  These  include  the  minimization  of 
the  output  array  power  subject  to  desired  signal  distortion 
constraint;  the  maximization  of  the  array  gain  subject  to 
white  noise  gain  and  linear  constraints;  and  maximum  en¬ 
ergy  criterion  subject  to  norm,  linear,  and  quadratic  distor¬ 
tion  constraints.  In  Section  3,  we  present  our  new  results 
on  maximum  energy  criterion  broadband  array  optimiza¬ 
tion  formulated  for  sub-band  processing.  We  consider  the 
uniform  sub-band  as  well  as  the  quadrature  mirror  Alter 
sub-band  approaches.  In  Section  4  we  present  the  feasi¬ 
bility  of  using  coherent  signal  subspace  method  to  perform 
DOA  estimation  for  the  proposed  hearing  aid  application. 
In  Section  5,  we  present  various  simulation  results  under  the 
maximum  energy  criterion  and  DOA  estimation  for  both 
broadband  and  sub-band  approaches  for  both  free-space 
and  reverberant  conditions.  A  short  conclusion  is  given  in 
Section  6. 

2.  ARRAY  OPTIMIZATION 

In  this  section  we  review  several  known  array  optimization 
techniques.  Consider  an  array  with  R  sensors,  each  fol¬ 
lowed  by  a  FIR  filter  with  L  taps.  F  is  the  (unnormal¬ 
ized)  frequency,  F  £  [— 1/(2T),  1/(2T))  Hz,  for  a  sampling 
frequency  of  1/T  Hz,  while  /  =  FT  £  [—1/2.,  1/2)  is  the 
normalized  frequency.  The  range  of  look-directions  'tp  is  in 
the  half-hemisphere  of  [~7r/2, 7r/2).  The  desired  signal  an¬ 
gle  and  frequency  region  are  given  respectively  by  A^o  and 
A/o,  centered  around  '0o  and  /o.  Given  a  linear  array  of  R 
sensors  placed  at  locations  d*,  x  =  0, . . . ,  A  —  1,  with  a  pla- 
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nar  wave  2;(t)  impinging  at  angle  relative  to  the  normal 
of  the  array,  the  signal  received  by  the  »th  sensor  is  de¬ 
noted  by  a:r(t)  =  —  /ir),  where  ho  =  r?oT,  hr  =  r^rT  = 

ho  4-  (dr/c)  sin  V')  =  1, . . . ,  /2  —  1.  The  array  response  func¬ 

tion  can  then  be  written  as 

R-l L-l 

w^(/,^)  =  EE 

r=0  ^=0 

where  the  vectors  w  and  E(/,  are  defined  as  follows 
w  =  (w^oo, . . . ,  ruo,L-i,  t^io, . . . ,  tui,L-i, . . . , 

\H 

WR-i^L-l)  , 

E(/,  ^  ^  ^  ^  g-j2;r/(„0+L-l)  ^  ^  ^  ^  ^ 

g-j27r/(»7i+/^-l)^  ^  ^  ^  ^  g-i27r/(T7H-i  +I.-1)^T 

As  we  see  in  the  following,  for  most  array  optimization  cri¬ 
teria,  we  need  to  compute  integrals  of  the  form 


By  using  the  Lagrange  multiplier  technique,  the  so¬ 
lution  is  given  by  w  =  Xo(Rx  +  AoQ)^^Qwd,  where 
Ao  is  the  root  of  the  equation 

Wrf i2x(f2x  +  AoQ)  ^Q(i2x  +  AoQ)  ^iixWd  =  e, 

and  can  be  solved  by  iterative  techniques.  It  is  also 
possible  to  impose  linear  constraints  in  addition  to 
the  above  constraints[l]. 

•  Maximize  the  array  gain  subject  to  constraining  the 
norm  of  the  weights  and  the  linear  constraints: 

W^Aw  .  H  H 

max  — IT— - ,  with  w  w  =  ^  and  Z)  w  =  f. 

w  W^ifxW 

For  a  narrowband  array  of  frequency  fo  with  a  single 
look-direction,  A  =  dd^  where  d  =  E(/o,  ^o),  one  of 
the  linear  constraint  is  to  ensure  unity  look  direction 
w^d  =  1.  Then  the  optimum  weight  is  given  by: 


J(A/,AV-)=  /  /  \E{f,^)fdfdil;. 

2.1.  Review  of  array  optimization  techniques 

The  array  response  can  be  optimized  to  achieve  several  ob¬ 
jectives: 

•  Maximize  the  output  power  over  the  desired  look  re¬ 
gion  (A/o,  AV>o)  given  by  w^Aw,  where  A  =:J(A/o,  A^o). 

•  Minimize  the  response  to  the  noise.  For  fixed  ar¬ 

ray  design,  if  the  noise  is  assumed  to  be  arriving 
from  some  noise  region  (A/„,A'0„),  the  noise  out¬ 
put  power  can  be  quantified  as  w^5w,  where  B  = 
J(A/n,  AV>n).  For  adaptive  array  design,  the  quan¬ 
tity  considered  is  where  Rx  is  the  data  cor¬ 

relation  matrix. 


w  {R,  + 

where  A©  is  again  the  Lagrange  multiplier.  This  yields 
the  Minimum  Variance  Distortionless  Response  solu¬ 
tion. 

•  Another  class  of  formulation  approximates  the  quadratic 
constraints  by  linear  constraints.  For  instance,  the 
constrciint  to  keep  w^Qw  low  can  be  achieved  by 
taking  the  eigen-decomposition  of  the  matrix  Q\ 

Q = UqAqU^, 

where  Uq  =  [u^, . . . ,  is  unitary  and 

Aq  =  diag(A«, . . . ,  A«  J,  A?  >  A«  >  •  ■  ■  >  A«^  >  0. 

The  constraint  w^Qw  <  e  corresponds  to 


•  Minimize  the  white  noise  gain,  which  can  be  achieved 
by  controlling  the  quantity  w^w. 

•  For  broadband  arrays,  minimize  the  distortion  of  the 
desired  signal.  This  distortion  can  be  quantified  by 
(w  —  Wd)^<5(w  —  Wd),  where  wj  is  the  weight  vector 
that  most  closely  satisfy  the  desired  look-region  re¬ 
sponse.  If  the  distortion  is  measured  over  the  whole 
look  region,  then  Q  —  A.  In  the  next  section,  the 
distortion  will  also  be  quantified  as  w^Cw. 

•  Satisfy  some  linear  constraints,  which  can  be  expressed 
as  jD^w  =  f .  The  constraints  can  be  placed  on  the 
the  array  response  function’s  magnitude,  its  spatial 
derivative  or  its  frequency  derivative.  Linear  con¬ 
straints  are  useful  in  cancelling  narrowband  inter- 
ferers  or  to  approximate  any  of  the  above  quadratic 
forms. 

Among  the  many  formulations  that  have  been  considered 
are: 


RL 

w)  =  ^  |w"uf  |^A9  <  e. 

t=l 

By  imposing  w^xif  =0,  i  =  1, , , . ,  Ae,  where  Ne  < 
RL,  we  achieve  w^Qw  <  A^^  ,+iiiwir- 


2.2.  Maximum  Energy  Array 

The  Maximum  Energy  Array  (ME)  criterion  is  as  follows: [6] 

w^(A  -  aC)w  H  .  . 

max  - — - 7T—,  subj.  to  w  w  =  1  and  D  w  =  0, 

w  w«(i? -h<T7)w  ’ 


where  a  is  the  distortion  controling  parameter  and  a  is  the 
white  noise  controling  parameter.  Equivalently,  the  norm 
constraint  can  be  replaced  by  any  quadratic  constraint  of 
the  form  =  e,  just  by  scaling  the  weight  vector.  We 

note  the  following  features  of  this  approach: 


•  Minimize  the  output  power  subject  to  constrained 
desired  signal  distortion  [3]: 

minw^RxW,  with  (w  —  Wd)^Q(w  —  Wd)  =  c. 


•  If  the  direction  of  interferer  is  unknown,  then  B  is 
replaced  by  the  matrix  of  isotropic  noise  field.  For  the 
adaptive  case,  B  is  replaced  by  the  data  correlation 
matrix  Rx. 
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The  distortion  is  quantified  by 


w"Cw 


=  /  / 

JAllfn  J  A. 


|w«(E(/,V')-E(/o,  Wfd/  dV-. 


By  choosing  the  appropriate  value  of  a,  the  distortion 
can  be  controlled. 

•  The  white  noise  gain,  w^w/w^dd^w,  can  be  con> 
trolled  by  selecting  the  value  rr. 

•  The  linear  constraints  are  used  to  impose  zero  con¬ 
straints  only.  We  can  eliminate  the  linear  constraints, 
by  using  the  subspace  orthogonal  to  the  subspace 
spanned  by  the  columns  of  D,  If  Pc  is  the  {RL)  x  Nc 
matrix  whose  columns  span  the  orthogonal  subspace, 
the  above  maximization  problem  becomes 


w“(Pg(A-aC)Pc7)w 
‘w*  wH(pH(5  +  (r/)Pc)w 


,  subj.  to  w  w  =  1, 


where  w  has  Nc  components,  and  w  Pc^> 

The  optimum  weight  can  be  found  as  the  generalized  eigen¬ 
vector  corresponding  to  the  largest  eigenvalue  of  the  prob- 
lem  P§{A  -  aC)Pcw  =  AP^(P  +  (xI)Pc'W.  Ut  P§(B  + 
aI)Pc  =  LL^ ,  where  L  is  a  lower  triangular  matrix.  Then 
the  corresponding  symmetrical  eigenvalue  problem  is 

Aw  =  L^^P§{A-aC)PcL^^^. 

Then  w  =  and  the  weight  vector  is  given  by  w  = 

PcL^^w. 

If  the  matrix  {A  —  aC)  is  positive  definite,  then  the 
largest  eigenvalue  and  the  corresponding  eigenvector  can 
be  found  by  using  the  power  method.  Otherwise,  the  Si¬ 
multaneous  Itera,tive  Method  [2]  must  be  used. 


jb  =  -  1,.  Each  of  these  signals  is  downsampled 

by  an  integral  factor  Ki  <  K  and  passes  through  an  Lu- 
tap  FIR  filter  with  coefficients  Wrtk^  ^  =  0, . . . ,  L  -  1  and 


transfer  function  Wrk{f)  = 


All 


the  weighted  signals  at  subband  k  are  then  added  together, 
upsampled  by  a  factor  of  Ki,  and  filtered  by  the  output 
subband  filter,  Gk{f)^  Finally,  at  the  output  of  subbaud 
we  have  the  signal 

rssO  m=0 

In  the  desirable  case  where  Gkif)Gk{f)  =  1,  -f 

m/Ki)  =  0,  for  /  €  and  m  =  1, . . . ,  A'l  —  1,  then  no 
alicLsed  repliczLS  of  the  signal  are  present  at  the  output,  and 
yjt(/,  0)  can  be  written  simply  as 

H-l 

r=0 

for  /  6  Note  that  due  to  the  nonzero  roll-off  of  the  fil¬ 
ters  Gk{f)  and  Gk{f)y  the  output  Yk{fyi>)  contains  contri¬ 
butions  from  frequencies  outside  the  band  The  transfer 
function  relative  to  subband  $/c  is  given  by 


_  Ykif)  _  1 


,  R-lLfc-l 


For  a  uniform  linear  array  where  rjr  =  rd  sin  ^/(cT), 


E  E  «'’■ 


— ^27rr/dsin  V'/CcT)  —  j27r/v  j  if 
rAcC  e  , 


3.  SUB-BAND  OPTIMIZATION  TECHNIQUES 

So  far  we  have  dealt  with  the  broadband  optimization  prob¬ 
lem  over  the  entire  speech  frequency  range.  Now  consider 
splitting  the  band  into  a  number  of  adjacent  subbands. 
The  goal  is  to  split  the  general  problem  into  a  number  of 
subproblems,  so  that  they  will  be  less  computationally  de¬ 
manding,  Moreover,  the  task  of  locating  the  desired  source 
and/or  discrete  interferers  is  much  less  challenging  when 
the  signals  are  narrowband.  We  shall  show  that  the  op¬ 
timization  cannot  be  reduced  to  a  number  of  Independent 
problems,  since  some  global  constraints  have  to  be  imposed 
in  order  to  minimize  signal  distortion  for  enhancing  intelli¬ 
gibility. 


for  /  €  ^k-  Note  that  the  choice  of  A'l  <  K  has  the  effect  of 
relaxing  the  requirements  on  the  input  and  output  subband 
filters,  Gk{f)  and  Gk(f)y  respectively. 

For  each  of  the  K  subbands,  we  can  define  matrices  Ak 
and  Bk  as  follows: 

A,  =  f  I  d/dV-, 

J ^i^nAJo  «/ At/>o 

B,  =  f  f  E(/,V-)E(/,  d-r  d/dV-, 

J[-7r/2,7r/2) 

for  fc  =  0, . . . ,  if  -  1.  Now,  let 

So  =  Uk{^k  :  n  A/o  ^  0},  )Co  =  {k:<^kC  Bo]. 


3.1.  Uniform  subbands 

Consider  Fig.  1,  which  displays  a  general  scheme  for  sub¬ 
band  processing  of  sensor  arrays.  Let  us  assume  the  K 
subbands  are  of  uniform  width,  and  define  the  kth  sub¬ 
band  3s^k  =  +  1)/{2K), -k/{2K))  U  [k/{2K),  {k 

1)/(2A’)),  ib  =  0, . . . ,  A^  -  1.  The  signal  at  the  output  of 
the  rth  sensor,  r  =  0, . . . ,  A  —  1,  is  Xr{nT)  =  xo{nT -|-  T/rT), 
T/r  =  dr  sin  V^/(cT),  and  its  Fourier  transform  is  denoted  by 
Xr(f)  =  /  €  [-1/2, 1/2),  This  signal  is 

subsequently  filtered  by  the  K  input  subband  filters  Gfc(/), 


In  other  words.  Bo  is  the  union  of  all  those  subbands  that 
intersect  the  frequency  band,  A/o,  of  interest,  while  /Co  is 
the  set  of  indices  corresponding  to  the  subbands  in  Bo.  Let 
P  be  defined  as 

_ J2k€ICo  _ 

E*€/c„  +  YlkiiCo  ■ 
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A  first  observation  is  that,  in  general,  local  optimization  on 
each  subband  separately  does  not  achieve  the  maximum  /?. 
Moreover,  it  is  clear  from  the  above  expression  for  that 
when  there  are  no  non>zero  constraints  outside  5o,  then  we 
can  limit  ourselves  to  the  subbands  that  do  overlap  with 
A/o,  and  put  Wk  =  0,  k  ^  /Co.  In  this  case,  the  problem 
becomes 

max^  °  - 

subj.  to  I  <  e,  k  =  0....,K-L 


Alternatively,  the  problem  can  be  posed  as 


max 


w^Aw 


=f, 

w^Cw  <  c, 


where  now  the  matrices  A,  B,  and  C  are  block  diagonal, 
with  kill  block  equal  to  Aky  Bky  and  Ck,  respectively,  and 
w  is  given  by  the  stack  of  the  |X:o|  vectors  v/k,  k  G  /Co,  This 
formulation  allows  us  to  obtain  an  interesting  result  which 
warns  us  against  a  direct  maximization  of  p,  which  does  not 
include  global  constraints  of  no  distortion.  Assume  that  we 
want  to  solve  the  maximization  problem,  where  now  we  im¬ 
pose  all  the  constraints  locally  on  all  the  subbands.  Follow¬ 
ing  the  general  technique  described  earlier,  we  project  all 
the  matrices  Ak  and  Bk  onto  a  subspace  orthogonal  to  the 
constraint  space.  But  we  apply  the  projections  separately, 
for  each  subband.  In  other  words,  we  have  transformed  the 
matrices  Ak  and  5^,  =  0, . . . ,  K  ~  1,  into  Ak  =  PfAuPk 

and  Bk  =  Pf  BkPky  respectively,  where  Pk  is  the  matrix 
whose  columns  are  a  basis  for  the  subspace  orthogonal  to 
the  constraint  space  for  that  subband.  Then  our  maximiza¬ 
tion  problem  becomes  the  unconstrained  problem 


max 


EteiCo 


As  we  discussed  earlier,  this  is  equivalent  to  finding  the 
largest  generalized  eigenvalue  and  corresponding  eigenvec¬ 
tor  for  the  matrix  pencil  A  —  \B,  where 
A  =diag(Ao, . . . ,  A/c-i),  5  —  diag(5o,  •  • . ,  Bk-i).  Also; 

let  Aniax(Ajt,  Bk)  be  the  largest  generalized  eigenvalue  of  the 
pencil  Ak  ~  XBk^  The  surprising  result  is  that 


Amax(A,5)  =  max  Xm^xiAkyBk)  =  Amax(A^,.S^), 


and  therefore  the  corresponding  generalized  eigenvector  cor¬ 
responds  to  having  all  the  weights  equal  to  zero  except  for 
subband  k.  The  maximum  value  for  p  is  clearly  obtained 
by  picking  the  subband  with  the  highest  “average  SNR”, 
^k  =  Amax  {AkyBk)y  and  putting  to  zero  the  weights  in  all 
the  other  subbands.  Indeed,  if  we  had  performed  local  op¬ 
timization  on  each  subband,  we  w'ould  have  obtained  gen¬ 
erally  nonzero  weights  everywhere,  and  a  /?avg  given  by 

o  Ylk=0  ykXma,xiAky  Bk) 


which  is  smaller  than  Arriax  (A,  .0),  for  any  set  of  weights 
{yk}kJo^’  The  danger  of  having  only  one  “active”  subband 


is  that  the  distortion  may  be  intolerable.  The  solution  to 
this  problem  clearly  is  to  extend  no-distortion  (or  flatness) 
constraints  to  the  set  of  subbands  of  interest. 

3.2.  Using  quadrature  mirror  filters 

Quadrature  mirror  filters  (QMFs)  have  encountered  a  wide 
range  of  image  and  speech  applications  due  to  the  remark¬ 
able  properties  and  relative  ease  of  implementation  of  the 
multiresolution  analysis  capability.  [7].  In  this  section,  our 
aim  is  to  explore  the  possibility  of  using  QMF  pairs  for 
subband  array  processing. 

Let  //o(/)  and  Ho{f)  be  the  decompositioii/reconstr action 
low  pass  Alter  pair,  and  Hi(f)  and  Hi{f)  be  the  correspond¬ 
ing  high  pass  filter  pair.  VVe  assume  that  the  conditions  for 
perfect  reconstruction  are  satisAed.  In  particular,  we  as¬ 
sume  that 

=  2, 

|i/o(Mo(/+l/2)  +  //i(/)^i(/-|-l/2)|  =  U. 

Consider  a  one-level  decomposition.  At  the  output  of  the 
Alter  H k{f )y  k  0,1,  WQ  have  signals 

x.k(f)  =  Hk(f)Xr{f),  r  =  0,...,R-l. 

Analogously  to  the  uniform  sub  band  approach,  these  signals 
are  subsequently  downsampled  by  2  and  passed  through  Lk- 
tap  FIR  Alters,  with  coefficients  Wriky  i  =  0, . . . ,  Lk-l,  k  = 

0, 1,  and  transfer  function  Vt^rAc(/).  The  weighted  signals 
pertaining  to  the  same  subband,  k  =  0  or  1,  are  added 
together  and  then  upsampled  by  2.  After  filtering  by  Hk{f)y 
k  =  0,1,  we  have  that  the  output  of  subband  k  is  given  by 

R-1 

^  Wr,{2f)  +  Mf  +  1/2) A%(/  +  1/2)) 

r=0 

The  output  of  the  arra}'  is  given  by 
R-l 

i  ^  [Wro(2f)  + 

r  =  0 

+  w.^{2f) (tfitfjA-;  +  '(i)x;(i))]  ^ 

i,)  +  H{H[  wi(f,  ijj))  X'o 

where 

=  X:.  =  Xr{f),  X;,  =  Xo(f), 

R-l 

r~0 

R-l 
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^o"(/,^)  =  P^o'(/+  lAV'),  1/2,^). 

In  only  two  cases  aliasing  is  completely  rejected.  Either 
the  filters  Hoif)  and  Hi(f)  are  ideal  half-band  filters,  or 
^o^(/iV^)  =  Neither  situation  is  desirable,  un¬ 
less  the  condition  Wq  ^  is  imposed  for  one 

specific  look  direction  ip.  With  ip  =  0,  this  condition  trans¬ 
lates  into 

H-l  R-l  K~1  H-l 

Wr£Q  —  ^  ^  • 

r=0  r=:0  r=:0  r=0 

The  above  implies  that  when  the  filter  pairs  {Ho{f),  //o(/)) 
and  Hi{f))  satisfy  the  perfect  reconstruction  con¬ 

dition,  but  are  not  ideal  low  (high)  pass  filters,  it  is  still 
possible  to  achieve  no  aliasing  for  a  given  look  direction. 
At  -ip  =  0,  the  array  behaves  as  a  perfectly  linear  filter, 
with  a  transfer  function 

mf,  0)  =  =  Wiif,  0)  =  i  2  WM2f). 

Note  that  in  this  situation  Wro{2f)  hcis  the  period  of  1/2 
and  therefore  it  has  identical  behavior  on  /  G  [0,1/2)  and 
/  G  [1/2,1),  thus  limiting  our  ability  to  shape  the  spectrum 
of  Y(fyip).  This  set  of  weights  can  be  appropriately  chosen 
to  impose  flatness  on  all  the  frequency  range  for  the  given 
look  direction. 

Consider  now  a  two-level  decomposition.  The  lowpass 
branch  splits  into  two  branches,  respectively  characterized 
by  weight  transfer  functions  Wroo{f)  and  Wroi{f).  Note 
that  we  have  slightly  changed  our  notation,  since  now  we 
identify  the  subbands  with  binary  subscripts.  For  two-level 
decompositions,  subscripts  1,  01,  and  00,  correspond  to  k  = 
0,  A;  =  1,  and  A:  =  2,  respectively.  The  global  output  is 
^(/>^)  =  ^(/j  V')  +  Yi{f^ip).  As  before,  we  have 

R~1 

Yx(fA)  = 

r=0 

♦  [Hi{f)Xr{f)  +  Hoif+  imXrif  +  1/2)]  . 
For  Io(/,  V’)j  3.n  analogous  expression  holds, 

R-i 

r=:0 

+  [^o(/)^r(/)  +  .ffo(/  +  l/2)X.(/  + 1/2)1  , 
where  now  Wro(f)  —  Wroo’{,2f)Ao{f)  +  Wroi{2f)Ai{f),  and 

Aoif)  =  ^Hoif)[Mf)  +  6o{f+m]. 

Mf)  =  \Hx{f)[Mf)  +  6iif  +  m]- 

It  follows  that  the  above  equations  are  still  valid.  In  par¬ 
ticular,  if  one  imposes  the  previous  condition  with  equal 
weight  transfer  function,  one  finds  that 

R-l  R-l  R-l  R-l 

Y,  =  Y  ^Mf),  Y  =  E  ^r00i2f). 

r=0  r=:0  r  =  0  r=0 


All  of  the  previous  equations  can  be  directly  extended  to 
the  case  of  multiple  level  decompositions,  in  an  iterative 
manner. 

Together  with  the  no-aliasing  conditions,  the  weights  in 
each  subband  have  to  satisfy  a  set  of  additional  conditions. 
When  a  discrete  interferer  at  angle  direction  ipj  has  to  be 
cancelled,  a  set  of  two  constraints  per  subband  hcis  to  be 
generated,  each  set  cancelling  both  the  interferer  itself  and 
its  aliased  image.  For  the  case  of  a  two-level  decomposition, 
the  following  constraints  must  be  satisfied 

R-l 

=  0,  6  =  1,01,00, 

r=0 

with  fii  =  2,  ^00  ”  Moi  =  4.  A  condition  that  ensures 
flatness  of  the  frequency  response  for  a  given  look  direction 
has  to  be  added.  Then  the  constraint  must  be  imposed  on 

W{f,0)  =j:r:o  Wro(2f). 

The  difficulty  in  dealing  with  the  QMF-based  subband 
approach  is  that  the  overall  output  is  not  a  linear  function 
of  the  input,  due  to  the  operations  of  down  and  upsam¬ 
pling.  In  the  case  where  the  decomposition/reconstruction 
filters  are  not  ide2d  half-band  filters,  and  the  weights  on 
each  branch  are  different,  the  output  is  always  affected 
by  aliased  versions  of  the  signal.  A  possible  strategy  for 
achieving  maximum  energy  concentration  in  the  range  of 
look  directions  of  interest  is  to  maximize  the  energy  con¬ 
tent  in  the  angle  range  of  the  non-aliased  component 

of  the  signals  at  the  output  of  the  weighting  filters,  Wrb(f) 
(b  =  00,  01, 1  for  the  two-level  decomposition  case),  lor  one 
or  (preferably)  more  frequencies  within  A/q. 

4.  DOA  ESTIMATION  BY  SUBBAND 
APPROACH 

There  is  much  recent  interest  in  DOA  estimation  for  braod- 
band  signals  [9].  It  is  known  that  subspace  DOA  techniques 
such  as  MUSIC  work  efficiently  when  the  signals  are  narrow- 
band.  The  narrowband  assumption  is  based  on  the  product 
of  the  bandwidth  B\  and  the  propagation  time  through  the 
array  T,  satisfying  BT  <C  1.  The  traditional  narrowband 
approach  is  based  on  the  particular  structure  of  the  correla¬ 
tion  matrix  of  the  signals  output  from  the  R  antennas.  If 
is  the  Rxl  vector  of  outputs  at  time  t,  then  the  correlation 
matrix,  Tlx^  is  given  by 

Tl^  =  E{x,xf }  =  ATlsA^ 

where  A  is  the  Rx  D  steering  matrix,  Rs  is  the  D  x  D  cor- 
relation  matrix  of  the  D  impinging  signals,  and  the  noise 
is  assumed  white  Gaussian  with  variance  cr^ .  If  ^  is  full 
rank  and  the  SNR  is  sufficiently  high,  then  Itx  has  numeri¬ 
cal  rank  equal  to  £>,  the  number  of  impinging  signals.  The 
estimated  DOA  are  then  obtained  by  computing  the  noise 
subspace  of  72.x,  deriving  the  steering  vectors  orthogonal  to 
it,  and  exploiting  the  one-to-one  relationship  between  the 
steering  vector  and  DOA  parameters.  In  practical  applica¬ 
tions,  the  correlation  matrix  is  estimated  from  the  sample 
correlation  matrix 

^  X,xf, 
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where  Xt  is  the  R  x  N  matrix  defined  as  Xt  =  (xt-jv+i, 
Xt-JV+2,  •  •  •  >  Xt). 

The  above  derivation  is  highly  dependent  upon  the  nar¬ 
rowband  assumption.  In  fact,  if  the  signals  are  not  nar¬ 
rowband,  the  matrix  Rx  is  always  full-rank.  Among  the 
proposed  approaches,  is  the  so-called  coherent  signal  sub¬ 
space  method  (CSSM),  and  its  variants.  In  this  method, 
the  broadband  signals  are  first  decomposed  into  a  number 
of  narrowband  components,  via  pre-filtering,  subsequently 
all  the  components  are  focused  onto  a  predefined  center  fre¬ 
quency,  and  then  a  MUSIG-like  algorithm  is  applied  to  the 
combined  focused  correlation  matrices.  Techniques  such  as 
CSSM  are  characterized  by  a  rather  high  computational 
complexity.  In  the  application  of  interest  for  a  real-time 
DSP-based  microphone  array,  it  is  necessary  to  keep  the 
complexity  to  a  minimum,  without  giving  up  the  proper¬ 
ties  of  the  narrowband  MUSIC  algorithm.  The  technique 
used  here  is  therefore  summarized  as  follows:  1)  the  arriving 
broadband  signals  are  filtered  and  decomposed  into  narrow- 
band  components;  2)  the  narrowband  MUSIC  algorithm  is 
independently  applied  to  each  subband;  3)  the  number  of 
signals  and  corresponding  DOA’s  are  estimated  by  using 
those  highest  subbands  with  sufficient  energies  capable  of 
yielding  the  desired  fine  spatial  resolutions. 

5.  PERFORMANCE  —  SIMULATION 
RESULTS 

In  this  section,  we  present  several  simulation  results  for  ME 
array  designs  as  well  as  DOA  estimations  based  on  subband 
MUSIC  approach  for  both  free-space  and  reverberant  con¬ 
ditions. 

5.1.  SNR  measurement 

Previous  work  in  speech  quality  measurement  [8]  hcis  shown 
the  objective  segmental  SNR  expression  tracks  well  with 
subjective  measurement  of  the  quality  of  the  coded  speech 
waveforms.  For  hearing  aid  applications,  we  use  a  slightly 
modified  expression  defined  as  the  averaged  A  SNR  and 
given  by 

M-l  M-1 

ASNE""*  =  ^  SNRrut  -  ^  ^  SNIC  (dB), 

m=0  m=0 

where  SNRin  =  20logio  |{  II2/II  II2  SNRout  = 

20logio  II  Vs  II2/II  Vi  II2  (dB),  and  M  is  the  number  of  short 
segments  of  the  speech. 

5.2.  Conventional  broadband  simulation 

Now,  we  investigate  the  relationship  between  the  energy 
concentration  index  and  various  linear  constraints  (mag¬ 
nitude,  frequency  derivative,  and  spatial  derivative)  im¬ 
posed  on  the  array.  The  simulation  program  generates  im¬ 
pulse  responses  from  the  sources  (speaker  and  interferer) 
to  each  microphone  of  the  array  using  cubic  interpolation 
to  implement  spatial  propagation  delays[4,  5].  A  typical 
medium  size  reverberant  room  of  size  5  x  5  x  2.48  meter 
is  modeled  to  have  equal  reflection  coefficients  oi  p  =  0.5 
for  all  six  sides.  Furthermore,  we  assume  the  desired  and 


the  single  interferer  sources  are  both  1.8  meters  from  the 
center  of  the  array  and  the  image  model  of  the  room  has 
perfect  reflection  of  order  three.  The  desired  speaker  is  at 
angle  ^0  =  0®,  the  interferer  is  at  angle  0/  =  50°,  and  the 
energy  concentration  region  is  given  by  A^o  =  ±  15°}, 

Table  1  shows  simulations  with  R  =  i  sensors  and  differ¬ 
ent  constraints  and  tap  values.  While  (3  only  measures  the 
energy  concentration  in  the  look-direction  with  respect  to 
isotropic  noises,  ASNR^'"®  incorporates  the  effect  of  the  in¬ 
terferer  on  the  array  and  thus  is  a  good  measurement  of  the 
improvement  factor  of  the  array.  As  expected,  the  reverber¬ 
ant  performances  are  inferior  compared  to  the  correspond¬ 
ing  free-space  cases  due  to  additional  interferers  generated 
by  the  reflections  from  the  sides  of  the  room. 

5.3.  Subband  simulation 

As  shown  in  Fig.  1,  uniform  sub-band  (USB)  array  par¬ 
titions  the  incoming  signals  into  K  sub-bands  by  using  a 
series  of  ,DCT-IDCT  band-pass  filter  banks.  Each  band 
is  then  processed  separately  through  an  ME  array  with 
weights  adjusted  such  that  there  is  equal  spectral  response 
at  the  mid-point  of  each  band  in  the  desired  look  direc¬ 
tion  (i.e.,  lU^fc(/mid)  ^o)|=const,  for  /:  =  0, . . . ,  A  —  1.  Both 
magnitude  and  derivative  constraints  for  rejecting  the  inter¬ 
ferer  at  in  the  Jkth  sub-band  are  set  at  two  normalized 
frequencies  fi  and  /2  defined  by 

fl  =  /itw  +  ^(/high  ”  /lov/).j  /2  =  /low  +  j(/high  ""  /low)? 

where  /low  =  fc/2A,  /higii  =  k  1/2A ,  /,uid  =  (/low  + 
/high)/2,  such  that  the  constraints  are  equally  distributed 
in  the  speech  band  of  interest  as  shown  in  Fig.  2a.  We 
also  find  that  the  USB  array  has  flatter  look-direction  re¬ 
sponse  and  also  deeper  null  which  results  in  better  overall 
rejection  than  the  conventional  ME  array  response  shown 
in  Fig.  2b.  In  Table  2,  both  the  conventional  ME  and  USB 
arrays  have  R  =  4  sensors,  L  =  2.0  taps,  with  the  same  array 
concentration  region,  speaker  and  interferer  angle  and  dis¬ 
tance  relationships.  An  USB  of  K  =  8  shows  an  ASNR'^  ^ 
improvement  as  compared  to  a  conventional  ME  array  in 
free-space  and  reverberant  conditions.  We  note  the  USB 
array  is  about  3  dB  better  in  free-space  and  about  2  dB 
better  in  reverberant  condition  than  the  corresponding  ME 
array.  Furthermore,  by  comparing  the  attenuation  of  the 
interferer  in  Fig.  2  (computed  from  a  far-field  model)  with 
Table  2  (obtained  from  simulation  with  a  near-field  model), 
we  see  the  incomplete  cancellations  in  Table  2  show  the 
sensitivity  of  the  interferer  rejection  due  to  the  violation  of 
the  plane  wave  assumption.  Better  rejection  can  also  be 
achieved  by  better  matching  the  zero-constraints  to  the  in¬ 
terferer  spectrum  through  narrower  and  deeper  nulls  within 
each  sub-band  through  a  frequency  decompositon  approach. 
Finally,  various  HINT  speech  files  [11]  calibrated  for  equal 
intelligibility  measurement  ^  have  been  processed  by  the 
ME  array  for  various  field  testings  of  human  subjects  with 
and  without  hearing  impairments  in  free-space  and  rever¬ 
berant  environments.  These  experiments  verify  the  the  ef¬ 
fectiveness  of  the  array  for  improving  speech  intelligibility. 

^Each  sentence’s  RMS  power  is  adjusted  depending  on  its 
difficultness 
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6.4.  Real  room  experiment 

Here  we  present  a  real  room  experiment  ^  to  evaluate  the 
potential  performance  degradation  of  performing  spatial  fil¬ 
tering  (beamforming)  due  to  real  room  environment  and 
microphone  array  hardware  implementation  [10].  The  sys¬ 
tem  of  digital  acoustic  array  is  illustrated  in  Fig.  3  where  a 
large  scale  uniformly  spaced  microphone  array  with  8  omni¬ 
directional  elements  is  mounted  on  the  center  of  one  side  of 
a  medium  reverberant  room.  The  room  size  is  3.5  x  4.7 
X  3  m^.  The  desired  and  interfering  sources  are  modeled 
by  two  speech  files  and  are  spatially  separated  by  50®  The 
cut  off  frequency  is  set  at  5  kHz  while  the  sampling  fre¬ 
quency  is  set  at  20  kHz.  The  microphone  spacing  is  16.5 
cm  and  was  choosen  to  perform  source  location  with  high 
resolution.  However  this  distance  results  in  spacial  aliasing 
in  the  ME  criterion  for  frequency  band  higher  than  2  kHz. 
The  near-held  model  has  been  incorporated  to  simulate  the 
ideal  case.  The  same  set  of  weights  is  used  in  both  ideal 
and  real-room  cases.  Fig.  4a  shows  the  waveform  contain¬ 
ing  both  sources  at  the  input  of  the  array,  while  Fig.  4b 
shows  the  processed  waveform  at  the  output  of  the  array. 
The  improved  SNR’s  are  21.32  dB  and  5.7  dB  in  ideal  case 
and  in  real-room  case  respectively.  Factors  causing  this  dif¬ 
ference  are  due  to  background  noise,  imperfections  of  array 
implementation,  and  room  reverberations  in  the  real-room 
case. 

5.5.  DOA  estimation  by  CSSM 

Lastly,  we  consider  a  narrowband  MUSIC  algorithm  for  per¬ 
forming  DOA  estimation  of  bandpassed  broadband  speech 
signals.  The  simulation  results  of  a  eight  element  array  per¬ 
forming  CSSM  in  free  space  and  reverberant  room  of  size 
5  X  5  X.  2.44  m^.  As  shown  in  Fig.  5a,  the  true  speaker 
DOA  denoted  by  o  at  —60®  and  the  true  interferer  DOA 
denoted  by  ★  at  -1-20®  are  well  detected  with  high  resolu¬ 
tions  and  low  biasness  by  the  proposed  a^lgorithm  in  free 
space.  In  the  reverberant  room  case  as  shown  in  Fig. 5b,  we 
observe  the  sources  are  still  detected  with  lower  resolutions 
and  greater  biasness.  This  technique  requires  the  number 
of  sources  for  DOA  estimation  to  be  less  than  the  number 
of  array  sensors.  For  spatial  aliasing-free  requirement,  ar¬ 
rays  with  low  sensor  spacing  become  inefficient  for  DOA 
estimation  base  on  time-delay  correlation  techniques.  On 
the  other  band,  MUSIC  algorithm  can  be  extended  by  using 
root-MUSIC  and  near-held  MUSIC  techniques  for  enhanced 
performance. 

6.  CONCLUSIONS 

In  this  paper  we  first  reviewed  and  compared  the  optimiza¬ 
tion  of  various  fixed  beam  broadband  arrays.  Then  we  con¬ 
sidered  in  some  detail  a  steerable  fixed  array  optimized  un¬ 
der  the  maximum  energy  criterion.  The  purpose  of  this 
array  is  to  copcentrate  the  main  lobe  toward  the  desired 
signal,  while  rejecting  interferences  impinging  from  differ¬ 
ent  directions.  The  resulting  maximization  turns  out  to 
be  equivalent  to  a  quadratically  constrained  maximization 

^We  appreciate  the  real-time  measured  data  supplied  by  M. 
S.  Braiidstein  of  Brown  University. 


problem.  Additional  constraints  are  added  to  further  im¬ 
prove  the  overall  performance  of  the  array.  We  have  shown 
that  this  optimization  procedure  provides  encouraging  re¬ 
sults  for  the  design  of  a  microphone  array  for  hearing  aid 
application.  Several  simulation  results  are  presented  and 
show  the  usefulness  of  this  scheme  in  free-space  and  rever¬ 
berant  environments.  We  have  also  explored  splitting  the 
frequency  band  of  interest  into  a  number  of  sub- bands,  in 
particular,  we  have  used  the  uniformly  spaced  and  QMF 
sub-band  techniques  for  improved  array  processing.  Pre¬ 
liminary  results  indicate  advantage  in  using  the  sub-band 
MUSIC  technique  for  source  locations.  Currently,  the  above 
considered  algorithms  are  under  implementation  with  HEI 
for  hearing  aid  pre-processing.  Our  future  goals  are  to  im¬ 
prove  existing  techniques  an  to  find  new'  challenging  appli¬ 
cations. 
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ABSTRACT 

Convolutional  codes  with  a  distance  distribution  close 
to  that  obtained  in  the  average  by  random  coding  are 
necessarily  recursive.  We  introduce  pseudo-random  re¬ 
cursive  (PRR)  codes  with  a  weight  distribution  which 
fits  well  that  of  random  coding,  at  least  in  its  central 
part.  Decoding  these  codes  is  simple  only  if  their  free 
distance  is  low  and  thus,  the  error-event  high.  Howe¬ 
ver,  combining  such  codes  according  to  the  turbo-code 
scheme  may  be  used  to  decrease  the  bit  error  rate  to 
meet  the  user’s  specification. 

1.  INTRODUCTION 

We  recently  criticized  the  use  of  the  minimum  distance 
criterion  in  the  search  for  good  long  codes.  As  an 
expected  better  criterion,  we  proposed  a  similarity  mea¬ 
sure  of  the  distance  distribution  of  a  code  with  respect 
to  that  obtained  in  the  average  by  random  coding  [1]. 
For  the  sake  of  brevity,  we  shall  refer  to  it  as  “pseudo¬ 
random  criterion”.  Some  block  codes  satisfy  this  cri¬ 
terion,  especially  the  iterated  product  of  single-parity- 
check  codes  [1,  2]  and  the  Reed-Solomon  codes. 

The  main  objection  against  applying  the  minimum 
distance  criterion  to  long  codes  is  that  no  account  is 
taken  of  the  proportion  of  word  pairs  at  the  minimum 
distance.  If  this  proportion  is  very  small,  then  an  er¬ 
ror  in  favour  of  a  word  at  this  distance  of  the  trans¬ 
mitted  word  is  unlikely.  The  minimum  distance  cri¬ 
terion  appears  as  exceedingly  pessimistic  since  it  is 
based  on  the  worst  Case,  whose  occurrence  may  be 
very  infrequent.  The  examples  of  block  codes  mee¬ 
ting  the  pseudo-random  criterion,  especially  the  itera¬ 
ted  product  of  single-parity-check  codes,  actually  show 
that  the  proportion  of  codewords  of  minimum  weight  is 
very  tiny  [2].  Transposing  such  results  to  convolutional 
codes  is  however  not  straightforward. 

Concerning  the  relevance  of  the  pseudo-random  cri¬ 
terion,  it  relies  on  the  remark  that  random  coding  is 
asymptotically  the  best  known  means  since  it  was  used 
by  Shannon  to  prove  the  channel  capacity  theorem. 
Since  the  performance  of  a  code  only  depends  on  the 


distances  between  its  words,  a  deterministic  code  with 
a  distance  distribution  close  to  that  of  random  coding 
has  similar  performance,  but  its  decoding  does  not  de¬ 
mand  exhaustive  search,  at  variance  with  actual  ran¬ 
dom  coding  [3]. 

A  convolutional  encoder  generates  coded  sequences 
of  arbitrarily  large  length,  conveniently  dealt  with  as 
infinite.  The  conventional  notations  thus  consist  of  for¬ 
mal  series  in  the  unit-delay  operator  D  which  represent 
sequences  with  an  origin  at  a  finite  time,  but  infinitely 
extending  in  the  direction  of  positive  indices.  Howe¬ 
ver,  the  performance  and  properties  of  conventional 
convolutional  codes  do  not  match  those  which  may  be 
expected  from  the  asymptotic  case  where  the  codeword 
length  tends  to  infinity.  Their  performance  is  mainly 
determined  by  the  free  distance  and  is  not  very  diffe¬ 
rent  from  that  of  a  block  code  having  this  free  distance 
as  its  minimum  distance.  On  the  contrary,  the  iurbo- 
codes  exhibit  properties  and  performance  in  accordance 
with  those  which  may  be  expected  from  a  code  whose 
length  approaches  infinity,  especially  near- capacity  per¬ 
formance  [4].  We  tried  to  explain  these  features  as  re¬ 
sulting  from  the  recursiviiy  of  turbo-codes,  such  that 
coded  sequences  of  infinite  length  and  weight  are  as¬ 
sociated  with  finite-weight  information  sequences,  at 
variance  with  non-recursive  codes  [5].  We  shall  now 
briefly  review  these  ideas  before  examining  the  specific 
problems  of  decoding. 

2.  PSEUDO-RANDOM  RECURSIVE 
CONVOLUTIONAL  CODES 

The  linearity  of  a  convolutional  code  implies  that  the 
set  of  distances  of  a  codeword  with  respect  to  all  others 
does  not  depend  on  this  word.  One  may  thus  assume 
without  loss  of  generality  that  the  all-0  word  was  trans¬ 
mitted  and  study  only  the  distribution  of  the  word 
weights,  for  the  sake  of  simplicity.  We  restrict  ourselves 
to  binary  rate- 1/2  convolutional  codes  but  extensions 
to  other  alphabets  and  rates  are  straightforward. 

The  set  of  sequences  generated  by  a  convolutio¬ 
nal  encoder  is  conveniently  described  by  a  transition 
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graph  where  any  branch  represents  a  possible  transi¬ 
tion  between  states  of  the  encoder,  defined  as  its  pos¬ 
sible  memory  contents.  Developing  this  graph  in  terms 
of  the  (discretized)  time  results  in  the  trellis  diagram. 
Each  branch  in  this  graph  may  be  labelled  by  a  mono¬ 
mial  I^L  of  the  indeterminates  D^I  and  L,  where 
w  indicates  the  weight  of  the  corresponding  output,  u 
the  information  bit.  The  indeterminate  L  means  that 
including  this  branch  in  a  path  increases  its  length  by 
one.  One  usually  considers  the  zero-to-zero  transition 
graph,  which  describes  the  set  of  paths  stemming  from, 
and  eventually  merging  again,  with  state  0.  Then,  the 
“transfer  function”  defined  as  the  formal  series 

T(D,I,L)  =  535;  (1) 

i  j  k 

is  such  that  the  coefficient  Uijk  of  the  general  term  is 
the  number  of  paths  of  weight  wi^  corresponding  to 
an  information  sequence  of  weight  uj  and  of  length  /* 
in  the  zero-to-zero  transition  graph.  One  also  consi¬ 
ders  the  one-indeterminate  transfer  function  T{D)  = 
T{D,I  =  =  1).  Upper  bounds  on  the  probability 

that  an  erroneous  path  is  chosen  by  the  optimal  deco¬ 
der,  as  well  as  on  the  symbol  error  probability,  can  be 
derived  from  T{D)  and  dT{D^I^L  =  l)/5/,  respecti¬ 
vely,  by  replacing  D  by  a  real  number  depending  on 
the  channel  (the  inverse  of  the  signal-to-noise  ratio  in 
the  case  of  additive  white  Gaussian  noise)  [6]. 

Forney  has  shown  that  recursive  and  non-recursive 
codes  are  equivalent  in  the  sense  that  the  transfer  func¬ 
tion  T{D)  is  the  same  for  codes  of  both  types  having 
the  same  parameters  [7].  The  free  distance,  in  parti¬ 
cular,  is  the  same  for  both.  This  result  has  generally 
been  interpreted  as  meaning  that  recursive  codes  have 
no  particular  advantage  with  respect  to  non-recursive 
ones.  Forney  thus  writes:  . .  An  fact  feedback  buys  no- 
thing.  We  refute  this  conclusion,  not  because  it  relies 
on  an  incorrect  result,  but  because  at  variance  with 
non-recursive  codes  the  transfer  function  of  the  zero- 
to-zero  transition  graph  fails  to  properly  represent  the 
set  of  paths  associated  with  a  recursive  convolutional 
code.  Paradoxically,  the  very  equivalence  with  respect 
to  the  transfer  function  T{D)  is  the  base  of  our  ar¬ 
gument  showing  that  recursive  codes  outperform  non- 
recursive  ones. 

When  coded  sequences  are  represented  by  polyno¬ 
mials  or  formal  series  of  the  indeterminate  D,  the  ent¬ 
ries  of  the  generator  matrix  of  a  non-recursive  code  are 
polynomials,  whereas  those  of  a  recursive  code  contain 
rational  fractions.  Let  us  for  instance  consider  the  sys¬ 
tematic  recursive  encoder  of  rate  1/2  described  by  the 
generator  matrix: 

G  =  [l  N(£>)/Pp)],  (2) 

where  P(D)  is  a  primitive  polynomial  of  degree  K  and 
N(D)  is  a  polynomial  of  degree  at  most  equal  to  K 


(the  constraint  length  of  this  code  is  thus  K  A  1)-  The 
encoder  of  such  a  code  constitutes,  in  the  absence  of 
an  input  information  sequence  but  assuming  the  en¬ 
coder  initial  state  to  be  different  from  0,  a  generator 
of  “maximum  length  sequences”,  often  referred  to  as 
pseudo-random^  so  we  shall  refer  to  codes  generated  by 
this  encoder  as  pseudo-random  recursive  (PRR). 

Now  the  sequence  of  redundancy  symbols  transmit¬ 
ted  in  response  to  an  information  sequence  represen¬ 
ted  by  the  polynomial  u(D)  is  tA(D)N(D)/P(I)).  Its 
development  is  a  formal  series  when  u{D)TS{D)/F{D) 
is  a  rational  fraction,  thus  for  any  u{D)  non-multiple 
of  P(D),  and  its  weight  is  then  infinite  (truncated  in 
practice  because  one  necessarily  uses  a  coded  sequence 
of  finite  length,  but  which  we  may  assume  to  be  ar¬ 
bitrarily  large);  merging  with  the  0-path  never  occurs 
in  this  case.  If,  on  the  contrary,  u{D)  is  a  multiple 
of  P(D),  this  sequence  is  represented  by  a  polynomial 
and  the  corresponding  path  converges  to  the  0-path. 
This  case  occurs  only  for  a  fraction  2  ^  of  all  polyno¬ 
mials  u{D)  of  degree  at  least  equal  to  K.  For  a  non- 
recursive  code,  all  entries  of  the  generator  matrix  are 
polynomials:  convergence  to  the  0-state  always  occurs. 

For  a  recursive  code  with  K  large  enough,  mer¬ 
ging  with  the  0-path  does  concern  only  a  small  frac¬ 
tion  among  all  the  paths,  the  only  ones  however  which 
are  taken  into  account  by  the  transfer  function  T{D). 
It  is  easily  shown  that  the  lengths  of  these  paths  are 
enumerated  by  the  formal  series  development  of 

LK+1 

which  only  depends  on  the  size  K  of  the  encoder  re¬ 
gister,  not  on  a  particular  code.  The  coefficient  an  in 
this  formal  series  is  equal  to  1  for  n  =  iiT  -f- 1 ,  equal  to 
2n-K-2  for  RT  -f  1  <  n  <  2(K  H-  1),  and  smaller  than 
2n~K-2  ^  larger  value  of  n. 

From  the  development  of  (3),  a  rough  upper  bound 
on  the  number  a„  of  paths  of  length  n  whose  weight 
is  enumerated  by  T{D)  is  2”“^”^.  The  paths  which 
have  diverged  from  the  0-path  at  the  initial  instant 
and  reach  the  node  level  -f  /  without  having  visited 
the  0  state  have  a  negligible  probability  of  error  since 
their  weight  increases  with  /.  The  source  of  possible 
errors  then  reduces  to  the  set  of  paths  which  merge 
again  with  the  0-path  after  they  have  diverged  from  it. 
The  number  of  such  paths  is  less  than  2^“^,  among  a 
total  number  of  2^'*"^  —  1  paths  different  from  the  0- 
path.  The  estimates  of  the  error  probability  based  on 
the  transfer  function  T{D)  ignore  the  other  sequences. 
In  order  to  actually  take  into  account  the  set  of  all 
encoded  sequences,  they  should  be  multiplied  by  the 
fraction  of  all  sequences  whose  weights  are  enumerated 
by  T(D),  a  number  less  than  2^”^/(2^'''^  —  1).  This 
bound  is  close  to  so  the  multiplying  factor  is 

very  small  if  K  is  large. 
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3.  DECODING  PRR  CODES 

We  now  look  at  the  problems  of  decoding  PRR  codes. 
It  is  well  known,  and  by  no  means  questioned  here, 
that  the  infinite  set  of  sequences  generated  by  a  non¬ 
recursive  encoder  and  by  the  corresponding  recursive 
one  is  the  same  [7].  For  instance,  the  non-recursive 
generator  matrix  corresponding  with  (2)  is 

G=:[P(D)  N(D)],  (4) 

which  differs  only  from  (2)  by  the  scalar  factor  P(D). 
Clearly,  it  generates  only  finite-weight  sequences  in  res¬ 
ponse  to  finite- weight  information  sequences.  But  a 
generator  matrix  just  specifies  the  one-to-one  mapping 
which  associates  the  information  sequences,  which  are 
necessarily  in  finite  number,  with  a  subset  of  all  se¬ 
quences  which  may  be  generated  by  the  encoder.  The 
difference  between  the  non-recursive  and  the  recursive 
encoders  lies  in  the  fact  that  this  subset  contains  ex¬ 
clusively  finite- weight  sequences  for  the  former,  while 
it  also  contains  infinite-weight  sequences  for  the  lat¬ 
ter.  Using  PRR  codes  with  a  large  register  size  K  and 
a  primitive  polynomial  as  denominator  of  the  rational 
fraction  in  (2)  results  in  the  fraction  of  finite-weight 
sequences  in  the  subset  being  arbitrarily  small. 

The  sequences  generated  by  a  non-recursive  and  by 
a  recursive  encoder  belong  to  the  same  set  and  dif¬ 
fer  only  as  regards  the  probabilities  associated  to  them 
for  a  given  distribution  of  the  input  information  se¬ 
quences.  A  decoder  searches  in  the  infinite  set  of  pos¬ 
sibly  generated  sequences.  It  will  find  sequences  which 
would  often  be  generated  by  a  non-recursive  encoder, 
but  very  unlikely  by  the  recursive  equivalent  one.  One 
would  wish  to  eliminate  such  “spurious  sequences” ,  but 
they  belong  to  the  infinite  set  of  possible  sequences, 
and  two  sequences  close  to  each  other  may  belong  to 
this  set  if  the  minimum  distance  is  small.  From  this 
point  of  view,  the  pseudo-random  distance  distribution 
property  appears  as  misleading,  since  it  refers  to  the 
siaiisiical  properties  of  a  set  of  sequences  although  only 
a  single  joint  realization  of  an  encoded  sequence  and  of 
the  channel  noise  is  available  to  a  decoder. 

The  properties  of  PRR  codes  we  described  can  ne¬ 
vertheless  be  exploited  if  the  set  of  possible  information 
sequences  {«(D)}  is  restricted  to  some  distinctive  sub¬ 
set.  One  is  thus  led  to  a  concatenation  scheme  like 
the  combination  of  PRR  codes  according  to  the  turbo¬ 
code  principle,  as  proposed  in  [5].  The  pseudo-random 
distance  distribution  of  the  coded  sequences  is  by  this 
means  made  usable  by  a  decoder.  Before  starting  to 
describe  it,  we  make  a  few  remarks  on  decoding  PRR 
codes  by  a  conventional  algorithm. 

An  obvious  difficulty  comes  from  the  necessarily 
large  size  of  the  encoder  memory  /<',  which  practi¬ 
cally  prohibits  using  the  Viterbi  algorithm.  We  rea¬ 
lized  that  an  iterated  variant  of  replication  decoding 


could  be  used  [8].  As  for  the  Viterbi  algorithm,  deco¬ 
ding  is  described  by  a  trellis,  but  the  number  of  states 
is  independent  of  K.  In  the  simplest  case,  it  equals 
2^",  where  r  is  the  weight  of  the  polynomial  P(D).  For 
certain  values  of  K,  there  exists  a  primitive  polynomial 
of  minimum  weight  i.e.,  r  =  3,  so  the  number  of  states 
is  only  8. 

Such  a  small  value  of  r  results  in  a  poor  error-event 
probability  since  the  distance  between  two  converging 
paths  in  the  trellis  is  small.  This  distance  equals  the 
sum  of  the  weights  of  P(D)  and  N(Z)).  We  take  these 
two  polynomials  of  same  weight  in  order  to  keep  at 
decoding  a  complete  symmetry  between  the  informa¬ 
tion  and  redundancy  bits,  so  the  free  distance  is  2r.  In 
case  of  error,  the  decoded  sequence  is  the  closer  to  the 
transmitted  one,  the  smaller  the  weight  of  P(D)  since 
then  the  bit  error  rate  in  an  erroneously  decoded  se¬ 
quence  is  small.  Using  a  larger  value  of  r  would  both 
diminish  the  error-event  probability  and  increase  the 
bit  error  rate  in  case  of  error,  but  with  an  exponential 
increase  of  complexity. 

Rather  than  to  increase  the  weight  r,  we  prefer  to 
consider  concatenated  schemes.  The  decoded  sequence 
is  erroneous  with  a  large  probability  for  a  single  code 
but  it  is  not  far  from  the  correct  one,  which  is  favou¬ 
rable  in  a  concatenated  system  as  will  now  be  discussed. 

4.  PRR  TURBO-CODES 

Let  us  now  combine  two  PRR  encoders,  of  same  rate 
R  and  memory  K,  with  interleaving  so  as  to  make  a 
turbo-code  of  rate  =  R/{2  -  R).  One  may  curso¬ 
rily  analyse  this  system  assuming  that  the  two  infor¬ 
mation  sequences  entering  the  encoders  are  random, 
equiprobable  and  independent.  This  last  assumption 
is  obviously  false  since  interleaving  makes  a  one-to-one 
correspondence  between  both  sequences,  but  it  is  ad¬ 
missible  inasmuch  as  it  actually  plays  its  expected  role 
of  “decorrelation” .  Then,  the  two  encoders  jointly  ge¬ 
nerate  a  finite-weight  coded  sequence  only  if  their  in¬ 
put  sequences  are  represented  both  by  a  polynomial 
multiple  of  the  corresponding  denominator  in  (2).  As¬ 
suming  independency,  the  proportion  of  such  cases  is 
raised  to  the  square  and  thus  becomes  2“^^. 

We  find  here  the  advantage  of  turbo-codes  namely, 
that  combining  two  short  codes  conciliates  a  good  per¬ 
formance  with  the  overall  simplicity  of  decoding.  Se¬ 
parate  decoding  of  each  component  code  is  easy,  and 
weighting  the  decoded  symbols  avoids  losing  informa¬ 
tion  when  the  output  of  a  decoder  is  used  as  the  input 
of  the  other  one.  The  performance  is  thus  not  far  from 
that  which  would  result  from  a  single  PRR  code  of 
memory  twice  the  one  which  determines  the  decoding 
complexity,  but  where  the  specific  properties  of  PRR 
codes  are  now  taken  into  account.  Indeed,  interleaving 
results  in  avoiding  with  a  large  probability  that  the  re- 
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Figure  1:  Encoder  of  a  pseudo- random  turbo-code  of  rate 
1/2.  The  boxes  labelled  “Cod.  0,  Cod.  1,  Cod,  AT” 
generate  the  redundancy  bits  corresponding  to  the  input 
sequence  according  to  a  pseudo-random  code  of  rate  1/2. 
The  boxes  labelled  “Entrel.  1,  ...,  Entrel.  N”  represent 
interleavers,  the  delay  of  which  is  compensated  for  in  the 
box  labelled  “Delay”.  Periodic  puncturing  keeps  the  rate. 

sidual  errors  of  a  decoder,  which  occur  according  to 
patterns  made  of  a  small-weight  multiple  of  the  word 
of  minimum  weight,  are  a  defavourable  pattern  also  at 
the  input  of  the  other  encoder  [9] . 

We  may  interpret  interleaving  as  a  means  for  ge¬ 
nerating  redundancy  sequences  which  have  as  weight 
distribution  the  convolution  of  those  of  the  component 
codes.  The  weight  distribution  of  the  redundancy  se¬ 
quence  of  a  PRR  code  is  close  to  that  of  random  coding, 
except  in  its  low-weight  tail.  The  convolution  results  in 
little  change  in  the  central  part  of  the  distribution,  but 
improves  its  tails  since  it  gets  closer  to  the  Gaussian 
one.  The  rate  decrease  is  compensated  for  by  periodic 
puncturing.  More  than  two  pseudo-random  encoders 
can  be  combined  in  this  way,  as  depicted  in  Fig.  1,  so 
the  performance  in  terms  of  bit  error  rate  can  be  adjus¬ 
ted  to  the  user’s  specification  by  increasing  TV,  as  the 
central  limit  theorem  makes  its  low-weight  tail  closer 
and  closer  to  that  of  random  coding. 

5.  CONCLUSION 

Generating  long  encoded  sequences  is  a  necessary  condi¬ 
tion  for  near-capacity  performance  which  is  met  by 
convolutional  codes.  Their  historical  development  gave 
the  main  role  to  non-recursive  ones,  and  we  believe 
this  was  wrong.  As  could  be  foreseen  from  the  perfor¬ 
mance  of  turbo-codes,  the  recursive  codes,  especially 
the  PRR  ones,  actually  outperform  the  non-recursive 
ones  since  they  can  generate  long  sequences,  the  weight 
distribution  of  which  is  very  close  to  that  obtained 
in  the  average  by  random  coding.  Near-capacity  per¬ 
formance  thus  becomes  possible.  However,  the  deco¬ 


ders  do  not  deal  differently  with  recursively  and  non- 
recursively  encoded  sequences  (which  differ  only  as  re¬ 
gards  their  probability  distribution).  The  information 
sequences  should  thus  actually  belong  to  some  distinc¬ 
tive  subset,  which  enables  separating  them  from  the 
added  spurious  sequences  which  result  from  decoding. 
One  is  thus  led  to  concatenation,  especially  involving 
two  or  more  PRR  codes  combined  with  interleaving  ac¬ 
cording  to  the  turbo-code  scheme. 
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ABSTRACT 

In  this  paper,  an  automatic  procedure  is  developed 
to  construct  the  optimal  soft-decision  quantizer  and 
branch  metric,  based  on  the  cutoff  rate  criterion,  for 
the  Viterbi  decoder  over  any  memoryless  channel.  As 
only  samples  of  the  received  symbols  are  required,  this 
procedure  finds  applications  in  many  practical  chan¬ 
nels. 

1.  INTRODUCTION 

The  selection  of  the  soft-decision  quantizers  and  branch 
metrics  are  critical  issues  in  the  design  of  high-speed 
soft-decision  Viterbi  decoders.  For  an  additive  white 
gaussian  noise  (AWGN)  channel  using  QAM  or  MPSK 
signalling,  the  optimal  branch  metric  is  the  Euclidean 
metric  which  is  usually  approximated  using  integer  val¬ 
ues  in  practical  high-speed  implementations.  For  soft- 
decision  quantization,  the  traditional  method  is  to  quan¬ 
tize  the  in-phase  and  quadrature-phase  components  sep¬ 
arately  and  uniformly  (uniform  scalar  quantization)  [5]. 
More  recently,  Parson  [3]  and  Carden  [1]  suggested  that 
a  polar  (phase)  quantization  scheme  could  be  used  for 
MPSK  signals.  Further,  following  the  practice  in  uni¬ 
form  scalar  quantization,  Onyszchuk  [2]  pointed  out 
that  the  step  size  of  a  uniform  quantizer  should  be  ad¬ 
justed  according  to  the  noise  power.  Pafson,  Carden 
and  Onyszchuk  all  based  their  work  on  the  cutoff  rate 
design  criterion  put  forward  by  Wozencraft  [10]  and 
Massey  [7]. 

All  of  the  quantization  schemes  and  the  Euclidean 
metric  above  are  derived  for  an  AWGN  channel.  There 
is  no  guarantee  of  their  suitability  for  other  kinds  of 
channels.  Theoretically,  the  optimal  branch  metric  can 

This  work  was  funded  by  the  CSIRO,  Division  of  Radio¬ 
physics,  Australia.  The  authors  thank  T.  M.  Percival  of  CSIRO 
for  useful  discussion  throughout  the  investigation. 


be  determined  by  calculating  the  conditional  likelihood 
functions  of  the  received  symbols.  It  is  also  possible 
to  design  soft-decision  quantization  schemes,  similar  to 
those  mentioned  above,  based  on  the  cutoff  rate  cri¬ 
terion  for  memoryless  channels  (as  the  cutoff  rate  cri¬ 
terion  is  derived  under  the  assumption  of  a  discrete 
memoryless  channel,  DMC).  However  both  of  these  re¬ 
quire  evaluation  of  the  conditional  density  functions  of 
the  received  symbols  which,  apart  from  a  few  cases, 
are  extremely  hard  to  obtain  analytically.  One  point 
worthy  of  mention  about  the  quantization  schemes  de¬ 
scribed  above  is  that  all  impose  fixed  configurations  on 
the  shapes  of  the  quantization  regions,  namely,  rect¬ 
angles  and  sectors.  However,  this  is  not  an  inherent 
restriction  of  the  cutoff  rate  criterion,  and  the  success 
of  vector  quantization  techniques  in  speech  and  image 
compression,  suggests  that  better  performance  might 
be  achieved  by  removing  the  restriction  on  the  config¬ 
uration  of  the  quantization  regions.  In  this  context, 
better  performance  means  that  fewer  bits  are  needed 
in  the  branch  metric  and  path  metric  calculators  (Add- 
Compare-Select  processors)  in  the  Viterbi  decoder  for 
an  acceptable  error  rate. 

In  this  paper,  we  develop  a  practical  procedure  to 
automatically  determine  the  optimal  soft-decision  quan¬ 
tization  scheme  and  branch  metric  for  any  memoryless 
channel.  Because  the  conditional  density  functions  of 
the  received  symbols  in  general  are  difficult  to  obtain 
analytically,  it  will  be  assumed  that  only  samples  of 
the  received  symbols  are  given.  These  samples  can  be 
obtained  easily  by  experimental  measurements  or  com¬ 
puter  simulations.  We  begin  by  outlining  the  cutoff 
rate  design  criterion  in  Section  2.  In  Section  3,  an  au¬ 
tomatic  procedure  to  determine  the  soft-decision  quan¬ 
tization  regions  and  the  branch  metrics  is  described.  In 
Section  4,  the  procedure  described  in  Section  3  is  ap¬ 
plied  to  find  the  soft-decision  quantizers  and  branch 
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metrics  for  both  a  standard  AWGN  channel  and  an 
asymmetric  AWGN  channel.  Section  4  also  presents  a 
comparison  of  the  error-rate  performance  of  the  opti¬ 
mal  vector  quantization  schemes  and  the  uniform  scalar 
quantization  schemes,  obtained  by  computer  simula¬ 
tion. 

2.  THE  CUTOFF  RATE  CRITERION 

A  continuous  memoryless  channel  employing  coded  M- 
ary  QAM  or  PSK  modulation  and  soft-decision  decod¬ 
ing  with  J  quantization  regions  can  be  viewed  as  a 
M-input  J-output  DMC.  This  DMC  is  completely  de¬ 
scribed  by  the  transition  probabilities  P(j|m),  where 
j  =  0, 1, . . .,  J  —  1  and  m  =  0, 1, . .  .,M  —  1.  With 
equally  probable  input  symbols,  the  cutoff  rate,  Rq^  of 
this  DMC  is  given  by  [6]  [10], 


Ro  ==  logs  ^  -  log2  S  TF  Z) 


Viterbi  [9]  showed  that  when  convolutional  coding  is 
used  with  maximum  likelihood  decoding  (Viterbi  de¬ 
coder),  the  error  probability  in  bounded  by, 

Pe<CRL2-'^\  ifR<Ro  (2) 

where  1/  is  the  constraint  length  of  the  code,  R  is  the 
code  rate,  L  is  the  number  of  bits  encoded  and  Cr 
is  a  constant  independent  of  L  and  ly.  Note  that  this 
bound  is  generally  assumed  to  apply  to  trellis-coded- 
modulation  based  on  convolutional  codes. 

Equation  (2)  implies  that  the  error  probability  is 
minimized  by  maximizing  Ro-  Given  a  memoryless 
channel,  from  (1),  Ro  depends  on  the  arrangement  of 
the  quantization  regions  of  the  soft-decision  quantizer. 
Lee  [6]  observed  that  maximizing  Ro  is  equivalent  to 
minimizing  5,  where 

J-l  TM-l  _ 1  ^ 

j=0  Lm  =  0 

Lee  worked  out  a  necessary  condition  for  the  minimiza¬ 
tion  of  S  which  states  that  for  every  point  r  =  p  on 
the  boundary  of  any  two  adjacent  optimal  quantiza¬ 
tion  regions,  say  distinct  a  and  b  where  0  <  a,  6  <  J, 
if  r  is  a  point  of  continuity  of  conditional  density  func¬ 
tions  /(r|m)  for  0  <  m  <  M  and  P{a\m)  ^  0  and 
P(b\m)  7^  0  for  0  <  m  <  M,  then 

M~i  r  M-i 

E  /pm 

.  M-1  _ 

V-P(“l"»)  7^0  J 


Lee  went  on  to  transform  this  necessary  condition  to 
constraints  in  a  likelihood  space  and  developed  an  iter¬ 
ative  procedure  to  obtain  the  optimal  quantization  re¬ 
gions  in  that  space.  The  major  difficulty  in  implement¬ 
ing  Lee’s  algorithm  is  that  it  is  necessary  to  keep  track 
of  the  adjacency  of  the  quantization  regions  during  the 
iterative  process  and  convert  the  regions  in  likelihood 
space  back  to  the  signal  space  after  an  optimal  parti¬ 
tioning  of  the  likelihood  space  has  been  found.  None 
of  this  is  easy  to  implement  automatically.  Moreover 
convergence  of  the  iterative  process  is  not  guaranteed. 
In  the  next  section,  we  modify  Lee’s  condition  to  allow 
us  to  perform  the  optimization  process  more  easily. 

3.  THE  AUTOMATIC  PROCEDURE 

Let  us  first  assume  that  we  know  the  conditional  den¬ 
sity  functions  /(r|m),  for  0  <  m  <  M.  This  is  deter¬ 
mined  by  the  channel.  For  0  <  j  <  J,  define 


=  E 


7=fF^  E 


Then  it  can  be  shown  that  S  will  decrease  if  for  every 
r,  we  assign  r  to  region  a  such  that 

Da{r)<Dj{r)  , for  all  i  7^  a  (6) 

Hence  by  (6),  a  simple  iterative  procedure  can  be  con¬ 
structed  to  search  for  a  minimum  of  S.  Note  that  con¬ 
vergence  is  guaranteed  as  S  is  lower-bounded  (see  (3), 
every  P{j\m)  >  0). 

Following  this  iterative  process,  we  obtain  a  map  of 
the  optimal  quantization  regions.  The  optimal  branch 
metrics  (the  likelihoods),  5(-|-),  for  the  regions  can  be 
calculated  easily  by 

B{j\m)  =  -  ln(P(i|m))  (7) 

for  0  <  m  <  M  and  0  <  j  <  J.  For  fixed-point  imple¬ 
mentation  of  the  Viterbi  decoder,  if  we  employ  the  2’s 
complement  roll-over  method  in  the  path  metric  calcu¬ 
lator,  no  rescaling  of  the  path  metric  is  needed.  How¬ 
ever  the  maximum  value  in  the  branch  metric,  Bmax, 
must  be  small  enough  such  that 


where  v  is  the  constraint  length  of  the  code  and  is 
the  number  of  bits  in  the  ACS  calculator.  Then  to  suit 
this  implementation,  the  branch  metrics  in  (7)  can  be 
linearly  scaled  and  round  off  to  B\j\m)  as  in  (9). 


B\j\m)  = 


B{j\m)  -  mm{B{  \-)} 
max{R(-|-)}  -  min{R(*|-)} 
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where  [-J  denotes  rounding  to  the  nearest  integer. 

As  stated  in  Section  1,  we  assume  that  only  sam¬ 
ples  of  the  received  symbols  are  given.  Under  this  as¬ 
sumption,  we  replace  the  conditional  density  functions 
required  in  the  iterative  process  above  by  estimates  ob¬ 
tained  via  the  samples  of  the  received  symbols.  While 
any  reasonably  good  density  estimation  method  [see  4] 
can  be  used,  we  adopt  the  ‘pre-whiten’  gaussian  kernel 
method  [4,  pp.  77]  in  this  paper.  As  in  other  applica¬ 
tions  of  density  estimation,  the  amount  of  smoothing 
introduced  is  the  most  important  factor  to  be  chosen. 
In  this  case,  the  amount  of  smoothing  affects  the  reten¬ 
tion  of  detail  in  the  density  functions,  and  hence  in  the 
shapes  of  the  quantization  regions.  It  is  likely  also  to 
affect  the  rate  of  convergence  of  the  iterative  process. 

Summarizing  the  discussion  in  this  section,  the  steps 
of  the  automatic  procedure  generating  the  soft- decision 
quantization  regions  and  the  branch  metric  are  as  fol¬ 
lows: 

1.  Obtain  estimates  of  the  conditional  density  func¬ 
tions  from  samples  of  the  received  symbols. 

2.  Choose  an  initial  map  of  the  quantization  regions. 

3.  Assign  each  point,  r,  in  the  map  according  to  (6). 

4.  Calculate  S  by  (3). 

5.  If  the  change  in  S  is  very  small,  obtain  the  map 
of  the  optimal  quantization  regions  and  go  to  the 
next  step.  Otherwise  repeat  Step  3. 

6.  Calculate  the  branch  metric  using  (7). 

7.  Rescale  the  branch  metric  to  integers  by  (9). 

4.  COMPUTER  SIMULATIONS 

In  this  section,  we  use  the  procedure  in  Section  3  to 
determine  the  optimal  soft-decision  quantizers  and  the 
branch  metrics  for  two  hypothetical  channels,  namely, 
the  AWGN  channel  and  the  asymmetric  AWGN  chan¬ 
nel.  The  asymmetric  AWGN  channel  is  similar  to  an 
AWGN  channel,  except  that  noise  variance  in  the  in- 
phase  component  is  nine  times  of  that  of  the  quadrature- 
phase  component.  This  channel  serves  as  a  simple  ex¬ 
ample  of  one  which  has  significantly  different  charac¬ 
teristics  to  an  AWGN  channel.  The  optimal  branch 
metric  of  the  asymmetric  AWGN  channel  is  not  the 
Euclidean  metric  and,  in  fact,  it  is 

+  {y-s^y  (10) 

where  r  =  (x^y)  and  =  (5^,5^)  are,  respectively, 
the  received  symbol  and  the  m-th  transmitted  symbol. 


for  0  <  m  <  M.  We  assume  QPSK  signals  with  TCM 
is  employed  as  shown  in  Figure  1  and  fix  the  number 
of  quantization  regions  to  32.  Moreover,  the  number 
of  bits,  nj,  in  the  ACS  calculator  is  set  to  7  and  this 
requires  Rmaar  to  be  set  to  12.  Ten  thousand  samples 
of  the  received  symbols  of  the  two  channels,  operating 
at  5  dB  signal-to-noise  ratio  (SNR)  per  symbol,  are 
fed  to  the  automatic  procedure  to  find  the  soft-decision 
quantization  regions  and  the  branch  metrics.  The  maps 
of  the  quantization  regions  for  the  two  channels  are 
given  in  Figure  2. 

Monte- Carlo  simulations  were  performed  to  esti¬ 
mate  the  bit  error  rate  performance  of  the  TCM  system 
in  Figure  1  using  these  soft-decision  quantizers  and  the 
branch  metrics  for  the  two  channels,  respectively.  The 
Viterbi  decoder  is  assumed  to  have  a  decision  depth  of 
32.  The  results  are  shown  in  Figure  3(a)  and  (b)  for 
the  AWGN  and  the  asymmetric  AWGN  channel  respec¬ 
tively.  Figure  3  also  shows  results  for  a  system  with  uni¬ 
form  scalar  quantizer  (3  bits  for  each  component)  and 
Euclidean  metric,  and  for  a  system  with  infinite  preci¬ 
sion  and  the  optimal  branch  metric  (Euclidean  metric 
for  the  AWGN  channel  and  the  metric  from  Equation 
(10)  for  the  asymmetric  AWGN  channel).  From  Fig¬ 
ure  3(a),  the  performance  of  the  vector  quantizer  is  ba¬ 
sically  the  same  as  that  of  the  uniform  scalar  quantizer, 
which  is  about  0.4dB  poorer  than  the  unquantized  sys¬ 
tem  at  a  bit  error  rate  of  10”^.  This  agrees  with  re¬ 
sults  reported  in  [5].  So  we  save  1  bit  by  using  the 
vector  quantizer.  For  the  asymmetric  AWGN  channel, 
the  performance  of  the  vector  quantizer  is  much  better 
(about  1.5  dB)  than  that  of  the  uniform  quantizer.  In 
this  case,  the  main  reason  for  the  better  performance  is 
that  the  branch  metric  of  the  vector  quantizer  is  much 
closer  to  the  optimal  one  (Equation  (10))  than  the  Eu¬ 
clidean  metric  used  in  the  uniform  quantizer.  This  can 
be  seen  from  Figure  2(b)  which  shows  a  map  of  the 
vector  quantization  regions  similar  to  that  of  a  uni¬ 
form  scalar  quantizer  (i.e.  approximate  partitioning 
into  rectangular  regions). 

5.  CONCLUSION 

In  this  paper,  we  have  developed  for  the  Viterbi  de¬ 
coder  a  practical  procedure  based  on  the  cutoff  rate 
design  criterion  that  determines,  automatically,  the  op¬ 
timal  soft-decision  vector  quantization  regions  and  the 
branch  metric  over  any  memory  less  channel.  Only  sam¬ 
ples  of  the  received  symbols  are  needed  and  no  other 
prior  knowledge  of  the  channel  is  assumed.  Computer 
simulations  indicate  that  the  vector  quantizers  perform 
better  than  their  uniform  scalar  counterparts  for  both 
an  AWGN  channel  and  an  asymmetric  AWGN  channel. 
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Figure  1:  TCM  system  with  soft- decision  quantizer 
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STATE  SPACE  APPROACH  IN  TIME- VARYING  BIORTHOGONAL 

FILTER  BANKS 

Aweke  N.  Lemma  and  Ed  F.  Deprettere 


Abstract —  Using  state  space  representations  of  biorthog- 
onal  time  varying  filter  banks,  it  is  possible  to  derive 
conditions  under  which  the  transition  between  two  time- 
invariant  filter  banks  can  be  so  controlled  that  both  the 
decomposition  filters  and  the  reconstruction  functions  gen¬ 
tly  embrace.  For  a  certain  domain  of  filters,  the  transition 
interval  can  be  made  arbitrarily  short.  If  it  is  zero,  then 
the  special  case  of  instantaneous  transition  is  reached. 


1.  Introduction 

A  biorthogonal  filter  bank  (£*,  R)  is  a  linear  expan¬ 
sion  of  a  sequence  or  signal.  Thus  if  u  G  then 

^  ~  where  e„j  and  r-n  are  the  rows  and 

columns  of  bounded  matrix  operators  E  and  respec¬ 
tively,  with  RE  =  /,  the  identity  operator.  We  will  as¬ 
sume  throughout  that  Cn;  and  r;n  are  of  finite  lengths, 
and  that  E  is  lower  triangular  (causal)  and  R  upper  tri¬ 
angular  (anti- causal).  The  entries  of  these  operators  will 
be  finite  dimensional  matrices  of  dimensions  Q  x  P  and 
P  X  Qj  respectively,  with  the  dimensions  possibly  time 
varying  and  Q  >  P.  If  Q  =  P,  then  also  RE  =  I.  If  the 
filter  bank  is  shift-invariant,  then  E  and  R  are  Toeplitz 
and  characterized  by  the  central  row  e  and  central  column 
r,  respectively.  In  this  case  we  write  (^(e), /?.(r))  instead 
of  (P,  R). 


(1) 


R{r)  = 


r 


r 


(2) 


In  the  context  of  filter  banks,  the  rows  of  E  are  called  fil¬ 
ter  weight  vectors  and  the  columns  of  R  are  called  impulse 
response  vectors.  Moreover,  they  have  the  property  that, 
with  F(a:)  the  Fourier  Transform  of  x,  P(e)  and  F{r)  are 
uniform  spectral  decompositions  of  the  base  band  [0,  tt]. 

In  most  applications  a  single  biorthogonal  filter  bank 
(P(e),  R{r))  is  used  to  decompose  and  reconstruct  certain 


signals  «.  However,  as  most  signals  are  not  stationary, 
it  may  be  expected  that  time  varying  filter  banks  will  do 
a  better  job,  especially  when  a  quantization  operator  is 
inserted  in  between  the  two  bank  operators.  In  the  lat¬ 
ter  case,  for  example,  it  is  then  also  necessary  to  guaran¬ 
tee  that  the  spectral  decomposition  properties  of  the  time 
varying  filter  weight  vectors  are  preserved  and  also  that 
the  time  varying  impulse  response  vectors  remain  good 
interpolation  functions. 

In  this  paper,  we  find  perfect  reconstruction  filter  banks 
giving  signal  expansions  of  the  form 

Ho-l  -Hop 

“  =  ^  {w,e„;)r;„+  {u,e2„)r2.„ 

n=-oo  n=no  n=n/ 


where  nj  >  n^,  and  the  el^;  and  e2„;  are  all  shifted  ver¬ 
sions  of  el  and  e2,  respectively  (and  similarly  for  the  rl;„ 
and  r2;n).  In  other  words,  the  behavior  on  the  segments 
[— oo,n(,  —  1]  and  [n/,+oo]  is  shift  invariant,  whereas  the 
segment  [no,n/  —  1]  is  the  transition  region  on  which 
and  r;n  have  doubly  indexed  entries.  Typically,  the  objec¬ 
tive  is  to  have  small  transition  segment  supporting  smooth 
transitions  between  the  stationary  segments  in  both  the 
E  and  R  parts.  Except  when  stated  explicitly  otherwise, 
we  will  have  Q  =  P  throughout. 


II.  State  Space  Representation 

Instead  of  using  input-output  maps  E  and  i?,  the  filter 
bank  can  also  be  represented  by  state  space  realizations. 
For  the  decomposition  part  E",  the  state  space  realization 
at  time  instant  n  maps  present  input  u{n)  G  (S)  and 
present  state^  x(n)  G  (TL)  to  present  output  y{n)  G 
/^(Z)  and  next  state  x{n  -(-  1).  Thus,  denoting  by  m(n) 
this  map,  we  have 


m(n) 


a;(n) 

«(n) 


x{n  +  1) 

yin) 


where  the  (iV  -f  P)  x  (AT  P)  matrix  m(n)  is  explicitly 
written  as 


m(n) 


a(n)  6(n) 
c(n)  d{n) 


(3) 


This  is  shown  in  Fig.l.  Let  x  represent  the  state  sequence 
{x(n)},  n  G  [—00,00].  Likewise  let  y  =  {t/(Ti)}  and  u  = 
{u(n)}.  Then  we  can  write 


The  authors  are  with  the  Department  of  Electrical  Engineering, 
Delft  University  of  Technology,  Delft,  The  Netherlands.  This  re¬ 
search  was  partly  supported  by  Philips  Research,  The  Netherlands. 

^That  is,  tx  is  a  finite  energy  sequence  or  column  vector  with  P- 
dimensional  column  vector  entries  u(n),  n  G 


Zx 

y 


^Like  P  and  Q,  N  (the  dimension  of  a:(n))  can  be  time  varying. 
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Fig.  1.  The  state  space  realization  of  the  decomposition  part  of  the 
Q-channel  filter  bank. 


A  =  diag(. .  .a(n  —  1)  a(n)  a(n -h  1) . .  .)f 
B  =  diag{. . .  6(n  —  1)  b(n)  b{n  +  1) . . .), 

C  =  diag{. . .  c(n  -  1)  c(n)  c(n  +  1) . . .)  and 
D  =  diag{. . .  d{n  —  1)  d{n)  d{n  -h  1)  . . .), 

and  Z  is  a  shift  operator 


is  then  M.  In  the  state  realization  domain,  the  ‘‘perfect 
reconstruction”  property  RE  =  I  transfers  to  MM  =  /, 
or  equivalently  m(n)m(n)  =:  1^,  for  all  n. 

The  decomposition  map  E  and  its  corresponding  state 
space  realization  map  M  are  related  as  follows.  Let 

C{n)  =  [  , .  .a(n  —  l)a(n  ~  2)6(n  —  3) 

a{n  -  l)6(n  -  2)  6(n  -  1)  ]  (6) 

■  c(n) 

c{n  +  l)a(n) 

0(n) c(n  +  2)a(n  +  l)a(n) 

then  W(n)  =  0{n)C{n)  is  the  so-called  Hankel  matrix®  of 
rank  N  at  time  instant  n.  It  relates  past  inputs  and  future 
outputs  as  follows. 


From  these  items  we  obtain  the  following  relations. 
•  From  M  E 


^n;  =  [  c{n)C{n)  d{n)  ]  (9) 

■"  “  [  0(n  +  l)b(n)  J  (10) 


For  the  reconstruction  part  R  we  have,  mutatis  mutan¬ 
dis,  similar  relations^.  In  order  to  distinguish  R  from 
E,  we  add  an  overbar  to  the  symbols  when  related  to 
R  :  m(n),  a(n),  6(n),  c(n),  d(n).  This  is  shown  in  Fig. 2. 
The  state  space  realization  map  on  the  reconstruction  side 


Fig,  2,  The  state  space  realization  of  the  reconstruction  part  of  the 
Q-channel  filter  bank. 

^Recall  that  R  is  an  anti-causal  map,  so  ‘next’  becomes  ‘previous’ 
and  Z  is  to  be  replaced  by  Z'^. 


•  From  E  M 

a{n)  =  Ol(n+l)Ol(n)  =  C(n+l)-*Cl(n) 
h{n)  =  right-most  N  x  P  entry  of  C{n  +  1) 
c(n)  =  top  P  X  N  entry  of  0(n) 
d{n)  =  the  common  entry  of  Cn;  and  e;„ 

where  the  superscript  (1)  denotes  pseudo  inverse,  and 
C-  =  CS^,  =  S^O,  with 

'01  1 
0  0  1  1 
•S'?  =  0  0  0  1  ;  10 

1  0  0 

J  [  0  0  0 

Similar  relations  hold  between  R  and  M .  In  this  paper  we 
address  three  classes  of  time  varying  filters;  a)  filters  which 

^  where  1  stands  for  the  identity  matrix  of  appropriate  order. 
^C(n)  and  0(n)  are  called  the  controllability  and  observability 
operators  at  time  instant  n,  respectively. 
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come  to  existence  at  a  certain  time  and  extend  to  oper¬ 
ate  indefinitely  (bounded  from  the  top),  b)  filters  which 
have  been  working  since  infinite  time  in  the  past  and  dis¬ 
appear  at  a  certain  time  (bounded  from  the  bottom),  and 
c)  filters  of  the  second  class  being  taken  over  by  that  of 
the  first  class  (filters  with  transitions).  In  [1]  the  case  of 
two-channel  orthogonal  filter  banks  is  considered.  There, 
orthogonal  projectors  are  used  to  complete  the  basis  for 
the  right  side  sequence  of  the  bounded  filter  bank.  In  con¬ 
trast,  our  method  uses  the  state  space  approach  to  find  the 
boundary  filters.  The  boundary  filters  so  determined  may 
happen  to  have  smooth  transitional  behavior,  at  both  the 
decomposition  and  reconstruction  sides.  However,  in  case 
the  filters  so  obtained  are  not  satisfactory,  optimization 
procedures  may  be  employed  to  suit  the  application.  We 
address  this  in  section  III-C.2.  In  the  third  class,  there  is 
a  special  case  where  the  takeover  is  instantaneous,  that  is, 
where  the  transition  region  [no,n/  —  1]  disappears.  This 
case  is  addressed  in  section  IV. 

III.  Generic  Classes 

In  this  section,  we  give  conditions  for  the  filter  banks  in 
each  of  the  three  classes  mentioned  in  the  previous  section 
to  guarantee  perfect  reconstruction  as  well  as  minimality 
of  the  transition  region. 

A.  Filters  hounded  from  the  top 

We  start-off  with  bounding  the  doubly  infinite  matrices 
in  (1)  and  (2)  to  the  North  and  to  the  West,  respectively. 


Because  of  the  sudden  change  in  the  filter  coefficients,  the 
perfect  reconstruction  behavior  is  lost  in  the  process.  We 
can  re-establish  the  perfect  reconstructiveness  if  we  pro¬ 
vide  the  system  with  the  necessary  state  (vector)  such 
that  the  map  from  past  inputs  to  this  state  is  equal  to  the 
controllability  operator  of  the  shift-invariant  system.  The 
initialization  must  be  in  such  a  way  that  we  can  recover 
the  input  at  the  reconstruction  side  from  the  outputs  of 
the  succeeding  realizations. 

The  state  space,  model  of  the  switching  process  on  the 
decomposition  side  is  shown  in  Fig. 3,  At  the  initial  time 
step,  the  state  is  initialized  from  the  input.  Therefore, 
the  state  space  realization  m(0)  at  creation  time  n  =  0  is 
such  that  a(0)  =  [],  c(0)  =  []  and  c/(0)  =  [].  Moreover, 
in  order  to  perfectly  recover  the  initial  input  vector  at  the 


reconstruction  side,  its  dimension  must  be  at  most  equal 
to  the  state  dimension  of  the  target  stationary  filter.  For 
maximally  decimated  systems  (P  =  Q),  the  best  choice 
would  be  to  take  the  dimension  of  the  initial  input  to  be 
equal  to  the  state  dimension  as  indicated  in  Fig. 3.  In  this 
case,  6(0)  must  be  an  A  x  TV  non-singular  operator,  where 
TV  is  the  state  dimension  of  the  target  stationary  filter. 


Fig.  3.  Time  varying  system  representation  bounded  from  the  top 


Since  the  filter  coefficients  are  given  as  in  (9),  the 
switching  process  on  the  decomposition  side  would  be  over 
if,  at  a  certain  point  in  time,  the  mapping  from  past  inputs 
to  the  present  states  (the  controllability  operator)  agrees 
with  that  of  the  target  stationary  filter [2].  This  means 
that  the  number  of  inputs  needed  to  establish  the  sta¬ 
tionary  filters  depends  on  the  length  of  the  controllability 
operator. 

The  reconstruction  version  of  Fig. 3  can  be  obtained  by 
reversing  the  directions  of  signal  flows  in  it.  For  this  re¬ 
construction  filter,  the  impulse  response  at  time  step  n, 
which  is  the  same  as  the  column  of  P,  is  given  as 

d{n  -  l)6(n)  ■ 
d(n)  J  ’ 


where  0(n)  is  the  observability  operator  of  m(n).  Clearly 
the  impulse  response  at  time  step  n  would  agree  with  that 
of  the  stationary  filter  if  0{n  —  1)  were  the  observability 
operator  of  the  stationary  target  filter.  In  maximally  dec¬ 
imated  systems  the  controllability  operator  on  the  decom¬ 
position  side  and  the  observability  operator  on  the  recon¬ 
struction  side  have  equal  lengths.  Therefore,  in  order  to 
complete  the  switching  process  on  the  reconstruction  side, 
we  need  as  many  outputs  as  the  inputs  on  the  decomposi¬ 
tion  side.  The  question  remaining  is  how  many  boundary 
functions  both  on  the  decomposition  and  reconstruction 
sides  are  needed  for  the  switching  process. 

Let  the  realization  on  the  decomposition  side  at  n  =  1, 
m(l)(a(l),  6(1),  c(l),  d(l))  be  such  that  [a(l)6(0)  6(1)]  = 
C{2)  =  Cy  where  C  is  the  controllability  operator  for  the 
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stationary  filter,  then  all  the  filters  for  n  >  1  will  agree 
with  that  of  the  stationary  filter.  It  is  evident  from  here 
that,  once  6(0)  is  chosen,  the  values  of  a(l)  and  6(1)  are 
fixed.  They  are  given  by 

ai=Ci:,l:N)bo^ 


and 

bi=C{:,N+l:C) 

where  C(:,i  :  j)  stands  for  columns  of  C  in  the  inter¬ 
val  N  =  is  state  dimension  of  the  target  filter  and 
£=:length  of  C. 

Once  «(i)  and  6(1)  are  fixed,  the  choices  of  c(l) 
and  d(l)  must  be  such  that  the  inverse  realization 
m(l)(a(l),6(l),c(l),J(l))  at  n  =  1  satisfies  the  require¬ 
ments  on  the  observability  operator.  Following  we  will 
show  that,  for  a  particular  choice  of  6(0),  it  is  possible  to 
come  up  with  a  simple  procedure  to  design  the  realizations 
m(l)  and  m(l)  which  satisfy  the  mentioned  requirements. 

Proposition  III.  1:  Let  m(a,6,  c,  d)  and  m(d,6,c,  d)  be 
the  state  space  realizations  of  a  Q-channel  (with  Q=P) 
stationary  target  filter  bank  on  the  decomposition  and 
reconstruction  sides,  respectively.  Let  also  be  given  6(0)  = 
C(:,(£  —  TV  H-  1)  :  C)  (the  N  right  most  columns  of  C). 
Define  K  =  Then,  if  a(l)  =  and 

6(1)  =  C(:,  (£  —  /V-f  1)  :  £),  we  have  [  a(l)6(0)  6(1)  ]  = 

C.  Moreover,  if  we  choose  c(l)  =  0(1  :  K,:)  and  d(l) 
equal  to  the  KxK  matrix  £*(1  :  K,  C+1  :  £+/V),  where  E 
is  as  given  in  (11),  then  m(l)  also  satisfies  the  requirement 
on  the  observability  operator  at  the  reconstruction  side. 

[  '‘"a)'*  ]  =  op) = 0- 

Proof:  The  proofs  of  all  the  propositions  are  omitted 
because  of. lack  of  space.  ■ 

Proposition  III.  2:  Given  the  one  sided  infinite  Q- 
channel  decomposition/reconstruction  pair  of  (11)  and 
(12),  The  number  of  the  boundary  functions  needed  to  re¬ 
establish  the  perfect  reconstruction  is  equal  to  C  —  N  <  /V, 
where  K  =  with  N  the  state  dimension  and  C 

length  of  the  controllability  operator.  ■ 

The  following  proposition  fixes  the  maximum  length  of  the 
boundary  functions. 

Proposition  III. 3:  The  maximum  length  of  the  bound¬ 
ary  functions  is  £  <  TV  -h  /V,  where  TV  is  the  state  dimen¬ 
sion  of  the  system,  C  =:length  of  C  and  K  is  as  defined  in 
proposition  III.l.  ■ 

B.  Filters  bounded  from  the  bottom 

Here,  we  consider  the  case  when  the  stationary  system 
has  been  working  for  indefinite  time  in  the  past  and  ends 
its  operation  at  a  certain  time  n  —  0,  say.  The  corre¬ 
sponding  realization  is  depicted  in  Fig.4.  Since  our  in¬ 
tention  is  to  terminate  the  filtering,  the  boundary  filter 
must  use  the  present  state  and  convert  it  into  output  as 
indicated  in  Fig.4.  The  generation  of  the  output  functions 
must  be  such  that  the  map  from  the  state  provided  by  the 
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Fig.  4.  Time  varying  system  representation  bounded  from  the  bot¬ 
tom 


stationary  system  to  be  terminated  to  the  future  outputs 
approaches  O  as  quickly  as  possible.  This  can  be  achieved 
by  setting  c(0)  equal  to  the  top  TV  rows  of  O.  The  result¬ 
ing  output  functions  are  such  that  at  most  the  bottom  K 
functions  give  the  boundary  filters  and  the  rest  have  the 
same  values  as  the  stationary  filter,  where  K  is  as  defined 
in  proposition  III.l.  This  can  easily  be  verified  by  gener¬ 
ating  the  output  functions  of  the  boundary  system  using 
the  relation  y(0)  =  c(0)C  =  0(1  :  TV,  :)C.  Note  that  c(0) 
must  be  non-singular  in  order  to  perfectly  recover  the  past 
inputs  using  the  output  of  the  boundary  system. 

C.  Transition  between  filter  banks 

In  this  case,  the  question  we  want  to  answer  is:  given 
two  stationary  filters  with  state  space  realizations  mi  and 
m2,  we  would  like  to  design  an  intermediate  realization  m 
such  that  the  transition  durations  on  both  the  decomposi¬ 
tion  and  reconstruction  sides  are  minimal®  and  equal.  Let 
Fig.5  represent  such  a  minimal  system.  Note  that 

{mi  :  ne  [-oo,  (uo  ~  1)] 
m  :  ne  [no,(nf -1)] 
m2  :  n  €  [rif ,  +oo] 

with  no  —  1  =  —1  and  n/  —  1  =  no  =  0.  If  m  is  such 
that  the  map  from  the  past  inputs  of  m(ny)  to  its  current 
state  equals  the  controllability  operator  of  the  overtaking 
filter,  then  the  outputs  for  n  >  n/  will  be  equal  to  the 
stationary  values  of  the  overtaking  filter.  Since  the  initial 
filter  is  assumed  to  have  been  operating  stationarily  up  to 
n  =  no  -  1,  the  outputs  for  n  <  n^  are  unaffected  by  the 
intermediate  system  m.  This  means,  the  filter  behaves  the 
same  way  as  the  initial  stationary  system  up  to  n  =  n^  — 
1.  On  the  reconstruction  side,  for  the  impulse  responses 
to  agree  with  the  respective  stationary  values  before  and 
after  the  transition,  the  intermediate  realization  m  must 
satisfy  O  =  0(n)„>„^  =  O2. 

C.l  Minimal  Structure  for  Switching  Between  Two  Filter 
Banks 

Consider  the  intermediate  realization  m(a,  6,  c,  d)  shown 
in  the  middle  part  of  Fig.5.  Let  the  controllability  oper- 

^  Minimal  means  Q,  the  number  of  transitional  output  functions 
realted  to  m,  is  minimum. 
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Fig.  5.  Time  varying  system  representation,  transition  between  two 
filters. 

ator  on  the  decomposition  side  and  the  observability  op¬ 
erator  on  the  reconstruction  side  for  the  initial  stationary 
filter  and  the  final  stationary  filter  be  Ci,Oi  and  ^2,02) 
respectively.  The  following  conditions  must  be  satisfied 
by  the  transition  system  m  and  its  inverse  m(a,6,  c,  d)  if 
the  time  varying  filter  has  to  behave  as  the  respective  sta¬ 
tionary  filters  outside  the  transition. 

C2  =  [  aCi  b]  (14) 

O2  =  [  ]  .  (15) 

If  C2  =  SCi  for  some  invertible  operator  5,  then  it  can  be 
easily  shown  that  there  exist  operators  a  and  a  such  that 
aCi  =  and  Oia  =  We  address  this  sub-class  of 
filter  banks  in  sections  IV  and  V.  For  now  let  us  assume 
this  is  not  the  case. 

Proposition  III. 4:  Let  m(a,6,  c,  d)  be  the  intermediate 
realization  of  the  time  varying  system  shown  in  Fig. 5. 
Also,  assume  that  C2  /  SC\  for  all  invertible  5.  Then, 
m  and  its  inverse  m  are  minimal  if  and  only  if  there  exists 
an  invertible  matrix  t,  such  that 


Where  m'  is  such  that  the  time  varying  decomposition  op¬ 
erator  E'  is  a  concatenation  (along  the  main  diagonal)  of 
El  minimally  bounded  from  the  bottom  and  E2  minimally 
bounded  from  the  top.  See  (17).  ■ 

Basically  the  above  proposition  is  stating  that,  if  the 
state  space  realization  m  (the  middle  part  of  Fig. 5)  and 
its  inverse  m  represent  minimal  systems,  then  the  state 
transition  operators  a  and  d  are  zero.  Consequently,  we 
can  redraw-  the  detailed  version  of  m  as  indicated  in  Fig. 6. 


. 


Fig.  6.  Details  of  the  intermediate  state  space  realization. 

This  structure  suggests  that  the  decomposition  operator 
E^  of  the  time  varying  filter  has  the  form  shown  below. 


There  is  a  similar  structure  on  the  reconstruction  side  also. 
In  summary,  the  transition  functions  can  be  determined 
by  terminating  and  starting  the  two  filter  banks  before 
and  after  transition  as  described  in  sections  III-A  and  III- 

B,  respectively,  and  combining  the  so  bounded  filter  banks 
as  shown  in  (17).  Taking  proposition  III. 4  the  other  way 
arround,  overlap  on  the  transition  segment  can  then  be 
achieved  by  applying  an  invertible  output  transform.  We 
consider  this  in  the  next  subsection.  Because  the  structure 
of  the  transition  functions  is  similar  to  the  concatenation 
of  independent  bounded  perfect  reconstruction  filters,  the 
number  of  functions  needed  to  switch  between  them  is  the 
sum  of  the  boundary  functions  in  the  independent  filters. 

C. 2  Controlling  the  Behavior  of  the  Transition  Filters 

In  real  applications,  in  addition  to  perfect  reconstruc¬ 
tion,  good  filtering  behavior  is  required.  Therefore,  we 
would  like  to  know  if  we  can  alter  the  behavior  of  the 
transition  functions  as  desired  without  violating  its  per¬ 
fect  reconstruction.  Fortunately,  we  have  some  degree  of 
freedoms  to  do  so. 

Consider  the  state  space  realization  of  the  intermedi¬ 
ate  system  shown  in  Fig. 6.  Cascading  this  system  with 
an  invertible  Q  x  Q  constant  matrix  t  does  not  affect  the 
perfect  reconstructing  nature  of  the  filter.  If  in  addition,  t 
is  optimized  to  give  good,  desired,  impulse  response  tran¬ 
sitions  on  [rio^rij  --  1]  in  R,  while  maintaining  smooth 
filter  weight  vector  transition  on  [rio^nj  —  1]  in  E^  we  can 
improve  the  behavior  of  the  transition  filters  without  de¬ 
stroying  the  perfect  reconstruction.  Due  to  the  mixing  of 
the  filter  coefficients  of  the  upper  and  the  lower  bound¬ 
ary  filters  (see  equation  17)  by  the  cascaded  operator  t. 
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overlapped  transition  is  resulted.  For  further  reading  on 
optimization  methods  the  reader  is  advised  to  refer  to  [1] 
and  [3]. 

In  the  remaining  part  of  the  paper  we  will  show  that, 
if  the  two  filters  under  question  are  such  that  Cb  —  SCa, 
as  stated  earlier,  the  transition  between  the  filters  can  be 
made  arbitrarily  small.  In  the  limiting  case  we  can  instan¬ 
taneously  switch  from  initial  filter  to  the  final  one  without 
any  transition.  For  this  subclass  of  filters,  we  first  sum¬ 
marize,  in  the  next  section,  results  for  the  case  no  =  n/, 
which  were  presented  in  [4]  and  allow  the  construction  of 
piecewise  stationary  biorthogonal  banks.  In  section  V,  we 
extend  these  results  to  the  case  when  nj  >  no  by  intro¬ 
ducing  an  interpolation  method  which  allow  smooth  tran¬ 
sition  between  filter  weight  vectors  in  the  decomposition 
part  of  the  bank,  and  at  the  same  time  smooth  transition 
between  impulse  response  vectors  in  the  reconstruction 
part  of  the  bank.  Examples  that  demonstrate  the  results 
are  given  in  section  VI. 

IV.  Instantaneous  Transition  in  Perfect 
Reconstruction  Filter  Banks 

The  problem  we  want  to  address  in  this  section  is:  given 
two  stationary  filters  with  state  space  realizations  mi  and 
m2 ,  is  it  possible  to  instantaneously  switch  from  initial  to 
the  final  filter  ?  Following  we  formally  state  the  problem. 

Let  (£'(e),  R{r))  be  a  shift  invariant  biorthogonal  filter 

bank.  Put  E(e)  =  -  and  R{r)  =  [Ri  \  i^r],  where 

L  J 

Et  is  the  part  of  E{e)  above  the  central  row  e,  and  Ri  is 
the  part  of  R{r)  to  the  left  of  the  central  column  r. 

Now  let  be  given  E'i(ei)  -  ,  £'2(62)  =  p-’-- 

and  Ri{ri)  =  [Rij  \  =  [^2,/  |  ^2,r,- 

Ri{ri)Ei{ei)  =  I  and  R2lr2)R2{^2)  =  R  The  problem 
we  want  to  address  in  this  section  is  the  following. 
Problem. 

Given  the  above  two  filter  banks  (jE'i(ei),  i^i(ri))  and 
(£*2(^2))  ^2(^2)),  does  there  exist  a  filter  bank  (E'12,  ^12) 
such  that 


P  _ 

Ei2  —  rp 

1  -*^2,6  J 

(18) 

R\2  =  [-^1,;  1  R2,r] 

(19) 

and,  of  course,  R12R12  =  R 

End  problem. 

The  answer  is  partly  contained  in  the  following  proposi¬ 
tion,  a  proof  of  which  can  be  found  in  [4]. 

Proposition  IV,  1:  Let  (jE'i(ei),  i^i(ri)) 

and  (£'2(€2),  ^2(^2))  be  two  shift-invariant  perfect  recon¬ 
struction  filter  banks  with  realization  matrix  pairs: 

(mi(ai,6i,ci,di),  mi(ai,6i,ci,Ji)) 

and 

(m2(a2j  ^2j  C2,  ^2)1  ^2(02, 62,  C2,  ^2)), 


respectively.  iF 

ai  a  =  a2y  bi  ^  b  =  62,  di  =  d  =  021^1  =  c  =  C2, 

C2  =  f  X  Cl,  ^2  =  f  di,  62  =  61  X  s,  ^2  =  di  X  and 
5  X  f  =  /, 


E  1 

then  (  . .  ,[Ri  I  \  Rb  r])  is  a  biorthogonal  filter 

L  ^2,6  J 

bank  with  instantaneous  filter  weight  vector  transition 
in  the  decomposition  part  and  instantaneous  impulse  re¬ 
sponse  transition  in  the  reconstruction  part.  ■ 

In  the  above  proposition  we  have  tacitly  assumed  that 
the  two  banks  (£'i(ei),  iii(ri))  and  (£‘2(^2),  -^2(^2))  have 
the  same  number  of  channels.  This  is,  however,  not  a 
restriction.  Indeed,  if  mi(ai,  61,  ci,  di)  is  the  realization 
matrix  of  -  say  -  a  p-channel  filter  ,  then  a  number  of 
such  realizations,  r  say,  in  (time)  succession  can  be  merged 
(by  eliminating  intermediate  states)  to  obtain  a  realization 
matrix  mi(ai,  61,  Ci,  di)  of  an  r  x  p  channel  filter.  The 
realization  matrix  m2(a2, 62>  C2,  d2)  in  the  proposition  will 
then  also  characterize  an  r  xp  filter  bank.  See  the  example 
in  section  VI. 

In  the  next  section,  we  start  from  this  result  to  extend 
the  transition  behavior  from  instantaneous  to  graceful, 

V.  Interpolated  Transition  in  Perfect 
Reconsti-uction  Filter  Banks 

The  matrix  equation  in  the  previous  section  expresses 
perfect  reconstruction.  The  additional  property  is  that 
the  system  is  state  stationary  over  all  time  including  the 
time  instant  of  instantaneous  transition.  The  transition  is 
instantaneous  because  the  transformation  matrices  t  and 
s  are  constant.  If,  on  the  other  hand,  we  let  these  matrices 
be  time  varying,  then  the  transition  will  follow  a  certain 
trajectory  which  we  will  have  to  control  in  ceise  when  we 
want  to  enforce  meaning  on  the  trajectories  of  the  filter 
weight  vectors  in  the  operator  E  as  well  as  the  impulse 
response  vectors  in  the  operator  R,  One  can  envisage 
several  strategies  to  control  the  transition  behavior  but 
we  shall  confine  to  one  of  them  -  spiral  interpolation  - 
which  has  proven  to  be  simple  and  satisfactory. 

Thus  let  {^(n)}  be  the  sequence  of  real  w  x  w  transfor¬ 
mation  matrices  on  the  transition  interval  where 

t(no)  -  I  and  t{nf)  =  t,  t  being  -  for  example  -  the  ma¬ 
trix  t  in  proposition  IV.  1.  Associated  with  this  sequence 

^  Up  to  similarity  transformations. 


is  the  sequence  of  inverse  matrices  {^(n)},  s{n)t{n)  =  /. 
Now  let  t  =  and  s  =  gX~‘^g‘‘^  be  the  eigenvalue 

decompositions  of  i  and  s.  The  eigenvalues  are  either  real 
Aj  or  appear  in  conjugate  pairs  (|  \k  \  |  Ajt  | 

Proposition  V.l:  Let  for  i  —  and  rio  <  n  < 

^/j  Pt(^)  3,nd  7i(n)  be  real  and  monotonically  increas¬ 
ing  functions  from  0  at  rio  to  1  at  nj.  Put  ^(n)  = 
g[diag(pi(n)  \  A,-  |  +(1  -  <(n)  is  real. 

If  9i(n)  =  9'(")  monotonically 

increasing  from  0  at  rio  to  1  at  n/,  s{n)  =  g[diag{qi(n)  \ 
A,“^  I +(1  — is  real,  and  s{n)xt{n)  = 
L  Moreover,  the  transition  filters  are  such  that  the  overall 
system  remains  perfect  reconstruction.  ■ 

VI.  Illustrative  Examples 

A.  Transition  between  filter  banks 

In  this  example,  the  transition  behavior  between  two 
filters  of  lengths  18  and  10  are  studied,  the  structures 
of  both  filters  are  as  shown  in  Fig. 7.  In  the  initial  case 
/(z)  r=  C2Z~^  +  CiZ~'^  —  ciz^  —  C22r^  and  g{z)  =  C4Z^^  + 
csz"^  —  csz^  —  c^z^ .  The  final  filter  has  f{z)  =  C5Z~^  — 
and  g{z)  =  cqZ^^  —  CqZ,  The  transition  in  the  spec¬ 


tral  characteristics  on  the  decomposition  side  is  shown  in 
Fig. 8,  and  Fig. 9  shows  the  transition  between  the  impulse 
responses  at  the  reconstruction  side  corresponding  to  the 
second  channel.  From  the  plots  one  can  clearly  see  the 
decomposition  behavior  of  the  transition  filter  weight  vec¬ 
tors  and  the  gentle  transition  in  impulse  responses  on  the 
reconstruction  side. 


Fig.  8.  Spectra  of  the  transition  filters  on  the  decomposition  side. 


B.  Instantaneous  transition 

Here,  we  consider  the  case  when  a  4-channel  bank 
takes  over  from  a  2-channel  bank.  The  decomposition 
parts  of  the  two  banks  are  shown  in  Fig.  10,  in  shift- 
invariant  state.  In  this  figure,  f{z)  =  —cq.z  -h  cq^z"^^ 
and  g{z)  =  —ci.z-\-  f  is  a  constant  4x4  matrix  . 


Fig.  9.  IVansitions  in  impulse  responses  of  the  second  channel  on 
the  reconstruction  side. 


Fig.  10.  A  2-chaimel  and  a  4-channel  decomposition  part  of  2  perfect 
reconstruction  jfilter  banks 


The  Fourier  transforms  of  the  filter  weight  vectors  of  the 
two  decomposition  parts  are  shown  in  Fig.ll. 

If  in  the  second  filter  in  Fig.  10  the  matrix  t  is  replaced 
by  the  identity,  then  the  resulting  flow  graph  is  essentially 
twice  the  first  filter.  Thus  if  t  is  taken  away  from  the 
second  filter,  then  it  represents  two  time  steps  of  the  2- 
channel  decomposition  filter.  This  filter  can  be  run  for 
a  while  and  then  -  at  n  =  tzo  say  -  t  can  be  cascades 
to  the  two  2-channel  filters  and  start  running  as  a  single 
4-channel  decomposition  filter.  The  take  over  is  instan¬ 
taneous.  That  is,  the  spectral  characteristics  switch  in¬ 
stantly,  at  n  =  Uo,  from  the  top  spectra  in  Fig.ll  to  the 
bottom  spectra.  Moreover,  the  impulse  responses  of  the 
reconstruction  filters  do  also  have  an  instantaneous  tran¬ 
sition  at  n  =  Mo,  as  shown  in  Fig. 12  for  the  fourth  channel 
of  the  two  reconstruction  filters^. 

C.  Interpolated  transition 

Instead  of  appending  the  constant  matrix  t  at  time  in¬ 
stant  no,  we  now  ’spiral’  -  as  explained  in  proposition  V.l 
-  along  the  matrix  trajectory  t{n)  =  g[diag(pi{n)  \  A,*  [ 
-|-(1  —  which  takes  off  at  t{no)  =  I 

and  is  constant  t{nj)  =  t  from  n  =■  nj  on.  And 
similarly  for  the  reconstruction  filter:  the  input  matrix 
starts  off  from  s(no)  =  /  and  spirals  along  the  trajectory 

®The  reconstruction  filters  are  not  shown  as  they  are  easily  ob¬ 
tained  by  reversing  the  direction  of  signal  flow  from  output  to  input 
in  the  filters  from  Fig.lO,  [5]. 
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FTM^uMocy  I 


Fig.  1 1 .  Fourier  transforms  of  filter  weight  vectors  of  2  and  4-channel 
filters 


O.S., 


Fig.  12.  Instantaneous  switching  in  the  impulse  response  corre¬ 
sponding  to  channel  four 

s(n)  =  </[dm<7(gi(n)  1  A.r'  |  +(1  - 

to  end  at  constant  ^(n/)  =  s  at  time  instant  n  =  nj. 
As  explained  in  proposition  V.l,  perfect  reconstruction  is 
guaranteed  whenever  3i(n)  =  ' 

The  smooth  transitions  of  both  the  spectral  characteris¬ 
tics  at  the  decomposition  side  and  the  impulse  responses 
at  the  reconstruction  side  are  clearly  seen  in  the  plots 
shown  in  Fig,  13  for  channel  four.  The  other  channels  tran¬ 
sit  in  a  likewise  gentle  way.  As  a  final  plot,  the  trajectories 
of  the  eigenvalues  of  <(n)  and  s(n)  are  shown  in  Fig.  14. 
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ABSTRACT 


an  MSLMS  filter  is  expressed  els  a  sum  of  2L  microfilters 


The  MSLMS  filter  is  a  new  kind  of  nonlinear  filter, 
which  stems  from  the  filter  bcised  on  the  threshold  de¬ 
composition  approach.  In  this  paper,  the  MSLMS  fil¬ 
ter  is  studied  from  another  point  of  view,  i.e.  that  of 
the  piecewise-linear  (PWL)  filtering  theory.  Prom  this 
study  it  is  seen  that  the  MSLMS  filter  belongs  to  a  sub¬ 
set  of  known  PWL  filters.  It  is  theoretically  investigated 
that  the  use  of  the  MSLMS  filter  is  limited  since  the 
subset  it  belongs  to  hEis  a  drawback  in  approximating 
a  desired  filtering  operator.  This  drawback  is  overcome 
by  another  PWL  filter,  e.g.  the  so-called  CPWL  filter, 
which  may  be  taken  els  a  generalization  of  the  MSLMS 
filter.  An  example  of  our  simulations  is  presented,  which 
verify  the  study. 


1.  INTRODUCTION 

To  overcome  the  Hmitations  of  the  traditional  linear  fil¬ 
ters  in  modern  applications  of  signal  processing,  people 
are  working  to  find  more  powerful  nonlinear  filters.  Due 
to  the  lack  of  a  general  theory,  various  kinds  of  non¬ 
linear  filters  are  developed  on  dififerent  mathematical 
backgrounds.  Among  them  the  theory  of  threshold  de¬ 
composition  serves  els  the  mathematical  bELsis  of  a  large 
family  of  nonUnear  filters,  including  rank-order  filters, 
the  stacker  filter  and  its  generalized  versions,  which  have 
been  proved  useful  in  many  applications.  The  microsta¬ 
tistic  (MS)  filter  is  a  new  member  of  this  family  [1]. 
Instead  of  using  nonlinear  operations  at  the  threshold 
level,  the  linear  operation  is  adopted  in  the  sub-  (micro-) 
filters  to  reduce  the  computation  complexity.  As  a  re¬ 
sult,  the  hnear  system  theory  can  be  used  to  analyze 
the  filtering  performance,  although  the  whole  operation 
is  still  nonlinear.  The  so-called  MSLMS  filter  proposed 
in  [2]  modifies  the  MS  filter  with  a  real  valued  thresh¬ 
old  decomposition  and  an  adaptive  feature,  so  that  the 
complexity  of  the  microstructured  filters  is  further  re¬ 
duced  and  the  performance  is  enhanced.  The  filter  can 
“learn”  to  assign  its  parameters  without  requiring  much 
a  priori  knowledge  of  the  signal  and  noise  statistics. 

Considering  a  real,  discrete- time  sequence  {x(ti)} 
which  can  be  denoted  for  an  iV-long  observation  win¬ 
dow  ELS 


x'^{n)  =  [x(n),  x(n  -  1), . . . ,  a;(n  -  A"  +  1)]  (1) 


L  L 

y{n)  =  ^  yk{n)  =  ^  ajTkxin)  (2) 

k^-L  k^~L 

k^O 


where 

yk{n)  =  alTkx{n)  (3) 

with  Ok  6  denotes  the  fc-th  (linear)  microfilter. 


Tkx{n)  =  [rit[a:(n)],Tfc[a:(n-l)],. . .  ,Tk[xin- N+l)]f 

(4) 


where  Tk[]  is  the  threshold  operation 


T’Uei  = 


0 

^  -  ik-i 
Ik  —  lk-1 


^  <  lk~l 
h-1  <  i  <lk 
lk<^ 


for  ^  >  0  and 


(5) 


T-k[i] 


0  ^  >  l-k+i 

^  l~k-\-l  l-k-^1  >  ^  ^  l-k  (6) 

l-k  —  l~k+l  l~k  >  i 


for  ^  <  0  and  1  <  A:  <  L.  Ik^  ioi  k  =  — L, . . . ,  L,  is  the 
threshold,  which  satisfies 


-oo  l-L  <  -  •  <  lo  <  l\  <  ’  ■  ■  <  II  =  d-oo.  (7) 

Static  and  dynamic  methods  have  been  introduced  in  [2] 
to  estimate  the  thresholds,  and  an  LMS  algorithm  wels 
given  to  adjust  the  parELmeters  ajt-  The  network  struc¬ 
ture  of  the  MSLMS  filter  is  depicted  in  Fig.  1.  The  supe¬ 
riority  of  the  MSLMS  filter  over  a  conventional  adaptive 
linear  filter,  i.e.  the  LMS  filter  in  a  non-Gaussian  signal 
and/or  non-Gaussian  noise  case  has  been  shown  in  [2]. 

The  authors  of  [2]  have  noticed  that  the  MSLMS  fil¬ 
ter  has  a  close  relation  to  nonlinear  filters  based  on  the 
theory  of  piecewise-linear  (PWL)  systems,  but  they  did 
not  give  any  further  discussion  there.  This  paper  studies 
the  MSLMS  filter  from  the  viewpoint  of  the  PWL  the¬ 
ory.  The  PWL  characteristics  of  the  MSLMS  filter  are 
investigated.  This  study  is  thought  of  significant  since 
it  indicates  a  possibility  of  generalization  of  the  MSLMS 
filter.  Moreover,  it  may  be  taken  els  a  part  of  the  effort 
towards  the  goal  of  unifying  different  nonfinear  filters. 

This  paper  is  organized  els  follows.  In  the  following 
section  we  briefly  introduce  the  theoretical  framework 
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of  PWL  filters.  Ba^ed  on  that,  Section  3  studies  the 
PWL  characteristics  of  the  MSLMS  filter.  This  study 
reveals  that  the  MSLMS  filter  mathematically  belongs 
to  a  clcLSS  of  PWL  filters,  i.e.  the  so-called  CPWL  filter. 
It  is  also  discussed  there,  that,  as  a  subset  of  the  CPWL 
filter,  the  MSLMS  filter  has  some  limitations  which  can 
be  overcome  by  the  latter.  In  Section  4,  a  simulation 
example  is  given  to  illustrate  the  result  of  our  study. 

2.  PIECEWISE-LINEAR  FILTERS 

A  filter  is  mathematically  an  operator  between  two  sig¬ 
nal  spaces.  We  consider  here  only  time-invariant  digital 
filters  with  a  finite  support.  Such  a  filter  can  be  ex¬ 
pressed  by  [8] 

y(n)  =  /(x(n))  (8) 

where  /  :  »  R  is  an  iV-vaiiate  real  function.  The 

main  task  in  developing  a  general  meaning  of  the  non¬ 
linear  filter  is  to  find  a  class  of  operators  which  should 
be  able  to  represent  or  approximate,  as  many  as  possi¬ 
ble,  given  filter  operators  and,  on  the  other  hand,  which 
can  be  implemented  in  practice  by  usual  physical  com¬ 
ponents.  To  this  end,  PWL  filters  are  developed.  In  a 
PWL  filter  a  nonlinear  filter  function  /  as  given  in  (8) 
is  approximated  in  global  by  a  series  of  linear  subfunc¬ 
tions  which  are  defined  in  properly  partitioned,  usually 
by  hyperplanes  (boundaries),  subregions  of  the  domain. 
A  very  important  class  of  PWL  functions  is  the  so-called 
canonical  PWL  function  [4]  which  is  up  to  now  the  only 
method  to  represent  all  the  local  subfunctions  as  well  as 
the  domain  partition  of  a  PWL  function  in  a  compact 
explicit  formula.  The  canonical  PWL  function  takes  the 
form: 

<T 

f{x)  —  a-\-  <b,x  >  +  ^Cil  <  ai,x  >  -Pi\  (9) 

t=i 

where  <,>  denotes  the  inner  product  of  two  vectors, 
and  a, Cl  e  R,  6,ai  6  <  ai,x  >  -fii  =  0  for 

i  =  l,2,...,cr  characterize  the  a  hyperplanes,  which 
partition  the  domain  in  small  subregions. 

The  existence  condition  of  the  canonical  PWL  func¬ 
tion  has  been  studied  in  [4].  Although  it  is  not  suit¬ 
able  for  all  kinds  of  PWL  functions,  the  canonical  PWL 
function  has  proved  a  “universal”  approximant,  that  is, 
it  is  able  to  approximate  any  continuous  function  to  an 
arbitrarily  given  precision  [6]. 

Based  on  the  canonical  representation  a  nonlinear  fil¬ 
ter  structure  called  CPWL  filter  is  proposed  in  [5].  The 
CPWL  filter  is  expressed  as 

L 

y{n)  =  uo(n)  -h  ^c/|uz(n)|  (10) 

where  ci  for  Z  =  1,  2, . . . ,  L  are  real  parameters,  and 

N-\ 

ui{n)  =  hi[x{n)]  —  hio  -f  hi{i)x{n  -  i)  (11) 

t=0 

expresses  a  linear  sub-filter  with  hw  being  its  offset  and 
hi{i)  its  impulse  response. 


Because  of  the  universal  approximation  property  of 
the  canonical  PWL  function,  the  CPWL  filter  can  be 
taken  as  a  general  meaning  of  filter  which  is  used  to 
approximate  a  desired  nonlinear  filtering  operator  on  a 
given  support.  Prom  the  implementation  point  of  view, 
the  CPWL  filter  is  also  favorable,  since,  besides  the  ba¬ 
sic  operators  which  are  necessary  for  a  linear  filter,  only 
an  “absolutor”  is  additionally  required,  which  is  easy  to 
implement.  An  adaptive  algorithm  was  developed  in  [5] 
to  adjust  the  parameters  as  well  as  the  weights  of  the 
linear  sub-filters. 

3.  PWL  CHARACTERISTICS  OF  MSLMS 
FILTER 

The  nonlinearity  involved  in  the  MSLMS  filter  is  due 
to  the  threshold  operators  Tk[-]  which  is  a  continuous 
PWL  function  of  one  variable.  Mathematically,  a  linear 
combination  of  PWL  functions  is  still  a  PWL  function 
where  the  domain  partition  is  composed  of  the  bound¬ 
aries  of  the  sub-functions.  If  the  sub-functions  are  all 
univariate  ones,  the  PWL  function  has  a  particular  geo¬ 
metric  property,  i.e.,  it  takes  a  lattice  domain  partition 
where  the  boundaries  are  only  hyperplanes  perpendicu¬ 
lar  to  the  coordinate  axes.  Such  a  function  can  always 
be  represented  as  a  canonical  PWL  function  [4].  In  fact, 
after  a  rearrangement  of  the  formula  (2)  -  (8)  it  is  easy  to 
find  that  the  output  of  the  MSLMS  filter  can  be  written 
as 

N~1  N-l  L-l 

y{n)  =  ao'^bix{n-i)  +  '^  ^  Cij\x(n  -  i)  -  lj\ 

t=0  <=0  ;  =  -(I.-I) 

3  9^0 

(12) 

where  ao,6i,  and  aj  for  i  =  0, 1, . . . ,  iV  —  1;  j  = 
— (L  —  1), . . . ,  (L  —  1),  are  a  linear  combination  of  the 
coefficients  of  the  2L  micro-filters. 

Comparing  (12)  with  (10),  it  is  seen  that  the  MSLMS 
filter  is  a  CPWL  filter.  As  a  special  issue,  the  hnear 
subfilters  characterizing  the  lattice  boundaries  of  the 
domain  partition  are  those  of  x{n  —  i)  —  Ij. 

This  simphfied  issue  has  an  advantage,  i.e.  it  is  easy 
to  deal  with  the  parameters,  especially  with  that  of  the 
boundaries.  Once  all  Ij  being  fixed,  the  determination 
of  the  parameters  ao,  6i,  and  Cij  is  a  linear  programming 
problem  and  easy  to  solve.  However,  such  filters  con¬ 
stitute  only  a  small  subset  of  the  CPWL  family.  It  has 
been  pointed  out  [6]  that  the  universal  approximation 
property  of  the  canonical  PWL  function  is  lost  by  the 
subset  which  has  a  lattice  domain  partition.  A  PWL 
function  of  this  subset  is  incapable  of  approximating  an 
arbitrarily  given  function  to  a  desired  precision,  no  mat¬ 
ter  how  finely  the  domain  is  partitioned.  That  means, 
unlike  the  CPWL  filter,  the  MSLMS  filter  can  in  gen¬ 
eral  not  be  used  to  achieve  a  desired  nonlinear  filtering 
operator.  From  this  viewpoint,  it  can  not  be  taken  as  a 
general  meaning  of  nonlinear  filter  as  the  former. 

4.  A  SIMULATION  EXAMPLE 

Simulations  have  been  conducted  to  examine  the  above 
discussions.  In  all  our  simulations  an  MSLMS  filter  can 
always  be  implemented  (or  learned,  in  an  adaptive  case) 
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by  a  CPWL  filter  with  a  lattice  domain  partition.  On 
the  other  hand,  as  a  generalization  of  the  MSLMS  filter 
a  CPWL  filter  improves  the  performance  of  the  former. 
Here  we  present  the  results  of  an  example  of  the  sim¬ 
ulations.  In  this  example,  a  signal  of  a  8-state  Markov 
chain  is  disturbed  by  a  contaminated  noise  with  the  dis¬ 
tribution. 

where  ^(•)  is  a  normalized  Gaussian  distribution  func¬ 
tion,  ai  is  the  variance  of  the  nominal  distribution  and 
(72  is  the  variance  of  the  outlier,  and  e  governs  the  pro¬ 
portion  of  contamination. 

Three  filters,  i.e.  a)  the  linear  (LMS)  filter,  b)  the 
MSLMS  filter  (L  =  4),  and  c)  the  CPWL  filter  {a  =  4) 
were  used  to  restore  the  signal.  The  results  are  com¬ 
pared  in  Fig.  2.  One  notices  that,  in  this  simulation, 
the  MSLMS  filter  used  more  free  parameters  than  the 
CPWL. 

From  this  example  we  see  that  the  MSLMS  filter  out¬ 
performs  the  conventional  linear  filter  in  achieving  a 
smaller  average  error.  On  the  other  hand  the  perfor¬ 
mance  of  the  MSLMS  filter  is  improved  ty  the  CPWL 
filter,  i.e.  the  latter  can  achieve  a  further  smaller  error 
with  a  relatively  small  number  of  free  parameters.  In 
addition,  the  CPWL  filter  hcis  the  best  performance  in 
restoring  edges  of  the  useful  signal. 

5.  CONCLUSIONS 

The  study  of  the  PWL  characteristics  of  the  MSLMS  fil¬ 
ter  leads  to  the  conclusion  that  this  kind  filter  is  math¬ 
ematically  equivalent  to  a  subset  of  the  CPWL  filter. 
We  do  not  mean  that  the  former  should  already  be  re¬ 
placed  by  the  latter,  (though,  from  the  viewpoint  of 
the  implementation  cost,  such  a  replacement  is  by  no 
means  worthless.)  In  the  CPWL  implementation  of  an 
MSLMS  filter,  the  original  meaning  of  microstatistics, 
i.e.  the  linear  filtering  of  the  signal  at  different  levels, 
becomes  implicit,  while  this  meaning  may  be  significant 
in  some  applications.  Nevertheless,  the  revelation  of 
the  PWL  characteristics  of  the  MSLMS  filter  can  help 
us  to  analyze  its  behaviour,  to  improve  its  performance, 
and  to  generalize  it  for  more  application  possibilities. 
Moreover,  this  study  has  opened  a  way  to  unify  the  two 
kinds  of  nonlinear  filters  developed  on  different  theoret¬ 
ical  backgrounds. 
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ABSTRACT 

A  new  analytical  expression  for  the  variance  of  resulting 
quantization  error  at  the  output  of  recursive  cascade  filter  is 
proposed.  The  final  results  are  given  according  to  the  filter's 
type  and  its  order  (even  or  odd).  A  design  methodology  for 
optimal  recursive  filters  with  minimum  resulting  error  is 
presented  aimed  also  to  facilitate  the  computational  process. 
A  detailed  numerical  example  is  given,  too. 

1.  INTRODUCTION 

One  basic  problem  in  designing  recursive  digital  filters  is 
the  roundoff  effect  in  their  fixed-point  presentation.  There 
are  many  papers  dealing  with  the  quantization  error 
problem.  Part  of  them  are  aimed  to  obtain  new  analytical 
expressions  for  the  resulting  error  at  the  output  of  recursive 
digital  filter  [1,2].  Some  authors  look  for  variance 
approximation  formulas  to  facilitate  the  computational 
process  or  apply  heuristically  approaches  [3,4].  The 
"dynamic  programming",  "minimax  noise"  principle  etc.  are 
used  as  optimization  methods  for  obtaining  the  cascade 
structure  with  a  minimal  variance  of  noise  quantization. 

The  purpose  of  this  paper  is  to  derive  a  new  analytical 
expression  for  the  variance  of  resulting  error  for  an  even-or- 
odd-order  recursive  cascade  filter.  The  derived  relations 
depend  on  the  coefficients  of  the  first-and-second-order 
transfer  functions  and  are  oriented  to  CAD  applications.  The 
calculations  are  based  on  the  transfer  functions,  directly 
found  by  means  of  Butterworth,  Chebyshev  and  Elliptic 
approximations  followed  by  the  BLT  pass,  which  implies  the 
wide  application  of  the  derived  expression.  One  essential 
difference  between  the  new  expression  for  the  variance  and 
the  known  ones  is  the  introduction  of  a  set  of  parameters 
describing  the  noise  spectral  density  of  each  section. 


2.  SOLUTION 

The  variance  of  resulting  quantization  error  at  the  output  of 
the  filter  is  [5]  : 


2 

(Jr  = 


1 

iTT.j 


|z|=l 


(1) 


where  is  the  spectral  density  of  the  output  resulting 


error  : 


•  Ay 

0.(z)=  I  0^(2)  + of  (4  (2) 

(J)'^(z)  describes  the  spectral  density  of  resulting  error  at 

the  output  of  cascaded  filter  caused  by  the  quantization  error 
of  the  /'-th  section,  M  is  the  number  of  sections.  For  even- 
order  filter  M=N/2,  for  odd-order  filter  M=(N+l)/2,  where  N 
is  the  order  of  filter.  The  function  ot(^)  introduced  above 
depends  on  the  error  spectral  density  at  the  output  of  the 
/-th  section  o' (^) : 

M 

(Dt(z)  =  oUz).  nWs(z).Ws(z-')  ^  (3) 

5'  =  /+l 

where  is  the  transfer  function  of  the  S-th  section 

from  the  cascade.  Replacing  (2)  and  (3)  into  (1)  it  is  found  : 

A/-1 


where : 

2?2''7|z|=1  S=i-hl  ^ 

2  1 

To  derive  it  is  necessary  (eq.4)  to  be  taken  into  account 
the  ]Y.{z)  fo™  (see  Table  1).  The  examination  process  for 

lowpass  filter  will  be  shown  here  (the  expressions  for  the 
other  three  types  of  filters  are  similarly  derived). 


Section  i 

Lowpass 

(LP) 

Highpass 

(HP) 

Bandpass 

(BP) 

Bandstop 

(BS) 


_ Second  order 

_ fVi(-) 

///(l  +z~*) 

I  +aiZ~^  +  biZ~^ 

H  ;(l  -Z~') 

1  +a<z~’  +  6,-z~^ 
H>{\  -z~^) 

1  +  +  bi-z~^ 

Hi{\  +c,.z~4z-^)' 

1  +«/-z”'  +  ^/Z'^ 


First  order 

WM) 

H>{\  +2-’) 
1  +  a  i-z'' 

Hi{\  -z-^) 

1  +aiz'' 


Table  1 


0-7803-2516-8/95  $4.00  ©  1995  IEEE 


439 


Consider  the  product  : 

M 

UWs(^)Ws(2~')  =  W2(^)W2(2~b . = 

_  s=2 _ 

M  M 

ri(l+a^-2+6i  Z  )  Y{0-'^ as  Z~^^hs-z~^) 

s=2  s^2 

The  rest  of  products  from  (4)  are  evaluated  at  the  same  way. 
An  important  moment  in  deriving  the  final  relation  for  the 
variance  is  presenting  o'.  in  the  following  form  : 

^  (J  XAi^Bi’Z  '^CrZ^yiAi+  ^  (5) 

(1  -^as'Z  ■^bs'Z^)i\  ^Qs'Z^^^bs'Z^^) 

where^  parameters  describe  the  /-th  circuit 

structure  and  is  the  "white"  noise  variance.  The 

correctness  of  the  upper  relation  was  proved  out  from  the 
investigations  of  different  circuit  structures.  Depending  on 
the  structure  complexity  and  components'  number  (adders, 
multipliers,  delay  elements),  some  of  the  parameters 
AjyBjyCi  ^  equal  to  zero.  After  replacing  <J) 

/  =  1,2,  ...,M  (eq.5)  into  the  common  expression  for  the 
variance  (4)  it  is  obtained  : 


i  t  M,w.-+  I  MmW-  • 

Ll,|=i  ,=1  ^  |,|=l  Z  J 

M 

where;  M/(z)=  U  HI  Y  i(z~')  > 

S  =  i+\ 

M  =  Y  m(^)=  +  ^Cm-z  , 

1  a  M -z  +  0  M  -z 

..  /■-,  Bi  Z  +C,-z^)  (l+z)^^'^  .  (7) 

M 

n(i  -^hsz) 

S=i 

In  order  to  be  solved  the  circulation  integrals  in  (6)  it  is 
necessary  to  prevent  y'.(r)  in  fractional-rational  form  : 

2M-2i  +  2 


,  /=  1,  2, ....  ,M-1.  (8) 


Y^i^)=2W^2 -  ’  ’ 

Z  (Fs-z^) 

S  =  0 

The  binomial  theorem  was  used  to  present  the  following 
product : 

(Ai  +  Bi-z  +Ci  Z^)  i\+zy^^  ’^  =  Ai  + 

+z  ■iAi-CiM-ji'^ Bi)"^ z  ■iAi  C^M-2i'^ Bi-C^iM-iACi)'^ 
+Z^(AiC2M-2i  +  BiC2M-2i  +  C-C2A/-2,)+ . + 

2M-2J-\  I  .  ^A^-2/-l  ,  n  ^M-V-2  ^  ^M-2i-2x, 

+Z  ■\Aj(^2M-2i  '^■Di-^2M-2i  ■^C,'C2A/-2/ 


+Z^^-^''-(A,-  +  BrC2u:2f~'  +  C-CaMlI"')  + 
+z"^'''"‘-(5,-+c,-c2m:2:"Vz^^"^'’"^C/  •  (9) 

where  are  binomial  coefficients.  Comparing  (8)  and 

(9)  the  following  common  expression  for  the  coefficients 
(eq.8)  was  found  : 

Ls = + a-cf«-2- +G  di2, . 

where  5=  0,!,....  ,2M-27+2.  The  denominator  of  y  ,  (z)  do 
not  depend  on  the  type  of  the  filter.  To  determine  p  ^ 

(eq.8)  two  types  computational  formulas  were  proposed  [6], 
according  to  the  S  value  (even  or  odd).  Calculations  of  the 
product  at  the  denominator  of  (7)  have  been  done  for  /  =  1 
and  M  ==  1,  2,..., 10.  The  observed  tendency  have  been 
summarized  for  arbitrarily  M  with  the  aid  of  mathematical 
induction.  In  case  of  odd-order  lowpass  (highpass)  filter 
three  considerations  were  performed  depending  on  the  first- 
order  section  location  in  the  cascade  (at  the  input,  output  or 
inside  -  Table  3).  The  parameters  A^k  ^  B[(  A^k 

describe  the  noise  spectral  density  of  lowpass  (highpass) 
first-order  sections,  k  denotes  the  place  of  the  first-order 
section  in  the  cascade  filter.  The  ^  ^  expressions  for  all 

types  even-order  filters  are  generalized  in  Table  2.  Because 
of  the  more  complex  ff/.(z)  for  bandstop  section  the 

variance  expression  becomes  too  complicated.  So,  besides 
the  exact  variance  formula  a  computationally  more  efficient 
relation  is  proposed  (Table  2),  based  on  the  assumption 
that  the  coefficients  ^ .  (/==1,...,  M)  at  the  nominator  of  the 

transfer  functions  Jj/  .(z)  are  equal.  The  investigations 

showed  that  these  coefficients  are  nearly  equal  for  some 
cases,  so  it  is  aptly  to  apply  the  simplified  variance  formula. 

3.  DESIGN  METHODOLOGY 

A  design  methodology  for  optimal  recursive  filters  with 
minimum  resulting  error  is  proposed  on  the  basis  of 
previously  derived  expressions.  This  methodology  is  a  part 
of  the  Program  Package  for  Digital  Filter  Design  [7]  and  its 
input  data  are  as  follows  :  the  order  of  the  filter  N  (number 
of  sections  M),  the  filter's  type  and  jy  -(z)  coefficients. 

Step  1.  Obtaining  all  circuit  configurations  for  all  possible 
pole-zero  pairings.  This  step  is  obligatory  for  bandstop 
filters  and  for  elliptically  approximated  filters.  The  exact 
number  of  pairings  is  (  A/  !)^'  displacement  of  the 

sections  is  taken  into  account  at  the  next  steps,  so  the 
number  of  pairings  is  M! 

Step  2.  Composing  Ml  structures  for  filter  realization 
differ  each  other  by  the  sequence  of  the  sections  into  the 
cascade.  Filters,  built  up  from  bandstop  sections  needs  this 
step  to  be  done  for  each  structure,  derived  from  step  1 . 

Step  3.  For  each  circuit  configuration  as  an  output  of  Step  2 
the  following  sub-steps  are  required : 
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Filter 

Coefficients  ^  ^  ( i=l . M-1  ;  S=0, .  .  2M-2j+2) 

LP 

Ls  ~  Ai-ClM-2i  Bi-ClM-li'^  Ci-ClM-li 

HP 

Ls  ~  ^•Ai-C2M-2i~^  BrC2M-2i'^^-CrC2M-2i  ’ 

m  =  1  (s-even  number);  m  =  -1  (s-odd  number) 

BP 

S-even 

S-odd 

Ls  =  ^-ArCMli-^-CrCMll'  - 

m  =  1  (s/2-even  number);  m  =  -1  (s/2-odd  number) 

m  =  1  for  (s-l)/2  -  even  number  ;  m  =  -  1  for  (s-l)/2  -  odd  number 

BS 

Exact 

formula 

Ls  -  F  s  [6]  for s  =  i :  ai=Bil A’  bi- Ci!  Ai ;  for  s  #  i ;  Qs-  Cs’bs~^ 

Approx. 

formula 

Ls~  Ai-Ls'^  Bi-Ls-\'^Ci-Ls-2  '  where: 

S /2 

r '  _  V  ^2t  ^sn-t  s  -  even  number 

L>s~  ZjCj  M-i  '^sn+i 

1=0 

t'  _  2/+1  ^(5-i)/2-r,  S  -  odd  number 

Ls-  Zj  Cj  M  i  •C(5  +  l)/2+f 

1=0 

L's  =  0  ,  2M-2/  <  S  <  0 

Table  2 


Filter 

Coefficients 

LP 

k=\ 

Ls  -  Ajc-^2M-2k'^  Bk-C2M-2k  ’  S  =  0,..,2M-2i-t-l 

Ls  =  Ai.ClM-2i  +  Bi  .ClM-2i  +  Ci  .CtJ-2i  ’ 

for  i  =  2..M-l,  S  =  0,..,2M-2/+2 

1<A:<M 

Ls  =  Ai.c!u-2t  +  •  f»'  '-K.  S-0,  .2M.2,-H 

Ls~  Ai  ■C2M-21-I'*'  Bi  •C2M-2i-l'^  Ci  -ClM-ti-l  ’ 

for  1  < /<  k,  S  =  0,..,2M-2/+l 

Ls  -  Ai  •C2M-2i  Bi  .C2iJ-2»  Ci  •C2M-2i  ’ 

for  k<  /■  <  M-1  ,  S  =  0,..,2M-2/+2 

A=M 

Ls  -  Ai  -C 2M -2i-\  Bi  •C2M-2i-\  Ci  ‘C2M-2i-\' 

for  1=1,.. .M-1,  S  =  0,..,2M-2/+l 

HP 

/t=l 

Ls  =  m.Ak  •CiM-2k- m.Bk  ■C2M-2k  -  '  =  >^>  s  =  o,...2m-2i+i 

Ls  =  m .Ai  •C2M-2i  ~  m.Bi  ^C2M-2i  m.Ci  ■C2M-2i  ’ 

for  i  =  2,.„M-l ,  S  =  0,..,2M-2i+2 

Ls  ‘  m.Af  ■Clu-21,-  m.Bt  -CIm-ii:  •  f“  '"k-  s-o...,2m.2(+i 

Ls  -  m.Ai  •C2M-2i-\  ~  m.Bi  •C2M-2i-\  "h  m.Ci  ■C2M-2i-\  ’ 

for  1  <  1<  k,  S  =  0,..,2M-2i+l 

Ls  -  m.Ai -C2M-2i  ~  m.Bi  'CIm-ii  m.Ci  •C2M-2i  > 

for  k<  1  <  M-1  ,  S  =  0,..,2M-2i+2 

*=M 

Ls  -  m.Ai  C 2M-2i-\  -  m.Bi  •C2A/-2/-1  m.Ci  •C2M-2i-\  ’ 

for  1  =  1,...M-1  ,  S  =  0,..,2M-2i+l 

m  =  1  when  s  -  even,  m  =  -  1  when  s  -  odd  -  for  all  HP  cases 


Tables 


3.1.  Choice  of  an  optimal  low-noise  structure  for  each 
section  (from  a  knowledge-base  [8]  of  elementary  circuits). 

3.2.  Computation  of  the  coeflBcients,  describing  the  noise 
spectral  density  of  each  section  :  ^,  ,5,  ,  (7,-  (second  order) 

Ak’Bk^Ak  ^Bk  (first order). 

3.3.  Derivation  of  the  coefficients  £  ^  and  p  ^  from  the 
Table  2  and  Table  3. 


3.4.  Calculation  of  the  circulation  integrals : 

h  =:^.  #  Yiiz  )Yiiz~')-  .  /•  =  i,2...,m 

lz|=l  ^ 

3.5.  Variance  calculation. 


Step  4.  The  variances  for  different  circuit  configurations 
which  was  derived  in  the  upper  steps  are  compared  to  find 
this  with  the  minimum  value. 


4.  EXAMPLE 

A  great  number  of  numerical  examples  based  on  the 
previously  discussed  methodology  have  been  examined  (with 
the  Program  Package  PROSIS  [7]).  The  next  one  will  be 
given  below  :  HR  bandpass  filter  with  specifications  - 
sampling  rate2;Trad/s,  stopband  edges  2  rad/s  and  3  rad/s, 
passband  edges  2,5  rad/s  and  2,8  rad/s,  maximum  passband 
ripple  2.5  dB,  minimum  stopband  attenuation  30  dB. 
Applying  Chebyshev  approximation  we  obtain  order  N=6 
and  with  the  Elliptic  one  -  order  N=4.  For  filter,  based  on 
the  Chebyshev  approximation  the  optimal  cascade  consists 
of  the  P5LL,  PIOLL  and  PIOLL  elementary  sections  (fig.l) 
with  a  minimum  value  ~  1 .507.  (j  (according  to  step 
2  from  the  design  methodology  six  different  connections  are 
available  with  e  [1-507,  7.054],^^  ).  For  elliptically 
approximated  filter  the  variance  depends  on  the  pole-zero 
pairing  and  have  a  minimum  value  2.093.  ^  with 

an  optimal  structure  consisting  of  two  sections  which  circuit 
configuration  is  given  in  [9].  The  parameters  a,  b  and  H 
from  fig.1  correspond  to  the  q.  [y.  and  from  Table  1. 


This  paper  describes  an  alternative  way  for  determination 
the  variance  of  recursive  cascade  fixed-point  digital  filter, 
easy  applicable  in  CAD  design.  The  software  tool  available 
allows  an  optimal  cascade  structures  to  be  realized  as  well  as 
a  new  elementary  low-noise  circuits  to  be  added  to  the 
knowledge-base  in  a  view  of  engineering  application. 
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ABSTRACT 

This  paper  describes  a  unique  multichip  module  technology 
based  on  highly  impermeable  liqmd  crystal  polymers  (LCPs) 
to  interconnect  and  package  Microwave  and  Millimeter  Wave 
Monolithic  Integrated  Circuits  (MIMICs).  Because  of  the  low 
moisture  permeability  of  the  LCPs,  the  packages  can  be  made 
hermetic  without  heavy  expensive  housings,  and  can  be  two  to 
four  times  lighter  and  one-fifth  the  cost  of  conventional  ce¬ 
ramic  based  transmit/receive  (T/R)  modules.  Preliminary  re¬ 
sults  indicate  that  the  LCP  has  a  low  dielectric  constant,  and 
low  loss  transmission  lines  can  be  manufactured  on  LCPs  us¬ 
ing  large  area  processing  techniques  that  provide  a  tremendous 
cost  advantage  over  competing  technologies.  Using  flip  chip 
bonded  MMICs  attached  to  a  high  thermal  conductivity,  low 
coefficient  of  thermal  expansion  substrate,  this  innovative  tech¬ 
nology  can  meet  a  variety  of  commercial,  military  and  space 
requirements. 

1.  INTRODUCTION 

On  account  of  their  ability  to  form  multiple  beams  and  provide 
variable  powCT  among  beams,  phased  array  antennas  using 
GaAs  MIMICs  are  gaining  significance  in  aircraft  and  space- 
based  radar  as  well  as  communication  applications.  These  sys¬ 
tems  require  highly  complex  transmit/receive  (T/R)  modules, 
each  containing  a  large  number  of  GaAs  MMICs  required  to 
meet  the  wide  range  of  system  performance,  and  also  have  the 
following  properties:  low  cost,  small  size,  high  density  hous¬ 
ing  and  thin  profile. 

Although  multichip  packaging  approaches  have  been  in  use 
for  many  years  for  digital  ICs,  packaging  microwave  and 


millimeter  wave  MMICs  is,  in  general,  more  demanding  than 
packages  typically  encountered  for  digital  and  low  frequency 
analog  applications.  These  packages  are  characterized  by  a 
low  lead  count,  high  lead-to-lead  isolation,  tightly  controlled 
transmission  line  impedance  levels,  and  very  low  interconnect 
and  transition  losses. 

At  present,  typical  T/R  modules  contain  several  MIMICs  which 
must  be  interconnected  using  single  or  multilayer  RF  substrates. 
The  interconnect  should  have  low  insertion  loss,  wide  band¬ 
width,  low  dispersion,  small  size  and  ease  of  reproducibility. 
In  addition,  the  substrate  should  be  able  to  incorporate  both 
RF  and  control  signal  lines  as  well  as  DC  bias  lines  without 
mutual  interference  and  buried  compact  structures  such  as  cou¬ 
plers,  filters,  power  dividers,  etc. 

Traditionally  these  substrates  have  metallization  of  both  sides 
and  have  either  a  thick  film  or  a  thin  film  circuitry  on  the  top 
surface,  whereas  the  bottom  surface  is  either  soldered  or  at¬ 
tached  using  epoxy  to  a  metal  base  (4-5).  Alumina  (er~9.9)  is 
typically  used  as  a  substrate  in  the  microwave  frequencies 
whereas  quartz  (Er~3.78)  is  used  in  millimeter  wave  range.  With 
increased  system  requirements  it  becomes  necessary  to  include 
many  GaAs  MMICs  as  well  as  digital  ICs  together  with  thin 
film  bypass  capacitors  and  off-chip  matching  networks  for 
power  amplifiers  on  the  same  package.  One  method  of  solv¬ 
ing  this  problem  is  by  combining  thick  film  and  thin  film  met¬ 
allization  on  the  same  substrate.  The  RF  circuitry  is  on  the  top 
thin  film,  while  a  multilayer  thick  film  interconnect  circuit  is 
at  the  bottom.  Laser  drilled  via  holes  connect  the  thick  film 
and  thin  film  metal  layers.  Although  such  a  technique  has  some 
advantages,  these  substrates  do  have  many  drawbacks  of  which 
the  following  are  of  prime  importance: 
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•  Increased  overall  cost  due  to  non-compatibility  of  thick  and 
thin  film  processes. 

•  Interconnection  vias  limit  the  chip  density  and  also  degrade 
the  electrical  performance. 

•  Thin  film  passive  RF  components  and  network  occupy  a 
large  area  resulting  in  further  decrease  in  chip  density, 
thereby  increasing  the  overall  package  cost. 

•  Need  for  a  hermetic  package  to  protect  the  MMICs. 

Our  unique  approach  solves  these  problems  by  combining  the 
RF,  control  signal  lines  and  the  DC  bias  lines  on  a  single  mul¬ 
tilayer  LCP  substrate. 

2.  THE  TECHNICAL  APPROACH 

Our  approach  is  based  on  exploiting  the  barrier  and  dielectric 
properties  of  liquid  crystal  polymers  to  interconnect  and  pack¬ 
age  MMICs.  To  meet  the  permeability  requirements  the  key  is 
the  utilization  of  the  extremely  low  permeability  of  liquid  crys¬ 
talline  polymers  compared  to  other  polymers.  The  permeabil¬ 
ity  of  Vectra^^  and  other  LCP  films  has  been  measured  be¬ 
cause  of  the  interest  in  their  use  as  barrier  films  by  the  food 
packaging  industry.  Although  unoriented  films  exhibit  good 
barrier  properties,  biaxially-oriented  films  show  orders  of  mag¬ 
nitude  lower  values.  Foster-Miller  has  developed  a  unique  ex¬ 
trusion  process  based  on  a  patented  counter-rotating  die  tech¬ 
nology.  This  process  differs  from  conventional  extrusion  pro¬ 
cesses  in  that  it  biaxially  orients  the  polymer  chains  as  they  are 
being  extruded,  imparting  balanced  mechanical  properties. 
Thus,  due  to  the  low  moisture  permeability  of  the  LCPs,  the 
packages  can  be  made  hermetic  without  heavy  expensive  hous¬ 
ings,  and  can  be  two  to  four  times  lighter  and  one-fifth  the  cost 
of  conventional  ceramic  based  T/R  modules. 

2.1.  RF  Characterization 

In  order  to  fully  exploit  the  dielectric  properties  of  the  LCP  it 
is  necessary  to  determine  the  dielectric  constant  and  loss  tan¬ 
gent  of  the  LCP  material  as  well  as  the  characteristics  of  vari¬ 
ous  planar  transmission  lines  designed  using  this  material  in 
the  microwave  and  millimeter  wave  region.  Traditionally  used 
microstrip  lines  and  coplanar  waveguides  (CPWs),  shown  in 
Figure  1,  were  the  planar  transmission  lines  chosen  to  be  char¬ 
acterized.  Moreover,  the  highly  accurate  microstrip  ring  reso¬ 
nator  techniques  were  used  to  measure  the  dielectric  proper¬ 
ties  of  the  LCP  material. 

For  measurement  purposes  the  LCP  films  were  bonded  onto  a 
20  mils  thick  alumina  substrate  having  metallization  on  one 
side  and  laminated  with  conventional  high  ductility  copper  foil 
of  9  \m  thickness.  The  tooth  structure  of  the  copper  file  gives 
excellent  adhesion  over  a  wide  range  of  temperature,  although 
its  roughness  may  increase  losses  at  higher  frequencies.  Nu¬ 
merous  microstrip  and  CPW  structures  having  a  wide  range  of 


impedances  were  designed  and  placed  on  the  LCP  substrate 
(Table  1).  These  LCP  substrates  have  thicknesses  of  approxi¬ 
mately  4,  5, 6  and  20  mils,  respectively. 

These  microstrip  and  CPW  test  structures  were  characterized 
(S -parameters)  on-wafer,  in  the  0.5-40  GHz  frequency  range, 
using  high  frequency  Cascade  Microtech  Probes  and  an  HP 
85  IOC  network  analyzer.  The  dielectric  properties  of  the  LCP 
were  calculated  using  the  multiple  resonances  in  transmission 
( I  S21 1 )  measurement  in  the  ring  resonator  structures.  Since 
the  resonant  frequencies  are  measured  very  precisely  by  the 
network  analyzer,  the  calculated  dielectric  properties  of  the 
material  are  very  accurate.  Similarly,  the  complete  character¬ 
istics  of  the  transmission  lines,  namely,  characteristic  imped¬ 
ance  (Zq),  effective  dielectric  constant  (eeff )  and  loss  (a)  were 
calculated  from  the  measured  S-parameters. 

Figure  2  shows  the  measured  multiple  resonances  in  a  typical 
ring  resonator  structure  and  Table  2  shows  for  the  first  time  the 
derived  relative  dielectric  constant  (er)  of  the  LCP  substrate. 
From  this  it  is  clear  that  the  LCP  has  a  fairly  uniform  relative 
dielectric  constant  of  3.08  in  the  0.5-40  GHz  range.  Figure  3 
shows  the  measured  ‘Zq’,  ‘£eff’»  ‘cc’  of  a  5012  microstrip 
line  and  an  80f2  conductor-backed  CPW.  These  results  indi¬ 
cate  that  the  lines  have  extremely  low  dispersion  and  uniform 
characteristic  impedance  in  the  0.5-40  GHz  region.  Moreover, 
the  line  losses  of  '-'0.3  dB/cm  (@  35  GHz),  although  slightly 
higher  than  similar  lines  on  Duroid,  are  still  low  enough  for 
packaging  applications.  In  addition,  these  results  were  found 
to  be  in  good  agreement  with  theoretical  predictions.  We  are 
highly  encouraged  by  these  initial  results  which  represent  first 
ever  measurements  on  an  unoptimized  LCP  package  in  the  mi¬ 
crowave  and  millimeter  wave  frequency  band.  Finally,  since 
reducing  the  conductor  surface  roughness  will  result  in  lower 
losses,  we  believe  that  the  RF  performance  of  an  optimized 
LCP  package  in  a  system  will  greatly  exceed  that  of  the  state- 
of-the-art  packages.  Figure  4  shows  the  design  process  of  LCP 
package  technology. 

2.2.  Advantages  of  Our  Approach  Over  Current 
Approaches 

This  unique  and  new  approach  provides  many  advantages  over 
current  packaging  techniques.  Our  approach  uses  the  highly 
reliable  flip  chip  technology  which  is  ideally  suited  for  large 
volume  applications  compared  to  ribbon  bonding.  The  inter¬ 
connects  can  be  designed  either  as  microstrip  lines  or  CPW. 
The  need  for  a  hermetic  package  is  eliminated  due  to  the  highly 
impermeable  characteristics  of  the  LCP,  which  results  in  tre¬ 
mendous  cost  and  weight  savings.  Finally,  this  new  approach 
enjoys  many  advantages  of  the  chips-first  approach  (separa¬ 
tion  of  electrical  and  thermal  paths,  high  device  packaging  den¬ 
sity,  low  inductance,  short  chip  interconnections)  but  overcomes 
the  following  limitations: 
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•  Easier  defective  IC  replacement  -  no  need  to  rebuild  the 
entire  interconnect. 

•  ICs  not  subjected  to  processing  stresses  -  chips  last  versus 
chips  first. 

•  Weight  and  cost  penalty  due  to  the  hermetic  package. 

3.  CONCLUSIONS 

This  paper  describes  a  novel  technique  for  packaging  MMIC 
devices  which  takes  advantage  of  the  unique  dielectric  and 


barrier  properties  of  LCPs.  Preliminary  data  on  microstrip 
line  transmission  lines  on  the  LCP  indicate  that  the  material  is 
suitable  for  microwave  and  millimeter  wave  system  applica¬ 
tions.  Helium  leak  tests  clearly  indicate  that  near-hermetic 
packages  can  be  achieved  without  using  conventional  metal 
or  ceramic  packages.  Elimination  of  a  conventional  hermetic 
package  coupled  with  large  area  processing  techniques  results 
in  a  low  cost  approach  for  MMIC  based  T/R  module  array 
systems. 

Table  L  Dimensions  of  a  Few  Test  Structures 


dielectric  substrate 


ground 


(a)  Microstrip  Line 


(b)  Conductor-Backed 
Coplanar  Waveguide 


(c)  Coplanar  Waveguide 


Structure  Type 

Impedance  (Q.) 

W(pm) 

S(nm) 

Mstrip 

20 

965 

Mstrip 

35 

475 

Mstrip 

50 

275 

M  atrip 

75 

150 

CBCPW 

90 

50 

100 

CBCPW 

66 

100 

150 

CBCPW 

50 

125 

150 

CPW 

110 

50 

150 

CPW 

150 

50 

350 

Length  =  1250  jxm,  2500  |xm 

Mstrip  -  Microstrip,  CPW  -  coplanar  waveguide. 

CBCPW  -  Conductor  backed  CPW 


Table  2.  Results  from  Ring  Resonator  Measurements 


Resonator 

Number 

Resonant 

Frequency 

(GHz) 

Relative 

Dielectric 

Constant 

LCPRR-l-1 

33.02 

3.18 

LCPRR-l-2 

32.15 

3.07 

LCPRR-IM 

10.43 

3.08 

20.81 

3.08 

31.13 

3.08 

LCPRR-lll-1 

6.97 

3.08 

13.93 

3.08 

20.87 

3.08 

27.78 

3.08 

34.66 

3.08 

(d)  Ring  Resonator  Circuit 

Figure  L  (a),  (b)  &  (c)  Cross-section  of  Microstr ip  Line, 
Conductor-Backed  Coplanar  Waveguide  and  Coplanar 
Waveguide  Transmission  Lines;  (d)  Ring  Resonator 
Circuit  Layout 
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Figure  2.  Measured  Transmission  Characteristics 
of  Microstrip  Ring  Resonators 
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Figure  3.  Measured  Characteristics  of  Microstrip  and  Coplanar  Waveguides  on  LCP 


LCP  Rim 
\ 


1 .  Bond  LCP  film  to  a  Rigid  Tool 
Optional  Copper  Foil  for  EMI 
Protection 

2.  Build  RF,  DC  and  Low  Frequency 
Interconnects  Gold  or  Nickel 
Metallization  for  Bump  Interconnects 


Laminate  LCP  Frame 
Test  Substrates 

Bump  MMIC's,  Flip  Chip  Bond  to 
Substrate  and  Test 


5.  Heat  and  Remove  T ool 

6.  Flip  and  Attach  to  a  High  Conductivity 
CTE  Matched  Substrate/Ground 

@  Attach  ICs  With  Thermally  and 
Electrically  Conductive  Material 

(b)  Thermosonically  Weld  LCP  to 
Substrate 


LCP  Frame 
MMIC 


Multilayer  LCP 
RF  and  Digital  I/OS 
High  Conductivity  CTE-Matched 
Substrate  (Ground  Plane) 


Assembled  MMIC  Multichip  Module 
Using  Embedded  Microstrip  Lines 


283-OH-4341-5 


Figure  4.  Process  Flow  of  LCP  Packaging  Technology 
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ABSTRACT 

This  paper  introduces  a  novel  approach  for  the 
characterization  of  high  power  devices  of  total  gate 
periphery  (G.P.)  around  20nim.  Electrical  models  of 
two  power  FET  chip  devices  (G.P.=4.8mm  and 
12.6mm)  are  obtained  from  scattering  parameters 
measurements  based  on  broadband  impedance 
transformers  and  from  DC  pulsed  measurements.  A 
small  scaling  factor  applied  on  these  electrical  models 
allows  to  get  those  of  high  power  devices. 

1.  INTRODUCTION 

High  power  chip  devices  with  large  G.P.  around 
20mm  have  been  developed  in  order  to  delivre  a  high 
RF  output  power. 

The  characterization  of  these  devices  is  required  for 
the  design  of  solid  state  power  amplifiers. 

These  power  devices  are  made  by  several  elementary 
cells  in  parallel  configuration,  consequently  the 
impedance  values  of  device  ports  are  quite  small 
(«1Q). 

The  usual  technique  consists  of  measuring 
S-parameters  of  a  unit  cell  (G.P.«lmm)  on  50-ohms 
impedance  environment,  then  a  large  scaling  up 
(10-20  times)  allows  to  achieve  the  equivalent 
electrical  circuit. 

The  drawbacks  of  this  technique  are  the  following: 

-  Large  scaling  factor  introduces  a  large  error  in  the 
equivalent  electrical  circuits. 

-  Cells  in  power  chip  devices  don’t  have  the  same 
environment  as  a  single  cell  chip  alone,  such  as 
electrical  coupling  effects,  thermal  coupling  effects, 
dispersion  in  the  processes  manufacturing,  ...etc. 


In  this  paper  we  suggest  a  new  technique  of  modelling 
the  high  power  chip  devices  without  the  above 
mentioned  drawbacks. 

This  technique  is  based  on  characterization  of  a  big 
G.P.  device  and  then  using  a  small  scaling  factor  for 
the  very  big  devices  (factor  «2). 

The  use  of  test  jig  which  transforms  50-ohms  in  few 
ohms  at  the  ports  of  device  is  required  for  S-parameters 
measurements.  This  broadband  transformation  is 
realized  by  using  exponential  taper  on  microstrip 
technology. 

A  specific  test  bench  is  used  for  DC  pulsed 
measurements. 

We  present  the  characterization  of  two  power  chip 
devices,  a  MESFET  transistor  with  a  G.P.  of  12.6mm 
and  a  HFET  transistor  with  a  G.P.  of  4.8mm. 

2.  TEST  JIG  CIRCUIT  DESIGN 

In  standard  techniques,  S-parameters 
measurements  of  the  device  under  test  are  made  on 
50Q  impedance  environment. 

Power  chip  devices  have  low  input  and  output 
impedances  (few  ohms)  and  specific  test  jig  is  required 
for  the  S-parameters  measures.  This  test  jig  must 
remove,  in  wide  band  of  frequency,  high  VSWR 
conditions.  The  impedance  transformer  can  be 
obtained,  for  example,  in  using  quarter-wave  length 
cascaded  transmission-line  sections  [1]  or  various 
forms  of  non-uniform  transmission-lines  [2,3]. 
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Figure  la  shows  the  different  forms  of  tapers 
(Tchebyscheff,  hyperbolic,  exponential)  and  figure 
lb  shows  values  of  corresponding  reflection 
coefficients  [4]. 

The  test  jig  used  for  the  characterization  of  the 
power  discrete  chip  devices,  is  made  of  two 
exponential  tapers  fabricated  on  microstrip  alumina 
substrate  of  0.254mm  thickness  (fig.2a),  which 
drops  significantly  the  high  VSWR  with  a  minimum 
of  length  [5].  In  order  to  modellize  the  exponential 
form,  each  taper  is  discretized  as  ten  same  unit 
length  tapered  transmission-line  sections. 

The  electrical  model  of  the  exponential  taper  has 
been  verified  by  experimental  measurements  on  two 
cascaded  tapers  for  several  configuration  of 
connection  with  various  bond  wires  or  ribbons. 
Figure  2b  shows  the  measured  and  modelled 
insertion  loss  of  two  cascaded  exponential  tapers. 

3.  POWER  CfflP  DEVICES 
CHARACTERIZATION 

RF  tests  and  DC  pulsed  tests  have  been  made  on 
commercial  power  FET  chip  devices  (a  MESFET 
with  G.P.  of  12.6mm  and  a  HFET  with  G.P.  of 
4.8mm),  for  several  bias  points. 

A  linear  and  non-linear  electrical  model  of  power 
FET  transistors  developed  by  Alcatel  Espace  are 
shown  in  figure  3. 

The  de-embedding  method  is  applied  to 
S-parameters  measured  on  the  previous  test  jig  with 
input  and  output  tapers,  in  order  to  extract 
S-parameters  of  device. 

The  linear  model  element  values  are  optimized  to  fit 
calculated  and  extracted  S-parameters.  Figure  4a 
and  5a  represent  the  modelled  and  measured 
small-signal  Sn  and  S22  parameters  of  the  MESFET 
and  HFET  devices. 

The  non-linear  model  parameters  are  determined  by 
specific  DC  pulsed  measurements  [6]  on  power 
devices. 

Four  non-linearities  have  been  taken  into  account. 
The  breakdown  current  source,  Idg,  and  the  direct 
current  source,  Ig,  are  modellized  empirically  by 
exponential  equations.  The  capacitor  Cgs  is 
described  by  a  hyperbolic  tangent  form  versus  the 
gate  voltage,  Vg.  The  drain  current  source.  Id, 
defined  by  bicubic  spline  function  [7],  allows  to 
represent  I-V  characteristics. 


Figure  4b  and  5b  shows  the  simulated  and  measured 
I-V  relationship  of  these  devices. 

CONCLUSION 

A  specific  test  jig  has  been  developed  for 
broadband  impedance  transformation  in  order  to 
measure  scattering  parameters  of  power  chip  devices 
with  big  G.P.,  in  addition  to  the  DC  pulsed 
measurements.  Power  chip  devices  of  type  MESFET 
and  HFET  have  been  successfully  characterized. 
High  power  chip  devices  (G.P.‘=<20mm)  are  obtained 
by  a  small  scaling  factor.  This  novel  approach 
allows  to  reduce  errors  which  happen  in  using  a 
more  large  scaling  up. 
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Abstract 

In  this  paper  circuit  design  techniques  for  high 
temperature  electronics  are  presented.  It  will  be 
shown  that  standard  Bulk-CMOS  technology  may 
be  suitable  for  circuits  operating  well  at 
temperatures  up  to  250  °C,  if  special  design 
techniques  are  considered. 

While  using  standard  circuits  at  elevated 
temperatures,  main  difficulties  arise  from  the 
drain  leakage  current  of  MOSFETs  which  is 
amplified  by  a  parasitic  bipolar  transistor  (known 
from  latch-up)  and  therefore  becomes  much 
higher  than  the  original  diffusion  leakage  current 
caused  by  the  pn-junction  of  the  drain-bulk  diode. 
A  low  leakage  current  for  MOSFETs  can  be 
achieved  by  a  special  circuit  design  technique. 
Further  problems  result  from  the  drift  of  the 
threshold  voltage  depending  on  temperature  and 
time. 

As  an  example  a  Commutating  Automatic  Zero 
Operational  Amplifier  (CAZ-OPA)  is  presented 
in  order  to  demonstrate  the  capabilities  of 
standard  Bulk-CMOS.  This  CAZ-OPA  works 
with  a  digital  circuit  for  offset  compensation. 

1.  Introduction 

In  many  fields  of  sensor-applications  it  is 
necessary  to  have  an  OPA  for  signal-conditioning 
close  to  the  sensor  to  achieve  a  good  signal  to 
noise  ratio.  There  are  a  lot  of  dc-coupled 
applications  demanded  for  high  temperature 
operation  (up  to  250  °C)  which  need  OP  As  with 
dc-characteristics  like  high  input  impedance,  low 
input  current  and  low  input  referred  offset 


voltage.  We  still  have  not  found  any 
commercially  available  amplifier  fulfilling  all 
these  requirements  at  the  same  time  while 
operating  at  high  temperature. 

We  like  to  present  a  high  temperature  circuit- 
design  technique  for  integrated  MOS-circuits 
achieving  good  high  temperature  performance 
combined  with  a  well  known  technology  and  low 
price.  A  high  temperature  OPA  with  a  low  input 
referred  offset  voltage  will  be  taken  as  an 
example. 

It  should  be  stated  that  we  are  not  working  in  the 
area  of  special  materials  and  technologies  like 
gallium-arsenide,  silicon-carbide  or  diamond.  Our 
objective  is  to  find  optimized  circuit  designs  so 
that  the  circuit  function  becomes  independent  of 
device  degradation  due  to  high  temperature. 

2.  High  temperature  performance  of 
commercially  available  CMOS  devices 

The  investigation  of  commercially  available 
CMOS  devices  shows  three  main  phenomena 
causing  malfunction  of  systems  at  elevated 
temperatures: 

•  high  input  leakage  currents, 

•  high  input  offset  voltage  and  offset  voltage 
drift, 

•  latch-up  at  high  temperatures  (in  most  cases  at 
about  1 80  °C). 

An  analysis  of  these  phenomena  shows  two  main 
problems: 

1st  Problem:  The  leakage  current  of  reversely 
biased  diodes  increases  exponentially  with  rising 
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temperature  and  causes  input  currents  (protection 
circuitry),  operating  point  drifts  and  latch-up 
problems.  The  main  problem  is  the  parasitic 
bipolar  transistor  that  can  be  found  in  MOS- 
transistors  (figure  1).  This  bipolar  transistor  is 
well  known  from  CMOS  latch-up  and  causes  an 
internal  amplification  of  the  drain  leakage  current 
at  high  temperatures  (e.  g.  with  a  current  gain  of 
Bk20@L  =  2  pm). 

Gate 


Figure  1:  MOS-transistor  with  parasitic  bipolar 
transistor. 

Solution:  By  using  a  bulk-source  reverse  voltage 
it  can  be  achieved  that  the  leakage  current  itself 
will  no  longer  be  the  base  current  of  the  parasitic 
bipolar  transistor,  so  that  no  amplification  will 
occur.  Figure  2  shows  the  drain  current  of  a  n- 
channel  MOSFET  versus  gate  source  voltage  at 
300  °C  for  different  bulk-source  reverse  voltages. 


Figure  2:  /pg  versus  Fj,g  of  MOS  transistor  for 
different  Bulk-Source  voltages  Fgg  at  250  °C. 

Obviously  a  reverse  voltage  of  about  -400  mV  is 
sufficient  to  prevent  the  amplification  and  to 


minimize  the  drain  leakage  current.  The  intro¬ 
duction  of  a  reverse  voltage  means  that  all 
existing  cell  libraries  for  integrated  MOS  circuits 
have  to  be  modified  in  order  to  achieve  a 
separation  between  ^SS  and  substrate  potential. 
First  test  circuits  built  up  with  such  cells  have 
shown  no  latch-up  with  substrate  biasing  at 
temperatures  up  to  250  °C  and  even  more  (latch- 
up  occured  at  »180  °C  without  substrate  biasing). 

2nd  Problem:  A  well  known  problem  using  MOS- 
transistors  at  high  temperatures  is  the  threshold 
voltage  instability  due  to  mobile  ion 
contamination  of  the  gate-oxide.  For  OPAs  this 
instability  causes  a  time  and  temperature 
dependent  offset  voltage  in  the  range  of  ±20mV 
or  even  more.  Figure  3  shows  the  result  of  an 
offset  voltage  measurement  done  at  300  °C.  The 
device  was  a  simple  integrated  differential 
amplifier.  It  can  be  seen  that  an  offset  voltage 
drift  in  the  range  of  20  mV  took  place  in  less  than 
100  hours.  The  final  offset  voltage  is  not  stable, 
but  depends  on  the  current  operating  conditions. 


Figure  3:  Offset  voltage  Vos  versus  time  at  300  °C 
(differential  amplifier). 


Solution:  In  order  to  get  an  OP  A  with  low  offset 
voltage  over  the  full  temperature  range  and 
independent  of  time,  self-calibrating  circuit 
designs  have  to  be  applied.  A  known  principle  for 
normal  temperature  applications  is  the 
Commutating  Automatic  Zero  (CAZ)  amplifier 
with  a  capacitor  as  analog  offset  correction 
storage  (figure  4). 
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Figure  4:  CAZ-Amplifier  with  analog  offset-correction 
(one  of  two  channels). 

This  CAZ-OPA  does  not  work  at  elevated 
temperatures  because  of  the  rather  low 
capacitance  values  available  for  high  temperature 
application  combined  with  the  leakage  currents  of 
analog  switches.  The  capacitor  is  discharged 
during  amplifying  mode  (O,  active),  offset 
correction  information  is  falsified  and  a  high 
offset  voltage  ripple  remains.  Therefore  we 
propose  a  digital  storage  which  is  rather 
independent  of  leakage  currents. 

3.  A  Commutating  Automatic  Zero  Amplifier 
with  digital  offset  correction 

Figure  5  shows  one  of  the  two  identical  channels 
of  a  CAZ-OPA  using  a  lossless  digital  storage  for 
offset  correction.  With  the  input  pins  shorted  the 
offset  information  at  the  output  of  the  amplifier 
has  to  be  converted  to  digital  code.  This  is  done 
by  a  simple  analog-to-digital  converter  with 
A-modulator.  It  consists  of  the  internal  OPA 
itself,  a  comparator,  a  binary  counter,  a  digital-to- 
analog  converter  (DAC)  and  control  logic. 

A  main  advantage  of  this  circuit  is  that  there  is  no 
need  for  any  compensation  capacitors  with  high 
capcitance  values  so  it  becomes  suitable  for 
complete  monolithic  integration. 

Using  the  substrate  biasing  technique  mentioned 
above  we  have  had  no  problem  to  realize  the 
internal  OPA,  the  comparator,  the  switches  and 
control  logic  for  operation  up  to  at  least  250  °C. 
The  influence  of  the  comparator's  offset  voltage  is 


reduced  by  the  open  loop  gain  Vq  of  the  internal 
OPA. 


Figure  5:  One  channel  of  the  CAZ  amplifier  with  digital 
offset  compensation. 

The  remaining  problem  for  complete  monolithic 
integration  in  a  standard  CMOS-technology  was 
to  design  a  suitable  digital-to-analog  converter. 

Another  advantage  of  the  proposed  CAZ-circuit  is 
that  there  are  no  critical  requirements  for  the 
DAC.  Because  of  the  closed  loop  operation  we 
have  no  essential  problems  with  offset  voltage, 
gain  error  or  integral  linearity-error.  The  strongest 
restriction  is  the  differential  linearity-error  that 
limits  the  useable  resolution  of  the  DAC  and 
therefore  the  remaining  offset  voltage  of  the 
CAZ-OPA. 

The  digital-to-analog  converter  has  been  built  up 
with  a  R-2R  resistor  network.  For  reducing  the 
influence  of  the  switches’  leakage  currents  we 
have  chosen  the  voltage  switch  mode.  The 
matching  errors  due  to  the  use  of  standard 
polysilicon  resistors  let  us  expect  a  maximum 
useful  resolution  of  about  8  bits.  For  higher 
resolutions  laser  trimmable  thin  film  resistors  are 
recommended. 

Because  of  a  design-error  in  the  resistor  network 
the  first  converter  realized  has  shown  an  initial 
differential  linearity-error  of  more  than  1  LSB  at 
room-temperature.  Nevertheless,  the  performance 
of  the  converter  is  rather  independent  of  tem¬ 
perature.  Actually  a  resolution  of  7  bit  is  available 
for  the  offset  compensation  purpose  up  to  300  °C 
(figure  6). 
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Therefore  with  the  proposed  digital  offset 
correction  a  supposed  maximum  amplifier  offset 
voltage  of  ±20  mV  can  be  reduced  to  less  than 
240  pV  over  the  full  temperature  range  (worst 
case,  but  with  low  pass  filter  to  reduce  the 
influence  of  the  alternating  DAC  output). 


Figure  6:  DAC's  maximum  differential-linearity  error 
versus  temperature  for  8  bit  operation. 

A  second  attempt  with  corrected  resistor  network 
is  still  undergoing,  but  simulation  promises  8  bit 
resolution  over  full  temperature  range  which  will 
lead  to  further  reduction  of  the  C  AZ-OPA's  offset 
voltage. 

The  investigation  of  a  14  bit  DAC  with  thin  film 
resistor  network  and  substrate  biasing  has  shown 
a  differential  linearity-error  less  than  ‘A  LSB  up  to 
250  °C.  With  the  same  assumptions  this  would 
lead  to  a  remaining  offset  error  of  about  2  pV  for 
the  CAZ-OPA.  Actually  the  decrease  in  offset- 


voltage  by  increasing  the  DAC's  resolution  may 
be  limited  by  the  comparator's  offset  and  the 
OPA's  open  loop  gain  which  typically  increases 
respectively  decreases  with  rising  temperature. 

4.  Conclusion 

Operating  electronic  circuits  and  systems  in  a 
wide  temperature  range  makes  it  necessary  to  use 
adapted  circuit  design  techniques  to  make  them 
independent  of  the  devices’  degradation 
properties.  Especially  bulk  biasing  and  the 
consequent  use  of  self-calibrating  systems  make 
standard  CMOS-processes  suitable  for 
applications  up  to  250  °C  and  even  more.  In  the 
case  of  the  presented  CAZ-OPA  (Commutating 
Automatic  Zero  Operational  Amplifier)  the 
digital  offset  compensation  leads  to  another 
advantage  compared  to  common  CAZ-OPAs:  a 
complete  monolithic  integration  is  possible 
without  external  capacitors  for  offset 
compenstation  purposes. 
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Abstract 

A  MESFET  power  amplifier  has  been  designed 
using  a  large-signal  quasi-static  model  with 
bias  dependent  elements.  This  model  has  been 
derived  from  experimental  S  parameters  and  dc 
measurements. 

The  analysis  and  gain  optimization  of  the 
amplifier  is  performed  by  using  the  describing 
function  technique.  Optimum  bias  device 
conditions  in  C  class  are  obtained  for  maximum 
gain.  Experimental  results  show  an  excellent 
agreement  with  the  theoretical  analysis. 

L  Introduction. 

The  knowledge  of  the  small-signal  equivalent 
circuit  of  the  active  device  is  an  important  part 
of  any  microwave  design  task  that  must  have 
specific  nonlinear  requirements.  In  order  to 
achieve  good  performance  predictions,  the 
appropriate  device  model  parameters  should  be 
extracted  as  accurately  as  possible.  Therefore, 
the  bias  dependency  of  the  different  elements  of 
the  model  should  be  taken  into  accoimt. 

In  this  paper,  a  bias  dependent  nonlinear 
MESFET  model  has  been  derived  for  the  design 
and  analysis  of  a  MESFET  RF-power  amplifier. 
The  device  bias  conditions  are  optimized  in 
order  to  achieve  maximum  gain.  The  RF-  power 
transistor  is  characterized  using  the  describing 
function  technique  where  the  terminal  transistor 
currents  are  assumed  sinusoidal. 


Fig.  1. 

parameters.  Large-signal  information  is 
obtained  by  the  characterization  of  device  S 
parameters  over  a  broad  range  of  operating  bias 
conditions.  In  this  work  the  measurements  were 
performed  at  100  different  bias  points  within  a 
range  where  the  gate-soiuce  voltage  varied  from 
nearly  pinch-off  to  slightly  forward  biased.  The 
drain-source  settings  varied  from  low  to  high 
bias  level  values  where  the  device  was 
saturated.  The  S  parameters  for  each  particular 
bias  condition  were  used  to  determine  a  unique 
set  of  values  for  the  small-signal  circuit 
elements. 

In  order  to  simplify  the  optimization  process 
for  the  circuit  elements,  the  extrinsic  elements, 
(parasitic  resistances  and  inductances),  were 
first  determined. 

The  values  of  the  parasitic  resistances  were 
obtained  under  forward-bias  gate  conditions 
using  the  Fukui’s  method  [1]. 


IL  Determination  of  the  equivalent  circuit 
elements. 

Figure  1  shows  the  conventional  small-signal 
equivalent  circuit  of  a  MESFET.  The  intrinsic 
elements  are  functions  of  the  bias  conditions 
whereas  the  extrinsic  ones  are  considered  to  be 
independent  of  the  operating  point. 

The  element  values  for  this  equivalent  circuit 
are  found  from  dc  and  ac  small  signal 
measurements.  These  values  are  then  optimized 
in  order  to  fit  the  measured  device  scattering 
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Figure  2  shows  the  three  experimental  dc 
current-voltage  measurements  for  the  NEC 
71083.  Although  this  transistor  is  not  a  power 
device,  an  appropriate  test  fixture  was  readily 
available  and  therefore  it  was  used  in  this  work. 
The  values  of  R^,  Rg  and  R^  are  obtained  from 
the  deviation  of  the  slopes  of  the  ideal  diodes. 
The  parasitic  inductances  are  obtained  using  the 
cold-chip  technique  proposed  by  Diamant  and 
Laviron  [2].  The  inductance  values  are 
computed  from  the  imaginary  components  of  the 
extrinsic  device  Z  parameters  at  zero  drain  bias 
voltage.  Figure  3  shows  the  values  of  Lj,  Lg  and 
Ls  as  a  function  of  frequency. 


F(GH^ 


Fig.  3. 

The  experimental  intrinsic  y  parameters  are 
derived  from  the  measured  S  parameters  by  de¬ 
embedding  the  extrinsic  parameters.  The  bias 
dependent  circuit  elements  are  then  determined 
by  fitting  the  analytical  y  parameters  of  the 
equivalent  circuit,  shown  in  Figure  1,  to  the 
experimental  ones.  Figure  4  shows  the  imaginary 
parts  of  yi2  ,  y22  and  yn  versus  frequency  for  a 
particular  bias  condition  [3]. 
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Fig.  4. 

Device  capacitances,  Cjg  and  Cgs,  are  obtained 
from  the  slopes  of  the  graphs.  This  process  is 
repeated  for  different  bias  conditions  in  order  to 
know  the  variation  of  the  capacitances  with  the 
drain-source  and  gate-source  voltages,  as  it  is 
shown  in  figures  5  and  6. 

The  bias  dependence  of  the  capacitances  is 


obtained  fitting  the  experimental  results  to  the 
following  expressions  [4]: 


Fig.  5. 


Qg  Qgo 


c  =  c 


exp(CiF^)+  C2(Fg, 


Finally,  using  the  Curtice-Ettenberg  model  [5], 
the  drain  current,  the  transconductance  and  the 
output  conductance  are  calculated  as  functions 
of  the  bias  voltages. 

III.  Analysis  and  optimization  of  the 
amplifier. 

A  4  Ghz  MESFET  C-class  amplifier  was 
designed  using  the  model  previously  obtained. 
For  the  analysis,  the  describing  function 
characterization  of  the  RF-power  transistor  was 
used  assuming  sinusoidal  components  for  the 
terminal  currents. 

In  order  to  apply  the  describing  functions,  the 
expressions  for  the  mean-values  and  the 
different  harmonic  components  of  the  transistor 
terminal  voltages  must  be  obtained. 
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According  to  the  model  shown  in  Figure  7  and 
considering  the  drain  and  gate  currents  given  by: 

Id=Ici^+Id,'^oifp-(pa)  (3) 

4  =  (4) 

the  transistor  terminal  voltage  components,  Vj 
and  V,,  may  be  resolved  in  the  following  terms: 

\  =  ^G„  +  ^Do  +  +  ^RGSo 

^s,  =  ^Z,G,  +  ^RG,  +  ^G,  +  l^«GS,  +  ^LS,  +  ^RS,  1 

^1,  =  ^RS,  +^Z,S,  +^RGSi  +^G,  +^RD,  +^LD,  J 
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Fig.  8. 


■  Experimental  data 
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Fig.  9. 

system  of  nonlinear  equations  that  must  be 
solved  simultaneously  for  the  unknown 
variables:  terminal  currents  waveform 

parameters  and  the  bias  conditions  Vjs  and  Vgj. 
For  computational  purposes,  the  system  of 
nonlinear  equations  is  divided  into  five 
subsystems  as  [7]: 
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The  contributions  to  the  terminal  voltages  from 
the  linear  elements  are  obtained  from 
straightforward  circuit  analysis  and  Fourier 
series  calculations.  The  contributions,  Va  and 
Vg,  from  the  nonlinear  elements  are  calculated  as 
a  function  of  the  total  charge  Qa  and  Qg 
respectively,  obtained  from  the  experimental 
drain  and  gate  C-V  characteristics.  For  the 
device  used  in  this  work  a  parabolic 
approximation  such  as  [6]: 

^d,g  =  ^d,gQd,g  Pd,gQd,g  (5) 

gives  an  excellent  fit  for  both  charge-voltage 
relationships  as  shown  in  figures  8  and  9. 

Once  the  device  terminal  currents  and  voltages 
are  knovm,  the  describing  functions  are  then 
calculated  as  the  nonlinear  impedances  at  each 
frequency  component. 

The  complete  amplifier  is  represented  by  a 


^  J  ^bias  j  ^load  >  ^ drive  >  ^ opt  |  “  9  (6) 

where  Sbasic  includes  the  charge  balance 
equations  at  the  gate  terminal,  Sbias  is  related  to 
the  bias  circuit,  S^ad  and  Strive  include  the  input 
and  output  matching  circuits  and  Sopt  is  the  set  of 
equations  that  provides  the  optimum  bias 
conditions  for  maximum  gain.  A  good  efficiency 
can  be  obtained  if  the  amplifier  is  made  to 
operate  in  C-class.  However,  the  design  is 
generally  quite  complicated  when  using 
conventional  methods.  The  describing  function 
technique,  used  in  this  work,  proves  to  be 
perfectly  adequate  to  handle  the  mathematical 
complexity  involved. 

Figure  10  shows  the  amplifier  output  power  for 
different  bias  conditions.  A  maximum  output 
power  of  19  dB  is  obtained  at  4  GHz  for  the 
particular  values  of  Vds=  3.5V  and  Vgs  =  -1.55V 
provided  by  the  solution  of  the  system  of 
nonlinear  equations. 
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Fig.  10. 


IV.  Experimental  results. 

The  microstrip  layout  of  the  experimental 
amplifier  [8]  is  shown  in  Fig  11.  The  maximum 
output  power  value  of  18  dBm  is  obtained  at  4.3 
GHz  for  the  biasing  conditions,  Vgs  =  -1.55  V 
and  Vds  =  3.5  V.  These  values  are  in  excellent 
agreement  with  the  theoretical  results  mentioned 
above. 

mamm  mmmm 

^ni 

Fig.  11. 

Also,  the  optimum  values  of  Vds  and  Vgs  provide 
the  flattest  gain  as  a  function  of  the  input  power 
as  well  as  low  distortion  as  shovm  in  Figure  12. 
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Conclusions 

A  C-class  RF  amplifier  has  been  designed, 
analyzed  and  optimized  by  using  a  bias 
dependent  large  signal  MESFET  model.  The 
large  signal  model  has  been  obtained  from 
experimental  S  parameters  and  dc 
measurements.  The  use  of  the  describing 
function  technique  shows  to  be  a  very  powerful 


tool  for  the  nonlinear  analysis  of  the  active 
device  in  C-class.  This  technique  also  makes 
possible  the  appropriate  design  of  the  input  and 
output  matching  networks  and  the  determination 
of  the  optimum  bias  conditions  for  maximum 
gain  The  experimental  results  show  the  validity 
of  the  assumptions  made  a  priori  for  the  device 
terminal  voltages  and  currents  in  order  to  solve 
the  set  of  nonlinear  equations. 
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ABSTRACT 

For  conventional  junction  isolated  silicon  integrated 
circuits,  leakage  currents  increase  with  temperature.  These 
currents  across  reverse  bias  Junctions  can  easily  modify 
circuit  performance  and  even  cause  latch  up.  SOI  (Silicon 
On  Isolator)  devices  greatly  minimize  such  leakage  currents 
[1].  Realizing  high  temperature  linear  circuits  the  SIMOX 
(Separation  by  IMplantation  of  OXygen)  technology  is 
used,  because  this  technology  can  produce  fully  depleted 
thin-film  MOSFETs  with  high  electrical  performance  [2]. 

In  this  paper  a  simple  SIMOX-model,  which  is  verificated 
in  dependence  on  temperature  and  back-gate  potential  is 
presented.  By  the  help  of  this  model  linear  circuits  (i.e.: 
operational  amplifiers  (OPA),  voltage  references,  etc.)  are 
developed,  which  can  be  operated  up  to  300°C. 

Furthermore,  first  long-term  examinations  are  accomplished 
to  predict  the  reliability  of  SIMOX-MOSFETs  at  high 
temperatures. 

1.  INTRODUCTION 

One  of  the  research  activities  of  the  Institute  for  Electronics 
is  circuit  design  for  temperatures  up  to  +300°C.  In  this 
temperature  range  SIMOX-MOSFETs  may  be  used.  These 
devices  cannot  be  modelled  sufficiently  using  standard 
SPICE  MOSFET  models. 

We  developed  a  simple  macromodel,  based  on  the  SPICE 
MOS  Level  1  model,  which  is  capable  of  simulating  a 
SIMOX-MOSFET  in  a  wide  range  of  operating  conditions 
and  uses  physically  established  parameters.  The  quality  of 
this  macromodel  is  verified  by  different  MOSFET  circuit 
stages,  and  a  high  temperature  SIMOX  OPA  with  low 
offset  voltage  and  offset  voltage  drift  by  operating  tempera¬ 
tures  up  to  300°C  has  been  realized. 

2.  SIMOX-MOSFET  BASICS 

In  SIMOX-MOSFETs  the  film  is  separated  from  the  bulk 
by  a  buried  oxide  layer  (Fig.  1).  Due  to  this  the  drain-bulk 


junction  area  is  much  smaller  than  in  conventional  ‘bulk- 
silicon’  technology.  Reduction  of  this  area  significantly 
decreases  the  leakage  current.  Furthermore,  the  fully  de¬ 
pleted  thin-film  SIMOX-MOSFET  possess  the  advantages 
of  less  threshold  voltage  temperature  drift,  higher  integra¬ 
tion  density  and  the  lack  of  latch-up  [2].  Therefore,  SOI- 
MOSFETs  are  very  attractive  for  high  temperature  applica¬ 
tions. 


A  parasitic  property  of  these  transistors  is  the  ‘back-gate- 
effecf .  Regarding  bulk  as  a  second  gate,  a  dependence  of 
the  frontgate  threshold-voltage  on  back-gate  potential 
can  be  observed  [3]. 

2.1  SIMOX-MOSFET  Macromodel  and  Pa¬ 
rameter  Extraction 

In  the  used  macromodel  (Fig.  2)  the  back-gate-effect  is 
considered  using  a  back-gate-voltage  controlled  voltage 
source  in  series  to  the  gate  of  a  conventional  MOSFET 
model  (MOS  Level  1  [4]).  Due  to  the  lack  of  temperature 
dependence  of  the  drain  and  source  resistors  in  SPICE 
MOS-models,  external  resistors  have  to  be  added,  while  the 
internal  resistors  have  to  be  defaulted  to  zero  ohms.  This 
model  has  shown  to  be  valid  for  temperatures  up  to  300°C 
in  the  regions  of  back-surface  depletion  and  the  beginning 
of  the  back-surface  accumulation  region  [5]. 
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Fig.  2:SIM0X  macromodel  based  on  Bulk-MOS 
Level  1 

Parameter  determination  for  a  macro-model  like  this  may 
be  a  problem,  because  most  of  the  existing  model  parameter 
extraction  and  optimization  software  tools  are  limited  to 
conventional  models  or  are  pure  extraction  tools  or  are 
expensive  and  only  available  on  workstations.  Therefore  a 
MS-Windows  PC-based  system  called  MIPEXOS  (Modular 
Interactive  Parameter  Extraction  and  Optimization  System) 
has  been  developed,  which  is  capable  of  extracting  and 
optimizing  parameters  from  measured  data  and  is  not  limi¬ 
ted  to  conventional  semiconductor  models  [6].  In  fact, 
models  can  be  coded  in  a  PASCAL-like  description  lan¬ 
guage  making  it  possible  to  realize  customized  models  like 
the  presented  macromodel  as  well  as  verifying  new  models. 

By  the  help  of  MIPEXOS  and  the  SIMOX  macromodel  we 
have  simulated  the  transfer  characteristics  in  dependence  of 
back-gate-source  voltage  Vcb  (Fig*  3)  and  temperature 
(Fig.  4)  to  show  the  temperature  independent  operating 
point  (ZTC).  The  transfer-characteristics  with  respect  to 
Vcb  can  be  divided  into  three  regions  of  the  back-surface 
state: 


back-surface 
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simulation  and  meas¬ 
urement 
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Fig.  3;  Transfer  characteristic  of  N-SIMOX-MOSFET 
depending  on  Vob 


Fig.  4:  Transfer  characteristic  of  N-SIMOX-MOSFET 
depending  on 'd 


2.2  SIMOX  Circuits 

By  the  help  of  simple  circuits,  i.  e.  current  mirrors,  diffe¬ 
rential  amplifiers,  inverters,  output  stages  and  transmission 
gates  the  performance  of  the  expanded  MOSFET  model  is 
examined  with  regard  to  temperature  and  back-gate  influ¬ 
ence.  Fig.  5  and  Fig.  6  present  good  correspondence  betwe¬ 
en  simulation  and  measurement  characteristics  of  a  source- 
stage  in  series  to  a  source-follower  (see  transistors  M7  - 
MIO  in  Fig.  7)  in  dependence  on  Vcb*  which  is  varied  from 
OV  to  -6V  and  temperature,  which  is  varied  from  50°C  up 
to  290°C.  The  other  stages  show  similar  results. 
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Fig.  5:  Transfer  characteristics  of  a  source-stage  with  a 
source-follower  depending  on  Vch 


In  Fig.  7  one  of  the  first  realized  SIMOX-OPAs  is  presen¬ 
ted.  The  transistors  Ml-MlO  operate  in  the  ZTC  (Zero 
Temperature  Coefficient)-operating  point.  To  reduce  the 
design  and  simulation  effort,  only  parallel  connected  unit 
cells  have  been  used.  Furthermore  an  external  current 
supply  is  necessary  to  influence  the  operating  point.  The 
frequency  compensation  is  reached  by  an  external  con¬ 
nected  capacitor.  Tab.  1  gives  an  overview  of  the  electrical 
characteristics  of  this  universal  frequency  compensated 
OPA  up  to  300°C. 


20°C 

150°C 

250°C 

300°C 

U.JmV 

-4 

-3 

1 

2 

V„n.idB 

96 

92 

89 

76 

CMRR/dB 

127 

128 

129 

- 

Inn/mA 

48 

31 

22 

19 

Tab.  1:  Electrical  characteristics  of  the  OPA 
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Fig.  6:  Transfer  characteristics  of  a  source-stage  with  a 
source-follower  depending  on  temperature 


3.  OFFSET-VOLTAGE  DRIFT  AND  DEG¬ 
RADATION  OF  A  SIMOX-OPA 

Pad  20  Pad  19 


At  each  temperature  the  input  bias  current  remains  smaller 
than  IpA  (without  input  protection  circuit). 

In  spite  of  the  small  offset-voltage  drift,  the  SIMOX-OPA 
possess  an  extreme  high  offset-voltage  degradation  at  high 
temperatures  under  operating  conditions  (Fig.  8),  which 
has  to  be  examined  more  intensive  on  single  transistors  and 
single  circuit-stages. 


h 

Fig.  8:Offset-voltage  versus  time  at  250°C 

3.1  Long-Term  Degradation  of  the  Electrical 
Behaviour  of  SIMOX-Components  at  High 
Temperatures 

Concerning  charge  trapping  effects  near  the  Si/Si02  inter¬ 
face,  MOSFETs  are  the  most  sensitive  devices.  These  ef¬ 
fects  influence  the  threshold  voltage  Vjh  and  the  transcon¬ 
ductance  gm  of  MOSFETs  in  dependence  on  time,  tempera¬ 
ture  and  the  operating  point  [7].  In  the  past  decades,  a  lot  of 
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measurement  methods  were  developed  to  prove  the  quality 
of  the  Si/Si02  interface  [8], 

In  this  paper,  degradation  mechanisms  which  occur  or  are 
accelerated  at  realistic  operating  conditions  and  temperatu¬ 
res  will  be  pointed  out.  First  of  all,  the  electrical  degradati¬ 
on  of  single  SIMOX-MOSFETs,  which  are  operating  in  the 
ZTC,  will  be  considered.  This  measurement  method  is 
called  BTA  (Biased  Temperature  Aging). 

Reliable  long-term  measurements  require  a  reliable  moun¬ 
ting  technology  with  small  long-term  degradation  at  high 
temperatures.  Detailed  examinations  have  shown,  that  the 
thick  film  technology,  which  uses  Al203-ceramics  as 
substrate,  is  excellently  suitable  for  high  temperature  appli¬ 
cations,  if  special  paste  systems  are  used  [5,  9]. 

In  Fig.  9  and  10  the  results  of  long-term  measurements  of 
various  transistors  with  different  width  are  shown.  Smaller 
transistors  possess  higher  long-term  degradation,  which  is 
caused  by  the  existence  of  side-wall  transistors. 


Fig.  9:  N-SIMOX-MOSFET  long-term  drift  at  300°C 


Fig.  10:  Transfer  characteristic  before  and  after  BTA 

4.  CONCLUSIONS 

The  presented  macro-model  has  shown  to  be  valid  and 
useful  for  simple  simulations  under  common  operating 


conditions  with  temperatures  up  to  300°C.  For  more  detai¬ 
led  analysis  specialized  „native“  SOI-models  (which  inclu¬ 
de  leakage  currents,  a  wider  range  of  operating  conditions 
and  other  effects)  have  to  be  used. 

The  short-term  measurements  have  shown,  that  it  is  pos¬ 
sible  to  realize  analog  circuits,  which  can  operate  at  tempe¬ 
ratures  up  to  300°C.  In  further  works  the  electrical  cha¬ 
racteristics,  their  temperature  drift  and  their  long-term  de¬ 
gradation  at  high  temperatures  have  to  be  improved. 
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Abstract  -  Due  to  features  useful  in  microwave  and  millimeter 
wave  integrated  circuit  design,  unconventional  planar 
transmission  line  geometries  deserve  large  attention  in  the 
present  technical  literature.  In  particular,  V-shaped  and  elliptic 
or  circular-shaped  microshield  lines  have  been  analysed  in  recent 
works.  In  this  paper,  accurate  conformal  mapping  procedures 
are  used  to  compare  and  to  complete  previous  results  on  quasi 
TEM  propagation  characteristics,  and  to  investigate  useful 
coupled-line  geometries. 


I.  Introduction 

V-shaped  and  elliptic-  or  circular-shaped 
microshield  lines  have  been  recently  proposed  and 
analysed  in  the  technical  literature,  considering  botli 
homogeneous  and  inhomogeneous  dielectric  media  [1]- 
[3],  and  quasi  TEM  propagation  models. 

In  [1],  an  inhomogeneous  V-shaped  structure  was 
mapped  into  a  rectangular  geometry  by  means  of 
numerical  inversion  of  the  Schwarz-Christoffel 
conformal  transformation  (SCNI).  This  allowed  easy 
application  of  successive  over-relaxation  techniques 
(SOR)  to  the  resulting  smoothed  inhomogeneous  line 
structure,  following  a  numerical  procedure  similar"  to 
the  analytical  one  described  in  [4],  Moreover,  no 
assumption  was  necessary  on  tire  field  behavior  at  the 
dielectric-air  interface. 

In  [2],  a  more  complete  analysis  of  a  V-shaped 
structure  was  performed,  including  coupling 
characteristics,  by  a  moment  method  procedure.  In  [3], 
the  analysis  was  extended  to  elliptic-  and  circular- 
shaped  microshield  lines,  by  means  of  complex 
mapping  procedures  and  assuming  magnetic  walls  at 
the  dielectric  interfaces;  top  conductors  at  finite 
distance  were  considered. 

As  known,  tlie  magnetic  wall  model  leads  normally 
to  good  accuracy  [5]-[8].  However,  it  seems  interesting 
to  compare  various  literature  results  witli  confonnal 
mapping,  using  refined  numerical  techniques  [9]-[ll], 
both  to  allow  transformation  of  complex  geometries 
and  to  discuss  the  accuracy  of  the  magnetic  interface 
assumption.  Moreover,  flexible  coupling  characteristics 
are  obtained  by  means  of  controlling  die  separation 
interval  between  coupled  V-shaped  microshield  lines, 
and  tins  generalises  the  coupled-line  analysis  presented 
in  [2]. 


n.  V-Shaped  microshield  line 

V-shaped  and  elliptic  or  circular-shaped  microshield 
lines  geometries  are  shown  in  Fig.  1.  Open  sided 
structures  are  assumed  for  simplicity,  but  electric  or 
magnetic  wall  can  be  easily  introduced,  and  the  upper 
conductor  can  be  removed. 


b) 

Fig.  1  Line  geometrie.<::  a)  V-shaped  microshield  line;  b)  elliptic- 
shaped  microshield  line. 

In  table  I,  some  characteristic  impedance  values  are 
reported  for  die  V-shaped  geometry  considered  in  [2, 
Table  I  and  Fig.  1].  No  top  conductor  is  considered.  In 
[2]  the  geometrical  parameters  were  h=  240  mils  and 
a=  60  degrees.  The  relative  permittivity  was  er=2.55. 
(a  corresponds  to  2p,  w  to  2a,  and  h  to  h2  of  Fig.  1  a). 
Results  obtained  by  applying  a  SOR  procedure  after  a 
whole  geometry  conformal  mapping,  following  [10], 
are  also  reported. 
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TABLE  I 

COMPUTED  AND  MEASURED  CHARACTERISTIC  IMPEDANCE  VALUES 


w/h 

Z0[Q] 

(SCNI) 
Magnetic  wall 

Z0[O] 

(SCNI  +  SOR) 
Whole  geometry 

Z0[O] 

[2] 

Computed  values 

zo[a] 

[2] 

Measured  values 

0.2583 

106.79 

104.77 

115 

107 

0.5208 

73.20 

71.70 

78 

76 

The  results  show  good  agreement,  further  validating 
the  magnetic  wall  assumption.  However,  it  has  to  be 
noted  that,  for  V-shaped  structures,  the  field  lines  near 
the  external  ends  of  die  dielectric  interface  are  not 
likely  to  be  almost  parallel  to  the  dielectric  interface, 
even  when  interfaces  lengdis  are  small.  This  causes 
errors  in  magnetic  wall  computations  more  relevant 
than  in  traditional  structures.  Decreasing  of  errors  due 
to  increased  relative  permittivity  is  confirmed. 
Characteristic  impedances  have  been  computed  for  die 
V-shaped,  top  conductor  line  in  Fig.  1  a),  for  various  V 
angles  in  inhomogeneous  dielectric.  The  results  are 
reported  in  Fig.  2. 


Charact^istic  impedance  [ohms] 

Fig.  2  Characteristic  impedance  of  the  V-shaped  microshield  line  of 
Fig.  1  a)  with  er=2.55,  h  i=2  mm,  h2=0.4  mm,  for  p=7i:/3,  tc/4  and  7i/6. 

The  effects  of  the  different  V  angles  on  the 
characteristic  impedance  ai*e  well  evident,  showing 
saturation  when  the  V-shaped  ground  is  far  away  from 
the  inner  conductor. 


III.  Elliptic-  and  circular-shaped  microshield  lines 

In  Fig.  3,  characteristic  impedances  are  shown  for 
tlie  geometry  in  Fig.  1  b),  i.e.,  the  elliptic-shaped 
microshield  line  witii  top  conductor  and  thin  inner 
conductor.  The  curves  ate  very  similar  to  those  derived 
from  the  formulas  reported  in  [3].  Magnetic  walls  are 
assumed  in  the  inliomogeneous  dielectric  computations, 
because  tliis  assumption  is  likely  to  lead  to  good  results 
for  this  geometry.  In  fact,  tlie  accuracy  was  analysed  in 


air,  and  results  similar  to  those  derived  in  [8]  for 
coplanar  lines  were  found. 


0  40  80  120  160 
Characteristic  impedance  [ohms] 

Fig.  3  Characteristic  impedance  of  elliptic-shaped  microshield  line  of 
Fig.  1  b)  witli  er=2.55,  hi=  3  mm,  h2=  0.4  mm,  for  b=h2*tanP 
(P=7C/3). 

The  results  obtained  for  the  same  geometry,  but 
assuming  Uiick  inner  conductors  are  shown  in  Fig,  4, 
where  comparison  is  made  with  the  lower  and  upper 
bounds  supplied  in  [3].  Again  the  agreement  is  good, 
as  conformal  mapping  results  are  found  between  the 
bounds. 


Fig.  4  Characteristic  impedance  of  tlie  elliptic-shaped  microshield 
line  of  Fig.  1  b),  inner  conductor  strip  thickness  0.05  mm,  £^=2.55, 
hi=  3  mm,  h2=  0.4  mm,  b=h2'tanp  (p=7c/3).  Dotted  lines  show  lower 
and  upper  bounds  evaluated  by  means  of  formulas  (14)  and  (15)  in 
[3]. 
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SimUar  results  have  been  obtained  for  tlie  circular¬ 
shaped,  top  conductor  microstrip  line,  both  for  thin  and 
thick  strip  conductors  as  shown  in  Fig.  5  and  Fig.  6, 
where  comparison  is  made  with  [3].  The  agreement  is 
good.  The  conformal  mapping  results  are  likely  to 
provide  reliable  data,  as  they  were  compared  in  [14] 
and  [15]  with  various  data  available  for  similar 
geometries,  i.e,  coaxial  striplines  and  cross  inner 
conductor,  circular  outer  conductor  coaxial  lines. 


Fig.  5  Characteristic  impedance  of  the  circular-shaped  niicrosfueld 
line  of  Fig.  1  b)  witli  £^=2.55,  hi=  3  mm,  h2=  0.4  mm,  for  b=h2  tanp 
(P=7C/4). 


This  modification  consists  in  providing  a  conductor 
surface  coplanar  to  the  dielectric  interface  between 
adjacent  V-shaped  outer  conductor  grounds,  allowing 
more  easy  manufacturing  of  the  whole  structure. 

The  whole  geometry  is  shown  in  Fig.  7,  and 
coupling  coefficients  are  shown  in  Fig.  8  and  Fig.  9, 
where  the  coupling  coefficient  is  expressed  as 

coupling  coefficient  =  -201og  ^vcm^odd 

Zeyen+Zodd 

where  Zeven  and  Zodd  are  the  even-  and  odd-mode 
impedances,  respectively.  These  impedances  as  been 
computed  by  assuming  a  vertical  electric  or  magnetic 
wall  on  the  symmetry  axes  of  the  whole  structure.  Note 
that,  due  to  the  flexibility  of  the  transformation 
procedure  no  assumption  on  magnetic  wall  in 
correspondence  of  the  strip  centre  was  necessary  and 
that  half  structures  are  considered  as  a  whole. 

Fine  adjustment  of  the  coupling  coefficient  is 
allowed  by  selection  of  the  distance  2c,  i.e.,  of  the 
curve  parameter.  In  fact,  a  centre  ground  conductor  of 
adjustable  dimension  has  been  introduced,  and  the 
whole  geometry  is  likely  to  provide  a  well-defined  very 
low  coupling  with  small  overall  dimensions,  as  the 
coplanar  waveguide  structure  analysed  in  [16]. 


Fig.  6  Characteristic  impedance  of  tlie  circular-shaped  microshield 
line  of  Fig.  1  b),  with  a  conductor  strip  tliickness  0.02  mm,  £^=2.55, 
hj[=  3  mm,  h2=  0.4  mm,  b=h2-tanP  (p=7i/4).  Dotted  lines  sliow  lower 
and  upper  bounds  evaluated  by  means  of  formulas  (14)  and  (15)  in 
[3]. 


rV.  Coupled  V-shaped  microshield  lines 

Coupled  V-shaped  microshield  lines  have  been 
analysed,  both  with  thin  and  with  thick  strip 
conductors,  to  discuss  the  thickness  effects.  The 
flexibility  of  the  numerical  conformal  mapping 
procedures  allows  to  discuss  the  effect  of  a  slight 
modification  of  the  geometry  with  respect  to  [2,  Fig.  2]. 


Fig.  7  Coupled  V-shaped  microshield  line. 


Fig.  8  Coupling  coefficient  for  thick  coupled  V-shaped  microshield 
line  (B=:60°,  h2=l,  1=0.1). 
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Fig.  9  Coupling  coefficient  for  tliin  coupled  V-shaped  microshield 
line  (B=60^  h2=l). 


V.  Conclusions 

Numerical  conformal  mapping  procedures  have 
been  applied  to  various  microshield  line  geometries, 
and  results  have  been  presented  which  confinn  and 
complete  previous  data.  The  same  procedures  are  likely 
to  be  as  well  effective  in  analysing  similar  geometries 
and,  among  tliem,  tlie  new  sti'uctures  presented  in  [17]. 
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Abstract 

This  paper  describes  the  investigation  of  two  different 
approaches  to  model  microwave  active  devices  using  the 
finite- difference  time-domain  (FDTD)  analysis.  Norton- 
equivalent  and  Thevenin-equivalent  approaches  are  used 
in  the  extended  FDTD  method  to  model  the  interaction 
between  the  three- terminal  active  device  and  the  electro¬ 
magnetic  field  by  placing  equivalent  sources  in  the  active 
region.  A  typical  microwave  amplifier  is  analyzed,  and 
the  simulation  results  agree  well  with  expection. 

1  Introduction 

As  the  size  of  microwave  circuits  decreases,  the  Finite- 
Difference  Time-Domain  (FDTD)  method  provides  a 
powerful  solution  to  the  simulation  of  electromagnetic 
phenomena  of  the  entire  circuit.  The  FDTD  method 
solves  Maxwell’s  equation  in  time  domain  directly  and 
performs  an  accurate  three-dimensional  full-wave  analy¬ 
sis.  Therefore,  the  mutual  coupling  between  elements  ai'e 
accounted  for  automatically.  The  transient  behavior  of 
the  propagating  wave  can  also  be  observed  during  the 
simulation. 

The  FDTD  method  has  been  generally  applied 
to  the  analysis  of  microwave  circuits.  For  passive  cir¬ 
cuits,  frequency-dependent  characteristics  and  the  scat¬ 
tering  parameters  are  studied  [1,  2].  In  addition,  it  is 
extended  to  include  lumped  passive  devices  and  active 
devices.  Passive  components,  sucli  as  resistors,  capaci¬ 
tors,  and  inductors,  are  treated  as  distributed  elements 
and  incorporated  into  the  coefficients  of  Yee’s  algorithm 
[3].  For  two-terminal  active  devices,  this  methodology  has 
been  applied  to  analyze  an  active  antenna  [4].  A  Gunn 
diode  in  the  antenna  is  modeled  as  an  equivalent  active 
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region  and  the  equivalent  lumped  circuit  of  the  diode  is 
incorporated  into  the  FDTD  grid.  For  a  three- terminal 
active  device,  equivalent  current  sources  are  used  to  sub¬ 
stitute  for  the  device,  and  the  value  of  the  current  source 
is  specified  by  the  device  equivalent  circuit  model  [5,  6]. 

The  active  device  is  the  engine  of  microwave  cir¬ 
cuits.  The  simulation  of  microwave  circuits  heavily  de¬ 
pends  on  the  interaction  between  the  active  device  and 
the  electromagnetic  field.  This  paper  compares  simula¬ 
tions  of  two  different  approaches  of  modelng  active  de¬ 
vices  in  the  FDTD  analysis.  One  is  the  Norton-equivalent 
approach  using  current  sources,  and  the  other  is  the 
Thevenin-equivalent  approach  using  voltage  sources.  A 
typical  microwave  amplifier  is  analyzed  using  these  two 
approaches. 

2  Device  Models  in  FDTD  Anal¬ 
ysis 

To  simulate  microwave  circuits,  the  FDTD  algorithm  is 
extended  to  include  active  elements.  The  idea  used  in 
[5,  6)  is  the  Norton-equivalent  approach  which  substitutes 
the  active  device  with  equivalent  current  sources  in  the 
active  region.  Since  the  equivalent  current  source  fully 
characterizes  scattering  properties  of  the  active  device, 
the  current  source  gives  the  same  response  of  the  reflected 
wave  and  the  transmitted  wave  as  that  of  the  active  de¬ 
vice.  Differently  speaking,  the  current  source  also  stands 
for  the  device  current. 

The  implementation  is  to  add  the  current  source 
into  Ampere’s  current  law  as  a  source  term.  The  integral 
form  of  Ampere’s  law  is  then  written  as 

dV 

Ivotal  ”  ^  ^ Devices  (I) 
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where  Ixotai  is  the  total  current  flowing  through  the 
FDTD  cell,  I  Device  is  the  device  current,  V  is  the  voltage 
drop  across  the  FDTD  cell,  and  C  represents  the  space 
equivalent  capacitance  of  the  FDTD  cell.  The  value  of  E 
field  at  integer  time  scale  is  evaluated  from  the  voltage 
drop  V.  Since  the  value  of  I  Device  is  determined  from  the 
device  equivalent  circuit  model,  a  straightforward  way  to 
solve  V  is  to  use  the  equivalent  circuit  of  Eq.(l),  as  shown 
in  Fig.  1(b). 

The  Norton- equivalent  approach  is  suitable  for 
current-controlled  devices.  For  voltage-controlled  de¬ 
vices,  the  Thevenin-equivalent  approach  is  more  appropri¬ 
ate.  Instead  of  using  current  sources,  equivalent  voltage 
sources  are  placed  in  the  active  region  to  substitute  for  the 
active  device.  Similarly,  because  the  voltage-current  rela¬ 
tionship  at  input  ports  of  the  device  agrees  with  the  scat¬ 
tering  property,  the  voltage  source  behaves  in  the  same 
way  as  the  active  device.  The  placement  of  the  voltage 
source  is  shown  in  Fig.  1(c),  where  one  end  of  the  source 
connects  to  the  microstrip  line  and  the  other  end  connects 
to  the  ground  plane  through  vias. 

The  formulation  for  implementing  the  Thevenin- 
equivalent  approach  relies  upon  Faraday’s  Law.  The  in¬ 
tegral  form  of  Faraday’s  law  is  a  dual  form  of  Eq.  (1), 

^Total  “  ^Device  f  (2) 

where  Vxotai  is  the  total  loop  voltage  of  the  FDTD  meshes 
with  voltage  sources,  V^evicc  is  the  device  voltage,  I  is 
the  mesh  current  of  the  FDTD  cell,  and  L  represents 
the  space  equivalent  inductance  of  the  FDTD  cell.  Note 
that  the  value  of  VDevice  is  related  to  the  integration  of 
the  E  field  along  the  FDTD  edge  of  the  voltage  source. 
Thus,  when  updating  field  values  in  the  FDTD  algorithm, 
the  value  of  VDevice  on  an  integer  time  scale  needs  to  be 
determined  as  time  increments. 


3  Results 

The  system  under  consideration,  as  shown  in  Fig.  2,  is  an 
amplifier.  The  entire  circuit  contains  DC  biasing  circuits, 
impedance  matcliing  circuits,  lumped  passive  elements, 
and  a  three-terminal  GaAs  MESFET. 

The  FDTD  simulation  is  performed  with  uniform 
grids,  but  the  staircase  approximation  is  used  for  radial 
stubs  in  the  DC  biasing  circuits.  The  lumped  resistors 
and  capacitors  are  treated  as  distributed  elements  and  in¬ 
corporated  into  the  FDTD  algorithm.  Norton-equivalent 
and  Thevenin-equivalent  approaclies  are  applied  to  model 
the  MESFET.  Since  the  two  approaches  are  duals  of  each 
other  and  both  rely  upon  the  implementation  of  equiva¬ 
lent  sources,  the  simulation  results  are  expected  to  be  the 


same.  The  observed  time  responses  at  port  1  and  port  2 
are  shown  in  Fig.  3.  According  to  these  time  waveforms, 
S-parameters  are  calculated  and  plotted  in  Fig.  4.  Both 
results  are  very  close  to  each  other  as  expected. 


4  Conclusion 


Norton-equivalent  and  Thevenin-equivalent  approaclies 
for  the  modeling  of  the  active  device  have  been  studied. 
With  the  use  of  equivalent  sources,  the  FDTD  method 
has  been  extended  to  include  three-terminzJ  active  de¬ 
vice  and  analyze  the  entire  microwave  circuit  as  a  whole. 
The  two  approaches  are  duals  of  each  other  and  produce 
the  same  results  for  the  microwave  amplifier.  Both  ap¬ 
proaches  maintain  the  features  of  the  full-wave  analysis 
and  perform  accurate  electromagnetic  field  simulations  of 
microwave  and  millimeter  wave  circuits. 
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Equivalent  Current  Source 


Ground  Plane 

(c)  (d) 


Figure  1:  (a)  The  placement  of  equivalent  current  sources  in  the  Norton- equivalent  approach,  (b)  The  equivalent 
circuit  of  the  integral  form  of  Ampere’s  law.  (c)  The  placement  of  equivalent  voltage  sources  in  the  Thevenin- 
equivalent  approadi.  (d)  The  equivalent  circuit  of  the  integral  form  of  Faraday’s  law. 


Power  pad 


Power  pad 


Figure  2:  The  layout  of  a  typical  microwave  amplifier. 
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port  #1 


port  #2 


Figure  3:  Time  responses  using  Norton-equivalent  and  Thevenin- equivalent  approaches  observed  at  port  #1  and 
port  #2. 


Figure  4:  The  calculated  S-parameters  from  the  time  response. 
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ABSTRACT 

Experimental  data  are  analysed 
in  order  to  determine  the  synchronous 
fretpiency  at  which  the  minimum  insertion 
lass  occurs,  in  a  surface  Acoustic  wave 
fSA  WJ  delay  line  .  The  insertion  loss  of  a 
locally  fabricated  delay  line  is  measured  to 
be  29.23  dB  at  a  synchronous  frequency  in 
the  order  of  6  MHz.  The  transmission 
losses  are  deduced  as  a  result  of  the 
comparison  between  the  experimental  data 
of  the  total  insertion  loss  of  the  delay  line 
and  the  theoretical  results  of  the 
transducer  losses  .  This  technique  can  be 
considered  as  a  base  to  the  development  of 
laboratory  experiments  to  acquaint  the 
undergraduate  communication  engineering 
major  with  SA  W  technology . 

1 .  INI  RODUCTION 

SAW  devices  are  now  used  in 
considerable  range  of  applications  in 
modern  electronics  [I], [2]  and  they  play  a 
key  role  in  consumer  communication 
applications  [3]  .  One  of  the  most 

important  factors  that  influence  the 
characteristics  of  these  devices  is  the 
insertions  loss  [4], [5]  .  It  is  therefore, 
necessary  to  understand  all  the  mechanisms 
contributing  to  the  total  insertion  loss  to 
accomplish  the  optimum  design  of  such 
devices  .  Fig.  1,  shows  diagramatically 
various  contributions  that  make  up  the  total 
insertion  loss  (IL).  This  loss  is  devided  into 
two  parts  :  the  transducer  losses  (Lt)  and 


the  transmission  losses  (Lp)  .  Lr  are  the 
losses  due  to  the  conduction  loss,  the 
electrical  mismatch  loss  and  the  bi¬ 
directional  loss.  On  the  other  hand,  Lp  is 
the  loss  due  to  the  interaction  between  the 
surface  waves  and  thermal  lattice  wave,  the 
effect  of  air  loading,  the  presence  of  surface 
imperfections,  radiation  of  bulk  waves, 
beam  steering  and  diffraction  [6]  .  It  is 
diflicult  to  derive  a  theoretical  loss  Lp 
because  it  depends  on  the  fabrication 
conditions  of  the  device  . 


Directional  Ream  spread  and  Directional 


Fig.  L  (a)  Losses  in  a  SAW  delay  line.(b)  Tuned  transducer 
connected  to  source . 


In  this  paper,  we  propose  a  simple 
technique  for  determining  the  transmission 
losses  of  SAW  delay  lines  experunentally. 
Tliis  technique  based  upon  the  comparison 
between  the  experimental  data  of  the  total 
insertion  loss  of  the  delay  line  and  the 
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simulation  results  of  its  transducer  losses . 
Also,  this  technique  can  be  considered  as  a 
base  to  the  development  of  laboratory 
experiments  to  acquaint  the  undergraduate 
communication  engineering  major  with 
SAW  technology  .  It  is  the  first  approach 
indicated  which  has  guided  our  work,  hi 
section  2  ,  a  simple  model  for  calculation 
the  transducer  losses  Lx  is  explained .  In 
section  3  ,  the  measured  delay  line 
insertion  loss  is  analysed  and  compared 
with  the  simulation  results  of  Lx  for 
estimating  Lp  experimentally,  followed  by 
the  conclusions  and  the  more  relevant 
references . 


circuit  description  of  the  transducer  . 
A  simple  analysis  ,  based  on  this  circuit, 
will  .show  that  the  minimum  insertion  lo.ss 
for  untuned  device  is  >  6  dB  . 

According  to  Smith  at  al  [7]  the  IDT  can 
be  represented  by  a  series  circuit  consisting 
of  the  transducer  capacitance  Ci, 
a  radiation  resistance  R  a  (ti)),and  an  acoustic 
reactance Za(o)), Fig. l.(b).  At  the  acoustic 
synchronism  „  ,^„(ry„)=0  and 

(<y  0 )  =  (4  /  (I  /  0  <^V )  ’ 
is  the  electromechanical  coupling 
constant  and  N  is  the  number  of 
interdigital  periods . 


2.  THEORY 

The  basic  building  block  element  of  any 
SAW  device  is  the  delay  line.  It  consists  of 
metallized  interdigital  transducers  (EDT’s) 
produced  by  photoetching  aluminum  on  the 
surface  of  a  piezoelectric  substrate  as 
shown  in  Fig.l  (a)  .  The  input  IDT  converts 
the  received  electromagnetic  signal  into 
two  surface  acoustic  waves  which  travel  in 
opposite  directions  .  Only  one  of  these 
waves  is  intercepted  by  the  receiving 
transducer  and  this  represents  a  3  dB  loss 
in  power  .  A  further  loss  occurs  because 
the  receiving  IDT  can  not  absorb  all  the 
energy  fi'om  the  surface  wave  .  The  electric 
load  determines  the  amount  of  the  loss,  but 
at  best  50%  of  the  energy  is  observed;  the 
balance  is  equally  shared  by  a  reflected 
wave  and  a  transmitted  wave  .  The 
minimum  bi-directional  loss  in  a  two- 
transducer  device  is  therefor  6  dB.  If  the 
transducer  is  interfaced  directly  with 
resistive  source  or  load  (untuned),  the 
interelectrode  capacitance  superimposes  a 
frequency  mismatch  in  the  desired  acoustic 
response,  and  transducer  insertion  loss  is 
excessive.  A  more  mathematical  description 
of  these  effects  follows  from  an  equivalent 


Transducer  loss  is  defined  as  the  ratio 
between  the  power  which  could  be 
delivered  to  a  load  (transducer)  from  a 
matched  signal  generator  and  the  actual 

power  leaving  the  acoustic  part  in  the 
desired  direction  [7]  .  Thus  assuming  a 
current  I  flow  tlnrough  the  transducer 
circuit  in  Fig.  1(b),  and  assuming  for  the 
moment  zero  resistance  in  the  aluminum 
fingers,  transducer  eflSciency  (Te)  can  be 
written  as 

f  1  ^ 

2  (±K^  N]a„CrRG 

n:  -  ^ ^ ^  (I) 

[^a>„CT  Rg  +1 

where  Rg  is  the  soiuce  or  load  resistance  . 
Because  Tg  is  only  the  fractional  power 
transmitted  per  transducer  the  delay  lure 
transducer  insertion  loss  (Lx)  then  becomes 

[L  +i] 

L,=-i-=  ^-7 - J - 1 - (2) 

(Ts)  2[±K^N^a>„CrR^ 
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Miniimim  transducer  inseition  loss  is 
obtained  when  the  transducer  capacitance  is 
adjusted  such  that : 


vStart  and  including  the  iiiipcdance  ol'  the 
lossy  tuning  inductor  .  In  (his  case  (lie  delay 
line  transducer  inserlion  lass  I  -i  is  given  by 


The  transducers  insertion  loss  is  then  : 


(4) 


For  (4  /  ;r )  N  «  1,  these  expressions 
reduced  to  : 


L,.  {dB)=  10  log  4 


•(7) 


Where  Rc  represents  the  conduction  loss  in 
the  transducer  fingers  and  in  the  tuning 
inductor , 


Ra^O))  = 


4k^N 

7t  0}  qCj 


2 


(8) 


and 


cOoCr—  .  (5) 

% 

.  (6) 


Using  (6),  the  minimum  insertion  loss 
for  a  given  substrate  material  and 
transducer  finger  pairs  can  be  determined  at 
the  synchronous  ffequency(tyo) .  The 
transducer  capacitance  required  to  obtain 
minimum  insertion  loss  at  ((y(,)can  be 
calculated  using  (5)  .  Because  it  is 
important  to  keep  transducer  insertion  loss 
at  a  reasonable  and  predictable  level  and 
also  to  minimize  phase  and  amphtude 
distortion  across  the  acoustic  passband,  it  is 
desirable  in  most  cases  to  introduce  a  series 
L  to  tune  out  the  effects  of  interelectrode 
capacitance  Ct  .  Thus,  the  preceding 
simplified  analysis  could  have  been 
performed  more  exactly  by  assuming  a 
nonzero  transducer  resistance  fi-om  the 


4K^N  fSinlx-lx^ 
2x^  ) 


’  (10) 


\  0}^  ) 


3.  EXPERIMENTAL  RESULTS 

hi  order  to  determine  the  SAW  delay 
line  lasses  experimentally,  insertion  loss 
measurements  are  carried  out  on  a  delay 
line  was  locally  fabricated  with  the  available 
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Iccluiology  in  tlic  Ulcclronic  Rcscarcli 
institute,  National  Research  Center,  Cairo, 
Egypt.  Tlie  transducers  were  fabricated  by 
standard  photoetching  techniques  of 
aluminum  deposited  on  LiNbOj  substrate  . 
The  available  techniques  are  limited  in 
resolution  to  line  width  of  about  150  pm, 
which  corresponding  to  fundamental 
frequency  of  interdigital  transducers  of  6 
MHz.  The  number  of  interdigital  periods  of 
each  transducer  is  4,  the  center-to-center 
distance  between  transducers  is  16  mm.  and 
transducer  aperture  width  is  6  mm. 

A  pulsed  sinusoidal  voltage  was  applied 
to  the  input  transducer  and  the  output 
voltage  from  the  output  transducer  was 
observed  on  the  oscilloscope  for  each 
frequency  variation  of  the  input  voltage. 
The  tuning  inductance  was  varied  to 
maximize  the  voltage  output  on  the  scope 
at  (w„). 

The  experimental  data  of  the  total 
insertion  loss  versus  frequency  for  the 
tuned  6  MHz  delay  line  are  given  in  Fig.  2. 
Tlie  following  cubic  polynomial  form  was 
found  to  fit  the  experimental  points  by  the 
least-squares  approximation. 

/L=6l936-9.919F+0.557/^’"  -I-0.031F’  (12) 

Wliere  EL  is  the  insertion  loss  in  dB  and 
F  is  the  frequency  in  MHz  .  Ihitting  the  first 
derivative  of  (12)  equal  to  zero  gives  the 
synchronous  frequency  Fo  =  5.93  MHz,  and 
the  minimum  insertion  loss  ILTOi„=29.23  dB 

A  comparison  of  the  measured  insertion 
loss  described  by  (12)  with  the  theoretical 
loss  calculated  by  using  (7)  is  shown  in 
Fig.  2.  The  transducer  losses  Lj  is 
determined  to  be  24.5  dB  at  Fo  .  In  order  to 
obtain  the  experimental  value  of  the 
transmission  losses  Lp  at  Fo  ,  we  should 


sub.stracl  Li  iVom  IL„,„ .  fhe  Iran.smission 
los.ses  L|.  of  (he  delay  line  are  I'omul 
to  be  4.73  dB  (.see  Fig.2.) 


Fig.2  Theoretical  and  measured  insertion  loss  versus  frequency 
for  a  6M1  Iz  l.iNbOj  SAW  delay  line. 

4.  CONCLUSIONS 

A  simple  technique  has  been  described 
to  determine  the  synchronous  frequency  at 
which  tlie  minimum  insertion  loss  occurs  in 
a  SAW  delay  line  .  Third  order  polynomial, 
obtained  by  a  least-squares  approximation, 
properly  fit  the  experimental  data  of  the 
delay  line  insertion  loss  versus  frequency . 
Furthermore,  a  simple  model  for 
calculating  the  transducers  losses  of  the 
delay  line,  based  on  the  equivalent  circuit  of 
an  interdigital  electrode  transducer,  has 
been  presented  .  The  transmission  losses  of 
the  delay  line  are  deduced  as  a  result  of  the 
comparison  between  the  experimental  delay 
line  insertion  loss  and  the  simulation  results 
of  its  transducer  losses  .  The  technique 
presented  here  not  only  to  demonstrate  the 
fimdamental  behavior  of  SAW  devices,  but 
also  can  be  considered  as  a  base  for 
developing  a  laboratory  experiments  to 
acquaint  the  undergraduate  communication 
engineering  major  with  SAW  technology . 
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ABSTRACT: 

In  computer-aided  analysis  methods, 
the  SPICE  program  is  perhaps  the  most 
popular  because  of  its  simplicity  and 
easy  in  programming.  In  lightly  dampening 
systems  the  solution  with  SPICE  program 
could  extend  over  many  periods  making  the 
computation  costly.  The  modified  nodal 
algorithm  used  with  SPICE  is  poor  in  the 
analysis  of  distributed  and 

lumped/distributed  networks.  In  this 
paper  the  state  space  approach  is  a 
powerful  technique  in  dealing  with  lumped 
/  Distributed,  linear  dynamic  networks  in 
both  time  and  frequency  domains.  The 
results  show  a  considerable  reduction  in 
the  amount  of  time  necessary  to  compute 
the  steady  state  response. 

l-Introduction 

Most  circuit  analysis  is  performed 
generally  in  two  steps.  The  first  step  is 
to  formulate  the  system  of  equations 
describing  the  circuit  using  two 
kirchhoff’s  laws  and  elements 

characteristics  (laws  of  elements).  The 
second  step  is  to  solve  these  equations 
using  suitable  graphical,  analytical  or 
numerical  techniques.  For  linear 
networks,  the  nodal-  analysis  method 

[1]“[2]  is  perhaps  the  most  popular 
because  of  its  simplicity  and  easy  in 
programming.  However,  this  method  has  the 
following  main  disadvantages. 

(1) The  nodal  method  cannot  handle  some 

types  of  dependent  sources  (only 
voltage  controlled  current  sources 
are  allowed). 

(2) Multiterminal  elements  that  have  no 
admittance  matrix  representation 
cause  difficulty  in  nodal-analysis. 

(3) In  frequency  domain  analysis  using 
nodal  method,  the  transfer  function 
of  the  network  must  be  known. 


(4)There  are  problems  at  time 
discontinuities.  Although  the  nodal 
-analysis  method  have  been  extender 
to  analyze  a  lumped/  distributed 
networks,  but  its  analysis  of  all 
types  of  lumped/distributed  elements 
is  still  limited  and  its  analysis 
of  distributed  elements  is  poor  [1]. 

The  state  space  approach  is  a  powerful 
technique  in  dealing  with  lumped/ 

distributed,  linear  dynamic  networks  in 
both  time  and  frequency  domains.  The  state 
equations  of  networks  containing  lumped 
elements  have  been  dealt  with  by  numerous 
authors  [3]-[4].  The  state  equations  of 
networks  containing  transmission  lines  and 
resistive  elements  have  been  derived 
using  topological  approach  l8]-[9].  The 
state  space  technique  was  extended  for 
networks  containing  lumped  and  distributed 
elements. 

The  state  space  technique  eliminates 
the  need  to  know  the  transfer  function  in 
explicit  form  and  removes  any  restriction 
on  the  topology. The  same  form  of  equations 
can  be  applied  to  either  the  time  domain 
or  frequency  domain  without  the  need  of 
the  fourier  transform.  The  same  technique 
can  also  be  applied  to  purely  distributed 
circuits  an  d  to  lumped/distributed  circuits 
whether  the  transmission  lines  are 
commensurate  or  not.  No  problems  exist  at 
the  discontinuities  of  the  time 

response  ,  and  the  time  response  could  be 
obtained  for  any  arbitrary  set  of  inputs 
without  the  use  of  convolution. 

2-  Simulation  Program:- 

(2a)The  state  and  output  equations 

It  has  been  shown  [6]  that  the 
state  variables  of  a  distributed  network 
are  the  reflected  parameters  at  all  the 
transmission  line  terminals  or  ports. 
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The  state  variables  of  a  lumped/ 
distr  ibuted  network  are  the  voltages  on 
all  the  independent  capacitors,  the 
currents  in  all  the  independent  inductors 
and  the  reflected  parameters  (or  the 
reflected  voltages)at  all  the  transmission 
-line  ports.  The  state  equation  is  the 
differential-difference  equation  of  the 
form: 


X  (t) 

1 _ 

X  (t+T) 
2 


A  I  X  (t)  B. 


12  1 


U(t) 


X  (t)  B. 


(1) 


Where  Xl(t)  and  X2(t)  are  the  state 

vectors  of  the  lumped  and  distributed 
elements  respectively  and  u(t)is  the  input 
vector. 

The  output  equation  is  given  by 


y(t)= 


■xi(t)“ 

- +Du(t) 

X2(t)^ 


(2) 


Equations  (1)  and  (2)  are  derived 

using  topological  methods  with  no 
restriction  on  the  network  topology. 
Where  A,B,C  and  E  are  the  matrices  of  the 
state  and  output  equation  of  the  systems. 


(2b)-  Network  Analysis  Using  the  State 
Equations 

The  previous  results  are  used  to 
analyze  lumped/distributed  networks  in 
either  the  frequency  or  time  domain. 

(1) Frequency  domain  analysis  :  is  Based  on 

the  expansion  of  the  equation  (4)  to 
obtain  the  transfer  function  of  the  form. 

t  A  .  S'*  Z' 

L  i  j 

_  (5) 

T"  r 

i.  =  0^j=0 

This  expansion  is  achieved  by  using 
a  modified  faddeeva  algorithm[4]  for  the 
inversion  of  the  two  variable  matrix  in 
equation  (3). 

(2) Time  domain  analysis:  The  output  in  the 
time  domain  is  calculated  from  equation(2) 

T 

when  the  state  vector  [X^^  (t) : 
is  known.  The  state  vector  is  first 
determined  from  the  solution  of  Eq.(l). 
Equation!  1)  represents  two  simultaneous 
differential  cind  difference  equation  : 


I 


and 

X2(t)=A2^X^(t-T)+A22X2(t-T)+B^u(t-T)  (7) 


When  the  Laplace  transform  is 
applied  to  equations  (1)  and  (2)  we 
obtain  the  frequency-domain  equations. 


-1 


where  z  =  exp  sT  ,  I  is  the  identity 
matrix,  m,n  are  the  number  of 

lumped-state  elements  and  transmission 
lines,  respectively. 


T 

The  state  vector  [X^(t)  jX^  (T)]  is 

assumed  to  be  zero  at  t<O.The  vector 
(t)  is  calculated  by  substituting  the 
values  of  ^  (t-T),  ^  (t-T)and  u(t-T)in 

equation(7).The  vector  X^  (t)is  then 
calculated  by  solving  the  differential 
equation(6)  numerically  in  each 

time  period  kt  <  t  <  (k+l)T,  where  k 

=  0,1,2 . 

The  state  and  output  vectors  have 
discontinues  at  t  =  KT  and  the  initial 
values  of  the  state  vector  at  the  start 
of  each  time  period  are  obtained  from 
the  condition  that  the  integral  of  the 
state  vector  is  continuous. 

Example  1:  The  simple  circuit 

shown  in  Fig.  la  was  analyzed  using  the 
state  space  technique  and  the  SPICE  method 
to  show  the  difference  between  them  with 
respect  to  complete  analysis  (time 
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domain,  frequency  domain,  computation 
time).  The  frequency  response  is  given  in 
Fig.  la.  The  impulse  and  step  responses 
are  shown  in  Fig.  lb  and  Fig.  1C. 
Where  R1  =  R2  =1^2,  Zoi=1.626  Z02= 

1.059Q,C  =  L377F,  T=  1  s. 

Example  2:  The  normalized  network 
shown  in  Fig.  2a  shows  that  the  topology 
method  is  a  powerful  technique  in 
dealing  with  lumped/distributed,  linear 
dynamic  networks  and  this  method  removes 
any  restrictions  on  the  topology.  The 
SPICE  program  is  still  limited  and  its 
analysis  of  distributed  elements  is  also 
poor.  This  example  show  that  the  SPICE 
program  can  not  compute  like  this  topology 
of  network. 

3-Comparison  study; - 

Several  examples  were  solved  by 
using  our  state  space  analysis  program  and 
using  SPICE  program.  The  results  show  that 
SPICE  program  is  still  limited  in 

dealing  with  some  topology.  The 
computation  time  for  both  STATE  SPACE 
algorithm  and  SPICE  program  is  given  in 
table-1.  It  should  be  noted  that  the  time 
given  is  the  time  of  establishing  the 
state  equations  and  the  solution  of  these 
equations  to  give  the  output.  The  results 
also  show  a  considerable  reduction  in  the 
computation  time. 


Computation  Time  j 

Step  Frequency! 

Response  Response 

Spice  Program 

State  Space 
Techn  i  que 

^  10.27  Sec  9 . 7  S  ec 

5.27  Sec  6  Sec  ' 

_ 1 

Table-  1 


4-  Conclusions 

The  state  equation  derived  by 
topological  methods  can  form  the  basis  of 
powerful  computer-aided  technique  for  the 
analysis  and  design  of  networks 
containing  both  lumped/distributed 


elements.  The  method  is  shown  to  be  very 
effective  in  reducing  the  amount  of 
computation  time  necessary  f or  steady 
state  solution  of  lumped/distributed 
networks  than  method  used  by  SPICE 

program. 
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ABSTRACT 

In  this  paper,  we  address  the  convergence  model  of  a 
linear  CM  array  in  the  presence  of  noise  and  demon¬ 
strate  how  the  CMA  performs  for  multiple  signals  in 
terms  of  output  amplitude.  We  also  derive  an  expres¬ 
sion  to  evaluate  the  signal  strength  of  a  CM  canceler 
output  and  present  the  SIR  at  each  stage  CMA  output 
assuming  uncorrelated  signals  and  infinite  SNR  at  the 
first  stage. 

1.  INTRODUCTION 

A  new  processing  algorithm,  Constant  Modulus  algo¬ 
rithm  (CMA)  is  an  algorithm  which  optimizes  the  out¬ 
put  based  on  the  a  priori  knowledge  that  the  desired 
signal  is  of  constant  envelope.  As  such,  it  does  require 
neither  a  reference  signal  nor  all  signals  to  be  linearly 
independent  and  is  easily  applicable  to  cellular  environ¬ 
ments  where  many  dependent  signals  due  to  multipath 
exist  and  an  array  with  standard  adaptive  algorithm 
may  not  converge  properly  if  at  all  due  to  the  correla¬ 
tion  matrix  processed  by  the  array  being  close  to  sin¬ 
gular. 

In  this  paper,  we  address  the  convergence  model  of  a 
linear  CM  array  for  narrowband  signals  in  the  presence 
of  noise  in  Section  2.  We  also  derive  an  expression  to 
evaluate  the  signal  strength  at  the  output  of  a  CM 
signal  canceler.  Some  numerical  results  are  presented 
in  Section  3  and  conclusions  are  made  in  Section  4. 

2.  CM  SIGNAL  CANCELER 

Figure  1  shows  a  simplified  block  diagram  of  a  two- 
stage  AT-element  CM  signal  canceler.  At  the  first  stage, 
the  complex  signal  Xi(fc),  received  by  the  zth  element 
of  the  CM  array  in  the  A;th  sampling  interval,  is  multi¬ 
plied  by  an  adjustable  complex  weight  'Wi{k)  and  then, 
summed  to  form  the  array  output  y{k)  given  by  y{k)  = 
x^(k)w{k).  The  CMA  captures  a  desired  signal  and 


suppresses  all  interferences  by  adjusting  the  weight  vec¬ 
tor  w.  The  CMA  is  expressed  as  -{- 1)  ^  w{k)  — 
/i(|2/(fc)P  —  l)y{k)x*{k)j  where  fxis  a.  positive  step  con¬ 
stant  and  the  superscript  *  denotes  the  complex  con¬ 
jugate.  At  the  output  of  the  first  stage,  the  captured 
signal  is  subtracted  from  the  received  array  input  sig¬ 
nals  and  the  resulted  signals  are  fed  into  the  next  stages 
for  further  processing. 

2.1.  Convergence  Model  with  Noise 

We  derive  an  expression  for  the  convergence  of  an  N- 
element  linear  CM  array  with  M  narrowband  signals 
arriving  from  spatial  angles  0^,  m  =  1,  •  •  • ,  M,  when 
zero  mean,  independent  random  noise  processes, 
n  =  0,  •  •  • ,  iV  —  1,  exist  at  the  CM  array  elements.  As¬ 
sume  that  the  transmitted  signals  are  complex,  discrete¬ 
time  processes  given  by  rm(fe)  =  where 

m  =  1,  •  •  • ,  M,  is  the  phase  satisfying  = 

where  Am  is  the  amplitude  of  carrier 
signal,  dm(’)  is  the  data  signal,  and  i/jm  is  an  indepen¬ 
dent  random  variable  uniformly  distributed  over  the 
interval  [— 7r,7r)  for  the  mth  carrier’s  unknown  start¬ 
ing  phase.  We  denote  Cm  as  the  phase  difference  of 
signal  m  between  the  first  two  array  elements.  Then, 
from  Figure  1,  the  input  to  the  nth  CM  array  element 


Figure  1:  A  simplified  N-element  two-stage  CM  signal 
canceler 


0-7803-2516-8/95  $4.00  ©  1995  IEEE 


481 


Zn(fe),  n  =  0,  •  *  • ,  iV- 1,  which  is  the  sum  of  signals  and 
noise,  can  be  represented  in  matrix  form  as  follows  : 

ztx  +  d^ps-\-n  (1) 


where  z  ^  [zo{k)  zi{k)  •  •  •  ZN-iik)]^,  p  is  the  array 
matrix  with  its  column  p^  i  [Am  ■  *  • 
a  signal  vector  s  =  •••  and  n  is  a 

noise  vector  independent  at  each  array  element.  The 
array  output  with  the  input  signals  and  noise  is  given 
by  y  =  w'^z  =  z'^w  and  the  updating  algorithm  of  a 
CM  array  is  written  as  w{k  +  1)  =  w{k)  —  p{\y{k)\'^  — 
l)y{k)z*.  Substituting  y,  we  obtain  w{k-^l)  =  w{k)  - 
p{w’^zz^w*  —  l)z*z'^w,  where  we  used  T  for  trans¬ 
pose,  c  for  conjugate  transpose,  and  *  for  complex 
conjugate.  Define  the  instantaneous  array  output  am¬ 
plitudes  by  V  =  p^w  obtained  from  y 


T 

=  X  w 

s^p^w  =  s^v  and  similarly,  the  instantaneous  noise 
at  the  array  output  by  n^w.  Then  multiplying  p^ 


to  it7(fc  -h  1)  results  in 


where  Qu  i  2X)^=i  \vm?  “  1  and  Qij  =  0  for 

i  ^  j  and  z,  j  =  1,  •  •  • ,  M.  The  terms  after  the  second 
term  are  due  to  the  existence  of  noise.  By  similar  ma¬ 
nipulation  for  the  rest  of  terms,  we  obtain  the  following 
expression  of  the  mean  array  output  amplitudes  : 

=  v-^pNRQv-Vr^  (3) 

where  Vn  is  the  sum  of  E{3rd  term}  through  E{7th 
term}  survived  and  (M  x  M)  matrices  R  and  Q  are  de¬ 
fined  with  its  elements  Rij  t  for  z,  j 

=  and  Qn  i  and 

Qij  =  0  for  z  j  and  i,  j  =  !,•••,  M.  The  array  ma¬ 
trix  A(Ci)  is  defined  by  A(Ci)  =  [1 

We  also  establish  the  relationship  between  the  mean 
array  output  amplitude,  denoted  by  Vi^ki  and  the  mean 
weight  coefficients  at  a  time  instant  fc,  given  by 

«,(l=)=[A«0  ...  A(C„)]  H-' [SlA  ...  ^]’'(4) 


=  V  —  pp^p*{v^  SS^V*  —  l)s*S^V 
-pp'^p*{v'^sr]*  -  l)s*ri 
—pp'^n*(rf'8^v*  -  Vjs'^v 
-pp^n*{7]'^ri*  -  1)7} 

—p{v^SS^V*p^7l*S^  -{-  V^8ri*p^p*8*S^ 

-f  rj^ 8^v*p^p*  8*  8'^ 

-br}^ r}*p'^ p* 8* 8'^  -f  ri^r]*p^n* 8^}v 
— 88^V*p^n* 
+i;^S7y*p^n*  +  r}^8^v*p^p*8* 
■\-if'8^v*p'^n*  +  ?7^77*p^p*s*}97  (2) 

where  we  dropped  the  time  index  k  and  denoted  v{k  + 
1)  by  for  simplicity. 

Assuming  the  signals  and  noise  are  independent,  the 
mean  output  amplitude  is  obtained  by  taking  expecta¬ 
tion  at  both  sides  of  (2)  over  signal  which  includes 
the  data  signal  d(*),  independently  distributed  phase 
and  zero-mean  independent  noise  We  also  notice 
that  averaging  over  the  product  of  signals  results  in  a 
value  of  zero  regardless  of  signal  correlation  due  to  in¬ 
dependent,  uniformly  distributed  random  pha^se  '0,  i.e., 
£;{ejVieJVm}  q  /  ^  ^  Thus,  this  analy¬ 

sis  applies  to  both  uncorrelated  and  correlated  signals 
if  the  carriers  with  the  same  frequency  differ  in  start¬ 
ing  phase.  We  evaluate  the  expression  in  (2)  term  by 
term.  With  the  commonly  accepted  assumption  that 
p^p*{v^ 88^v*  —  l)s*s^  is  uncorrelated  with  v,  and 
denoting  E[vi]  by  Vi,  an  expression  for  the  expected 
value  of  the  second  term  becomes 

jB{2nd  term}  =  pp^p*E{{v^88^v*  —  l)8*s^}E{v} 
—  pp^p*Qv 


where  the  each  element  of  the  array  correlation  matrix 
R  is  written  by  Rij  ^  A^(Ci)A(Cj),  i,j  = 

Using  (3)  and  (4),  we  iteratively  calculate  the  weight 
coefficient  vector  w  as  follows  :  i)  Calculate  iw(0)  with 
some  initial  values  of  v®,  ii)  Calculate  iii)  Calcu¬ 
late  w{k  -h  1). 

2.2.  Output  Amplitudes  of  a  Canceler  Stage 

Since  the  CMA  in  each  stage  may  not  perfectly  sup¬ 
press  interferences  if  the  processing  time  is  limited,  the 
output  of  the  canceler  contains  some  residual  power  of 
the  signals  captured  in  the  previous  stages.  In  addi¬ 
tion,  when  one  signal  power  is  much  smaller  than  the 
others,  the  CMA  takes  much  more  time  to  suppress  all 
the  interferences  than  when  all  signals  have  the  same 
power  (see  Section  3).  If  the  first  CM  canceler  stage 
does  not  remove  the  first  desired  signal  completely,  it 
passes  some  residual  power  of  the  signal  to  the  sec¬ 
ond  stage.  Due  to  this  residual  signal,  the  second  CM 
canceler  again  passes  more  residual  signals  to  the  next 
stage  since  imperfect  capturing  and  deleting  occur.  As 
the  number  of  canceler  stages  increases,  the  sum  of 
residual  power  increases  and  eventually  limit  the  per¬ 
formance  of  the  signal  canceler  by  capturing  the  same 
signal  captured  at  one  of  the  previous  stages. 

Denote  the  input  to  the  Ith  element  of  the  first 

stage  CMA  by  =  Z)m=i  where  the  inter¬ 

element  delay  with  respect  to  the  first  array  element 

C[^  i  1  =  0, -  1,  for  the  mth  sig¬ 

nal.  The  output  of  the  first  stage  CMA  is  given  by 
j/(^)  =  aTOVm\  where  the  array  gain  for  the  mth 

signal  The  output  of  the  first 

canceler  stage  which  is  the  input  to  the  Ith  element  of 
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the  second  stage  CMA  is  given  by  — 

The  canceler  weight  h!p  is  obtained  by  minimizing  the 

canceler  output  power,  i.e.,  minJ5J{|a;p^p}.  With  un- 
correlated  signals  and  no  noise  assumed,  we  obtain  a 
general  expression  for  the  input  signal  to  the  A:th  stage 
CMA  as 


m=l 

M 


Em 

m=l  ^Ir 

Em  I 
m~\  I 


where  -  hj*'  and  the  signal 

power  Pm.in  =  Notice  for  two  signals  that 

if  the  interference  is  completely  suppressed  at  the  first 
stage  CMA  output,  i.e.,  =  0,  then  the  canceler 

output  contains  only  the  interfering  signal  by  eliminat¬ 
ing  the  desired  signal,  i.e.,  •  We  can  also 

show  that  an  equivalent  general  form  can  be  expressed 


^1  (n)>» 


E  -  E 


=  E  (6) 

m=l 

where  ^  Z)m=i  E[\fm^\‘^].  The  expression 

in  (5)  is  represented  in  terms  of  the  equivalent 
inter-element  phase  delay  taking  the  gain  and  the  phase 
shift  introduced  at  each  stage  into  account  while  (6) 
is  represented  in  terms  of  equivalent  input  amplitudes 

(k'\ 

fm-  The  CMA  output  at  each  stage  then  can  be  ex¬ 
pressed  as 


=  E“ 


,(fe)r(i) 

m  '  m 


= 


Im 


^Im  -  ^Im  ~  Otm 

^(«)  _  Z^m=l  ^Im  ^m,in  , 

Z^m=l  1^^  I  ^rjiyin 

with  an  initial  value  of  c\^  =  for  /  =  0,  •  •  • ,  iV  - 

1  and  m  =  . 


3.  NUMERICAL  RESULTS 


The  simulation  was  performed  with  an  eight-element 
array  {N  =  8)  and  two  signals  arriving  from  angles  of 
6\  =  35.3  and  62  =  23.8°.  The  signal  power  was  1.125 
(i.e.,  amplitude  of  1.5)  for  both  signals  and  a  CM  ar¬ 
ray  was  initialized  by  =  [1.1  0.9]^.  Additive  white 
Gaussian  noise  was  added  at  each  array  element.  Fig¬ 
ure  2  shows  the  results  of  calculation  and  simulation. 
When  SNR  =  20  dB,  the  output  amplitude  of  the  cap¬ 
tured  signal  approaches  1  and  the  interferer  is  success¬ 
fully  suppressed.  For  SNR  =  0  dB,  however,  the  output 
amplitude  of  the  captured  signal  converges  to  the  value 
less  than  1  and  the  interferer  has  relatively  big  residual 
strength.  In  Figure  3,  we  show  how  much  the  noise 
affects  to  suppression  of  interferer.  A  dashed  line  rep¬ 
resents  captured  signal- to-interference  ratio  (SIR)  vs. 
input  SNR.  We  also  show  the  output  SNR,  which  is 
less  than  the  theoretical  maximum,  when  an  interferer 
exists  (the  solid  line  represents  the  SNR  achieved  by  the 
array  when  no  interferer  exists).  We  observed  for  one 
signal  that  even  though  the  array  achieves  the  maxi¬ 
mum  gain  in  SNR,  the  amplitude  of  the  captured  signal 
may  not  be  equal  to  1.  This  is  shown  in  Figure  4  along 
with  the  characteristic  of  Least  Mean  Square  (LMS)  ar¬ 
ray  [4]  (also  see  Figure  2  for  two  signals).  This  may  be 
because  the  relatively  strong  noise  limits  the  correction 
of  error  functions  of  the  control  algorithms.  Notice  that 
for  the  CMA  the  noise  effect  on  the  Qutput  amplitudes 
can  be  somewhat  reduced  by  increasing  the  number  of 
array  elements  while  not  for  the  LMS. 

Figure  5  shows  the  suppression  of  an  interferer  with 
unequal  power.  When  an  interferer  is  much  weaker 
than  the  other  signals  including  the  desired  signal,  it 
takes  more  time  for  the  interferer  to  be  completely  sup¬ 
pressed.  Finally,  Figure  6  shows  SIR  at  each  canceler 
stage  for  signals  shown  in  Figure  5  but  with  equal  power 
and  SNR  =  00  at  the  first  stage.  As  one  may  expect, 
the  SIR  decreases  as  the  number  of  stages  increases. 
For  iteration  =  1000  and  1500  at  each  stage,  interfer- 


Number  of  Karatior 


We  verify  our  convergence  model  in  Section  2.1  by  com-  Figure  2:  Output  transients.  N  =  8,  M  =  2,  =  35.3° , 

paring  numerical  evaluation  and  computer  simulation.  ^2  =  23.8  ,  Pi, in  =  P2,in  =  1.125,  =  [1.1  0.9]^. 
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Figure  3:  Noise  effect  on  output  power  ratios.  N  =  8, 
M  =  2,  SIR  =  0  dB,  01  =  35.3^  02  =  23.8°,  Pi, in  = 
P2,in  =  1.125,  =  [1.1  0.9]^.  (the  solid  line  is  for  N 

=  8,  M  =  1,  01  =  35.3°,  Pi, in  =  1.125). 


Figure  4:  Noise  effect  on  converged  output  amplitude. 
For  the  CMA,  N  =  8  and  3,  0i  =  35.3°,  Pi, in  =  1*125 
(the  dashed  line  for  the  LMS  is  obtained  from  [4]), 

ences  are  relatively  deeply  suppressed  at  each  CMA  and 
the  SIR  linearly  decreases.  For  iteration  =  500,  at  the 
following  CMA  stages,  relatively  strong  residual  inter¬ 
ferences  are  also  further  suppressed  so  that  the  slope 
of  SIR  deviates  from  the  linearity. 

4.  CONCLUSIONS 

We  have  presented  the  convergence  model  of  a  CM  ar¬ 
ray  in  the  presence  of  noise  and  demonstrated  how  the 
CMA  performs  for  multiple  signals  in  terms  of  out¬ 
put  amplitude.  We  observed  that  the  CMA  achieves 
the  array  gain  in  SNR  less  than  the  theoretical  max¬ 
imum  value  when  at  least  one  interferer  exists.  We 
also  noticed  that  for  unequal  power  signals,  it  takes 
much  longer  time  to  suppress  relatively  small  power  in¬ 
terference  if  at  least  one  interferer  has  relatively  large 
power.  These  analytically  derived  conclusions  are  con¬ 
sistent  with  simulations.  We  also  derived  an  expression 
to  evaluate  the  signal  strength  of  a  CM  canceler  output 


Figure  5:  Convergence  speed  of  the  CMA  for  an  equal 
and  non-equal  power  interferer.  N  =  8,  M  =  4,  0i  = 
35.3°,  02  =  23.8°,  03  =  4.4°,  04  =  -5.9°,  Pi, in  = 
P2,m  =  P3,m  =  Constant  =  1.125,  P4, in  =  1-125  (lines 
with  ”0”)  or  0.045  (lines  without  ”0”) 


Figure  6:  Signal-to-interference  ratio  at  each  stage.  N 
=  8,  M  =  4,  01  =  35.3°,  02  =  23.8°,  03  =  4.4°,  04  = 
—  5.9  ,  Pl,in  =  P2, in  —  Ps, in  —  P4,in  ^  1.125 

and  presented  the  SIR  at  each  stage  CMA  output  as¬ 
suming  uncorrelated  received  signals  and  infinite  SNR. 
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ABSTRACT 

This  paper  presents  a  wavelet-based  algorithm  for  time 
delay  estimation,  specifically  for  linearly  moving  time 
delay  estimation.  The  wideband  cross  ambiguity  func¬ 
tion  (WBCAF)  of  the  received  signals  at  two  spatially 
separated  sensors  is  obtained  by  taking  their  cross  wave¬ 
let  transform.  The  global  maximum  of  the  WBCAF  is 
then  located  which  gives  a  direct  measurement  of  the 
time  delay  and  doppler  compression.  The  performance 
of  the  estimation  algorithm  is  analysed,  and  the  mean 
values  together  with  the  variances  of  the  estimates  are 
derived.  Simulation  results  are  provided  to  illustrate 
the  potential  of  the  method.  The  major  disadvantage 
of  the  algorithm  is  its  excessive  computation  complex¬ 
ity. 

1.  INTRODUCTION 

In  radar  and  sonar  signal  processing,  time  delay  es¬ 
timation  (TDE)  has  been  widely  used  in  target  rang¬ 
ing  and  bearing  measurements [1]  [2].  When  the  tar¬ 
get  and  sensors  are  stationary,  the  time  delay  between 
two  received  signals  is  a  constant.  There  are  basically 
two  methods  that  are  commonly  used  to  estimate  the 
time  difference  between  signals  received  from  two  spa¬ 
tially  separated  sensors,  namely  the  cross-correlation 
method[3]  and  the  parametric  approach[4].  If  the  tar¬ 
get  and/or  the  sensors  are  moving,  the  time  delay  be¬ 
tween  the  received  signals  will  vary  as  a  function  of 
time.  The  simplest  scenario  being  that  the  motion  of 
the  target  or  sensors  is  of  constant  speed  and  the  tar¬ 
get  is  far  away  from  the  sensors.  In  this  case,  the  time 
delay  becomes  a  linear  time  varying  function.  The  al¬ 
gorithms  that  are  capable  of  measuring  time  delay  in 
the  stationary  condition  have  limitations  in  this  non¬ 
stationary  environment.  If  the  time  difference  varies 
rapidly,  most  algorithms  will  fail  to  track  the  time  vary¬ 
ing  delay. 

The  wideband  cross  ambiguity  function  (WBCAF) 
has  been  used  to  estimate  the  time  delay  and  doppler 
shift  by  Jin  ei  al  [5].  The  method  is  mainly  appli¬ 
cable  to  active  radar  because  the  source  signal  must 
be  known  a  priori.  In  passive  sonar,  the  source  sig¬ 
nal  is  usually  submerged  in  environment  noise  and  is 
therefore  not  readily  available.  Weiss  [6]  has  studied 
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the  relationship  between  wavelet  transforms  and  wide¬ 
band  correlation  processing  and  their  applications  to 
source  localization  and  channel  modeling.  By  making 
use  of  this  observation,  we  have  developed  a  wavelet 
based  algorithm  to  obtain  the  WBCAF  of  two  received 
signals.  This  algorithm  is  then  applied  to  solve  the 
linearly  moving  time  delay  estimation  problem. 

2.  THE  ALGORITHM 

In  time  delay  estimation,  if  the  target  and/or  the 
sensors  are  moving,  the  two  received  signals  can  be 
expressed  as  follows: 

where  D{i)  ^  Dq  (3t  is  the  linear  time  varying  de¬ 
lay.  If  the  target  is  moving  with  constant  velocity  v  in 
which  1^1  <  c,  where  c  is  the  speed  of  light  in  radar  (or 
the  speed  of  sound  in  sonar),  the  variables  Do  and  jS 
can  be  considered  as  the  time  delay  and  doppler  shift 
respectively.  The  signals,  ni{t)  and  n2(t),  are  additive 
white  Gaussian  noises  with  power  spectral  density  No 
in  the  two  sensors.  From  (1),  /2(t)  can  be  expressed 
as: 

/2(t)  =  x{{l  -  P)t  -  Do) n2{t) 

=  x{- — —)  +  n2(t)  (2) 

Oo 

where  ao  =  ~  scale  and  shift 

parameters  respectively.  The  received  signal  /2(t)  in 
this  case  is  approximated  by  the  wideband  model  and  it 
is  essentially  a  scaled  and  delayed  version  of  the  source 
input.  Our  task  is  to  estimate  ao  and  bo  from  the  two 
data  sequences,  fi(t)  and  /2(t),  and  subsequently  to 
compute  the  delay  Do  and  the  doppler  factor  /3. 

Weiss[6]  and  Young[7]  have  introduced  the  concept 
of  the  wideband  cross  ambiguity  function  (WBCAF) 
and  its  relationship  with  the  wavelet  transform.  The 
WBCAF  of  two  wideband  signals,  fi(t)  and  /2(t),  is 
defined  as 

-I  /•OO  J  _  _ 

WBCAF(s,  t)  =  -=  f2{t)  dt  (3) 

Y  |s|  J-OO  ^ 
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In  noise  free  environment,  we  obtain 


1 


WBCAF(s,  r)  = -4=  /  — -)dt 

v|s|  J-oo  «0  s 

(4) 

This  WBCAF  has  a  global  maximum  at  5  =  ao,  r  = 
bo.  However,  it  is  very  complicated  to  calculate  the 
WBCAF  directly  because,  for  a  data  sequence  aj(t),  it 
is  difficult  to  obtain  an  arbitrarily  scaled  and  delayed 
version  of  x{t).  Though  we  may  apply  multirate  signal 
processing  methods  to  obtain  the  computation 

load  is  immense  which  makes  the  system  impractical  to 
be  implemented  for  real  time  applications.  Therefore, 
it  is  necessary  to  derive  an  efficient  and  effective  way 
to  find  the  WBCAF. 

From  [7],  the  cross  wavelet  transform  indicates  the 
extent  of  commonality  between  the  two  original  signals 
or  the  two  original  wavelet  transforms  with  respect  to 
the  same  mother  wavelet.  The  magnitude  of  the  cross 
wavelet  transform  can  be  used  to  detect  the  presence 
of  common  signals.  By  definition,  the  cross  wavelet 
transform  is 


W,f{s,T)  =  1  /  /  WHfia,b)W;:gi- 

J  J  —oo  ^ 


a  h  —  T ,  da  db 


(5) 

where  Whf  denotes  the  wavelet  transform  of  f[t)  with 
respect  to  a  particular  mother  wavelet,  h{t),  which  can 
be  computed  from 


A  1  f  —  h 

W,/(a,fe)  -y=  /  f(t)h*(  —  )dt  (6) 

Vla|  J-oo  ^ 


The  parameter  Ch  is  the  admissible  constant  of  the 
wavelet  h(t)  which  is  given  by 


Ch 


m^)r 

iwi 


dev 


(7) 


where  JI(cv)  is  the  Fourier  transform  of  h(t).  From  (3), 
(6)  and  (5),  the  WBCAF  can  be  written  as 


WBCAF(5,r)  =  Wfj2(s,T) 

-w 


.  V  *  /  b  —  T  ^  da  db  . 

WKf2{a,  b)  — )  ^8) 

S  S  Of 


In  this  application,  it  is  essential  to  find  a  mother  wavelet 
h{t)  that  is  as  close  to  the  signal  being  transformed  as 
possible  and  it  will  be  an  advantageous  if  such  a  mother 

wavelet  is  analytic  so  that  h{ — r*— )  can  be  calculated 

in  advance  for  different  values  of  s  and  r.  In  addi¬ 
tion,  we  need  to  derive  two  wavelet  transforms  for  the 
received  signals  fi{t)  and  f2{t)  separately, 

WHf2{a,b)  r  f2{t)h*C-^)dt  (9) 

^|al  7-00  “ 


Wkhi 


a  b  —  T 


t- 


b—r 


-)  dt 


Then  by  putting  (9)  and  (10)  into  (8),  the  maximum 
point  that  corresponds  to  the  required  estimation  of 
ao  and  bo  at  the  (5,r)  plane  of  the  WBCAF  can  be 
obtained. 


3.  PERFORMANCE  ANALYSIS 


The  performance  of  this  wavelet  based  WBCAF 
method  can  be  analysed  by  computing  the  expecta¬ 
tion  and  variance  of  the  estimates  of  the  scale  and 
time  delay  parameter,  viz.  5  and  f.  The  mathematical 
derivation  is  tedious  and  it  can  be  proved  that  the  es¬ 
timates  of  s  and  f  are  asymptotically  unbiased  in  high 
SNR  condition.  The  variances  of  s  and  f  are  fairly 
close  to  the  CRLB  for  the  case  of  high  SNR,  and  they 
are  dependent  on  the  source/noise  signal  as  well  as  the 
mother  wavelet.  In  general. 


j  E[s]  ^  So 
\  E[t]  ^  To 


(11) 


var{r)  =  )(£>,_  A.)  +  g, 

where  A(5),  R5,  Dg  and  Eg  are  defined  in  a  similar 
way  as  in  [5].  The  second  terms  in  (12)  are  constants 
dependent  on  the  noise  power  as  well  as  the  properties 
of  the  source  signal  and  the  wavelet  function.  It  is 
noticed  that  the  first  terms  of  (12)  are  twice  as  large  as 
that  in  [5].  This  is  deemed  to  be  reasonable  because, 
in  this  case,  the  time  delay  and  doppler  shift  are  being 
estimated  in  both  sensors  respectively.  The  strategy 
we  have  adopted  in  the  proposed  method  is  that  we 
first  design  an  optimum  source  signal  to  minimize  the 
first  terms  of  (12),  and  then  select  an  optimum  mother 
wavelet  to  make  Ki  and  K2  as  small  as  possible. 


4.  SEARCH  METHOD  AND  SIMULATION 
RESULTS 

In  our  simulations,  we  used  two  sinusoids  with  dif¬ 
ferent  frequencies  as  the  source  signal,  viz.  x{t)  = 
sin(27r  0.45 1)  +  sin(27r  0.20 1),  The  sampling  period  was 
chosen  to  be  AT  =  1.  Two  uncorrelated  wideband 
Gaussian  sequences  were  generated  for  the  noise  signal 
ni{t)  and  712 (t).  In  sonar  applications  the  ratio  of  the 
target  speed  and  the  velocity  of  sound  is  ^  <  0.025. 
So  the  scale  [6]  is  within  the  range  of  [0.95, 1.05].  Here 
we  chose  the  scale  and  time  shift  as  so  —  1*05  and 
To  =  0.15  respectively.  The  corresponding  time  delay 
and  doppler  compression  factor  is  Do  =  0.1429  and 
(3  0.0467. 

Three  different  mother  wavelets  have  been  tried, 

hi{t)  =  (1— t^)e”V 

h2{t)  =  2sinc(|) -sinc(|)  (13) 

/i3(t)  =  cos(27r0.15i) 
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It  was  found  that  the  shape  of  the  WBCAF  was  differ¬ 
ent  for  different  mother  wavelets.  But  for  the  particu¬ 
lar  source  signal,  x{i),  the  mother  wavelet  hs^t)^  which 
was  a  Gaussian  windowed  sinusoid,  produced  the  best 
result.  The  additional  terms  in  (12),  that  increased  the 
variance,  are  also  smaller  for  h3{t)  than  for  hi{t)  and 
h2{t).  Therefore  h3{t)  has  been  chosen  as  the  mother 
wavelet  in  our  study. 

Fig.l  depicts  the  WBCAF  of  the  source  signal  and 
its  scaled  and  delayed  version,  and  it  can  be  seen  that  it 
is  a  smooth  convex  function  in  the  range  of  s  G  [0.9, 1.1] 
and  r  G  [—1,  Ij.  However,  if  noise  is  present,  the  WB¬ 
CAF  becomes  rugged  and  is  no  longer  a  smooth  convex 
function  as  shown  in  Fig.2.  In  general,  the  steepest  de¬ 
scent  method  is  unable  to  locate  accurately  the  maxi¬ 
mum  WBCAF.  Instead,  we  used  a  grid  search  method 
to  determine  the  maximum  point  which  correspond  to 
the  scale  and  time  shift  of  the  signal. 

Search  Method 

Assume  that  the  initial  range  of  s  and  r  for  observation 
is  5  G  [0.9, 1.1]  and  r  G  [—1, 1].  The  search  procedure 
is  as  follows: 

1.  Make  some  grid  on  the  (5,  r)  plan,  and  find  the 
maximum  point  of  WBCAF. 

2.  Make  a  finer  grid  in  a  smaller  range  around  the 
maximum  point,  then  find  the  maximum  point 
again. 

3.  Repeat  step  2  until  the  resolution  reaches  the  pre¬ 
defined  value. 

The  performance  of  the  method  has  been  studied 
for  different  signal- to-noise  ratio  (SNR)  and  different 
observation  data  lengths.  The  SNR  was  varied  from 
—  10,0,10  to  20dB  while  the  observation  data  length 
was  changed  from  50,  100,  200,  400  to  600  points,  re¬ 
spectively.  In  each  case  20  independent  runs  were  per¬ 
formed  to  obtain  the  ensemble  average. 

Fig. 3  and  Fig. 4  show  the  ensemble  average  of  s  and 
f,  respectively,  vs.  the  observation  data  length  under 
different  SNR  conditions.  While  Fig.5  and  Fig. 6  show 
the  variances  of  s  and  f  under  the  same  conditions.  In 
order  to  amplify  the  difference  for  ease  of  comparison, 
the  variances  of  s  and  f  in  Fig.5  and  Fig. 6  are  plotted 
in  logarithmic  scale.  It  is  seen  that  the  scale  and  shift 
estimates  are  fairly  close  to  the  actual  values  of  sq  — 
1.05  and  tq  =  0.15  when  SNR  =  OdH,  lOdB  and  20dB, 
and  the  variances  reduced  as  the  observation  length 
increased.  But  for  noisy  condition,  say  SNR  =  —  lOdH, 
the  estinlates  had  a  large  fluctuation  about  the  actual 
values  of  sq  and  tq,  and  the  variance  was  significantly 
larger  even  a  long  observation  data  length  was  used. 

It  is  found  from  Fig.3  to  Fig.6  that  under  high  SNR, 
less  input  points  are  required  whilst  for  low  SNR  more 
input  samples  are  needed  to  obtain  better  estimation 
of  the  scale  and  time  shift  of  the  received  signals.  For 
example,  when  SNR  >  lOdR,  200  input  signal  points 
are  enough  to  get  a  reasonable  result,  whereas  when 
SNR  =  OdBj  4d0  points  are  required.  It  is  also  found 
from  Fig.5  and  Fig.6  that  the  shift  estimate  has  a  larger 
variance  than  the  scale,  that  means  the  latter  is  more 


sensitive  to  additive  noise.  The  estimates  of  the  delay 
JDo  and  the  doppler  factor  0  can  be  obtained  easily 
from  the  unique  relations  between  and  s,  r. 

The  major  difference  between  this  method  and  [8] 
is  that  in  [8]  the  time  delay  is  estimated  iteratively 
and  an  inherent  disadvantage  of  the  method  is  that  if 
the  time  delay  varies  rapidly,  it  will  fail  to  track  the 
delay.  Whereas  in  the  wavelet-based  method,  we  use 
the  whole  data  sequence  to  determine  the  scale  and 
the  shift  between  the  two  signals,  and  from  there  we 
are  able  to  obtain  accurate  estimation  of  the  delay  and 
the  doppler  parameters  irrespective  of  whether  the  time 
delay  is  changing  rapidly  or  slowly. 

5.  CONCLUSION 

In  this  paper  we  have  used  a  wavelet-based  method  to 
estimate  the  scale  and  time  shift  between  two  received 
signals  and  subsequently  to  extract  the  time  delay  and 
doppler  compression  in  the  source  localization  problem. 
This  method  is  based  on  the  concept  of  wideband  cross 
ambiguity  function  and  the  cross  wavelet  transform. 

6.  ACKNOWLEDGEMENT 

Part  of  this  work  is  supported  by  a  research  grant 
#CUHK  74/93E  funded  by  the  Hong  Kong  Research 
Grant  Council. 

7.  REFERENCES 

[1]  G.  C.  Carter,  Coherence  and  time  delay  estimation: 
an  applied  tutorial  for  research,  development,  test,  and 
evaluation  engineers,  IEEE  Press,  1993. 

[2]  IEEE  'Frans,  on  Acoustics,  Speech,  Signal  Processing, 
special  issue  on  Time  Delay  Estimation,  vol.  ASSP-29, 
no.3,  1981. 

[3]  C.  H.  Knapp  and  G.  C.  Carter,  “The  generalized  cor¬ 
relation  method  for  estimation  of  time  delay”,  IEEE 
Trans,  on  Acoustics,  Speech,  Signal  Precessing,  vol, 
ASSP-24,  no.4,  pp. 320-327,  1976, 

[4]  Y.  T.  Chan  J.  M,  F.  Riley  and  J.  B.  Plant,  “A  param¬ 
eter  estimation  approach  to  time-delay  estimation  and 
signal  detection”,  IEEE  'Trans,  on  Acoustics,  Speech, 
Signal  Processing,  vol,  ASSP-28,  no.l,  pp8-15,  1980. 

[5]  Q.  Jin  K,  M.  Wong  and  Z.  Q  Luo,  “The  estimation  of 
time  delay  and  doppler  stretch  of  wideband  signals”, 
IEEE  'Trans,  on  Signal  Processing,  vol.  SP-43,  no.4, 
pp.904-916,  1995. 

[6]  L.  G.  Weiss,  “Wavelets  and  wideband  correlation 
processing”,  IEEE  Signal  Processing  Magazine,  Jan. 
pp.13-32,  1994, 

[7]  R.  K.  Young,  Wavelet  theory  and  its  applications, 
Kluwer  Academic  Publishers,  Boston,  1993. 

[8]  H.  C.  So  and  P.  C.  Ching,  “Comparative  performance 
of  LMSTDE  and  ETDE  for  delay  and  doppler  estima¬ 
tion”,  Proc.  of  28th  Asilomar  Conference  on  signals, 
systems  and  computers.  Oct.  1994. 


487 


WBCAF  in  absence  of  noise 
(observation_length=1 00) 


shift  -1  0.9  scale 


Figure  1:  WBCAF  is  a  smooth  convex  in  the  absence 
of  noise 


WBCAF  in  presence  of  noise 
{SNR=0dB  observation Jength=1 00) 

x10® 


shift  -1  0.9  scale 


Figure  2:  WBCAF  is  no  longer  a  smooth  convex  in  the 
presence  of  noise 


obervation  length 

Figure  3:  Ensemble  average  of  scale  estimations  in  different 
SNR  conditions,  (so  =  1.05) 


observation  length 

Figure  4:  Ensemble  average  of  shift  estimations  in  different 
SNR  conditions,  (ro  =0.15) 


observation  length 

Figure  5:  Logarithm  of  the  variance  of  scale  vs.  observation 
length  in  different  SNR  condition. 


observation  length 

Figure  6:  Logarithm  of  the  variance  of  shift  vs.  observation 
length  in  different  SNR  condition. 
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ABSTRACT 

As  well-known  it  is  not  possible  to  fulfil  the  sampling- 
theorem  exactly:  Cut-off-errors  because  of  the 
necessary  low-pass-filtering  before  sampling  -  to  get  a 
band-limited  input  signal  -  and  if  this  is  not  possible 
additionary  aliasing  errors  appear. 

In  the  paper  it  is  shown,  that  both  cut-off-  and 
aliasing-errors  are  depending  of  the  algorithm  of 
processing:  Processing  algorithms  with  an  integral  (I-) 
part  -  as  typically  mean  operations  -  lead  to  decreasing 
errors  in  comparison  with  the  -  normally  regarded  - 
case  of  no  operation  after  sampling;  algorithms  with  a 
differential  (D-)  part  on  the  other  hand  result  in 
increasing  errors.  Final  an  example  shows  typical 
cases  using  a  general  signal  model  of  the  input. 


1.  INTRODUCTION  -  ERRORS  WITHOUT 
SIGNAL  PROCESSING 


e^  «  e^  +  62  +  03  =  2| 


JS^^(co)-|l-Gg(jco)|  dco+ 
0 


(jXj/2 


d-aijj/Z  .ry 

+  JS,,(co)dco+  js„(a)-i-co3|G,(j[®-t-coJ|  dco 

-0^12 

(la) 


Investigations  of  the  errors  may  be  done  either  in  the 
time  or  frequency  domains.  An  approximation  using 
the  time  domain  is  given  by  Churkin  et  al.  1966.  If  as  a 

quality  criterion  the  mean-square  error  e^  is  used  the 
frequency  domain  often  is  preferred  (Zadeh  and 
Desoer,  1963;  Woschni,  1990;  Woschni,  1993). 

Fig.  1  shows  the  error  components  caused  by  ideal  or 

real  (non-ideal)  anti-aliasing  filtering  G  (jco) 

a 

including  aliasing  errors  with  o>5  =  sampling 

frequency,  Sxx(®)  spectral  power  density 


Fig.  1.  ef  =  damping  error;  e^  =  cut-off  error; 

03  =  aliasing  error 

It  is  assumed  the  error  components  to  be  not  correlated 
-  because  of  the  mirror-effect  this  supposition  may  be 
fulfilled  in  practice  approximately  -  and  side-bands  of 
higher  than  first  order  do  not  faU  into  the  frequency 

range  “C03/2<co<cDs/2.  If  the  last  mentioned 
condition  is  not  fulfilled  the  last  part  in  (la)  has  to  be 
changed  using  lag  theorem  of  Fourier  Transform 
(Woschni,  1990) 
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_  00  +0^/2  12 

^3  =  L  1  Sxx(®+rco,)|G.(j[(o+rcoJ)|  dco.  (lb) 

r— 1  — cog/2 


2.  ERRORS  WITH  CONSIDERATION  OF 
SIGNAL  PROCESSING 

A  lot  of  important  problems  of  signal  processing  as 
mean-value-,  P-,  PI-,  PD-,  PID-,  FFT-algorithms  and 
most  of  all  filter-algorithms  are  linear.  In  these  case 
the  linear  system  theory  yields  and  operations  in  the 

time  domain  y  =  {Op  x(t)}  can  be  transformed  by 
means  of  the  Fourier-Transform  to  the  frequency 
domain  Y(j©)  =  G(jco)X(jco). 


(3a) 

E  side-bands  of  higher  order  are  to  be  taken  into 
consideration  the  last  part  has  to  be  changed 


2.1  Errors  with  fulfilled  Sampling  Theorem 

Using  the  approach  as  given  before  we  gain  for  the 
ideal  algorithm  of  the  processing  procedure  (the 

model)  Gid(j©)  =  F{Opy]  and  for  the  real  algorithm 

=F{OPn=ai}- 

Now  the  results  of  linear  system  theory  may  be  applied 
to  calculate  the  mean-square  error 


2.2  Errors  with  not  fulfilled  Sampling  Theorem 

Here  we  obtain  -  under  the  same  suppositions  as  before 
approximately 


3.  EXAMPLES 

In  most  cases  the  power  density  of  the  signal  is  not 
known  because  of  missing  a-priori-information.  Then 
Sxx(t)^)  is  to  be  estimated  -  e.g.  caused  from  a  band- 
limited  white  noise  by  a  low -pass  of  first  order 


So=0  for  (D^co^o  .  (4) 

For  this  example  it  follows  for  the  cases 
a)  without  signal  processing 
-  with  anti-aliasing  filtering 


=  2SoCOo(arc  tanco^^o  /  coq  -  arc  tan  /  2cDo); 


-  without  anti-aliasing  filtering 


=  4SqCOo  (arc  tan  /  cOq  -  arc  tan  o)^  /  2coo ); 

(5b) 


—  ®xO 

e^  =  4 


=  4  J  c 

^/2l+(co/cOo) 


—  ®xO 

£^=2  f 
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b)  with  signal  processing  GQco)  =  1+  Jcd/cDq  (PD- 
algorithm) 


-  with  anti-aliasing  filtering 

E  “2SQ(cOj^Q-C0g  72); 

-  without  anti-aliasing  filtering 


e^=4SoKo-c03/2); 


(6a) 


(6b) 


c)  with  low-pass  filtering  co^p  after  sampling  (I- 
algorithrii) 

-  with  anti-aliasing  filtering 


e^=0  if 

(7a) 

-  without  anti-aliasing  filtering  only  aliasing  errors 
appear  and  only  if 


®XO^®S'®UP 
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^  =  2  J 


So 


l+((»/(no)' 
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=  2So(J 
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operation  after  sampling,  as  for  instance  used  in 
surface  measurement.  As  Fig.  2  shows  that  means  a 
low-pass-filtering  copp.  In  this  case  no  aliasing  error 
occurs  as  long  as  the  limiting  frequency  of  the  signal 
(Oq  is  less  than  (cog  -  cojp).  That  means  in  this  case 
the  sampling  frequency  has  to  be  only 


^  COq  *4"  CQj  p 


(8) 


Instead  of  the  value  due  to  the  sampling  theorem  now 
the  sampling  frequency  is  allowed  to  be  only  half  of 
this  value  for  the  extreme  case  colp  — >  0  without 
errors! 


Fig.  2.  Mean-operation  after  sampling 

d)  On  the  other  hand  additional  errors  due  to 
non-ideal  approximations  appear: 

For  the  rectangular  approximation  (Tg  = 
sampling  period)  the  mean-square  error  due  to 
approximation  can  be  calculated  (Woschni, 
1990) 


n/% 

=  2  J 


Sq 


0  (l+co/ooo) 
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joo  1-e 


dco 


(9a) 


The  next  better  approximation  is  the  trapezoidal  one 
with  the  approximation  error 


(7b) 

-  71/T^ 

0  r 

So 
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e  =2J  ] 
0  [ 
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l+oo/oooj 

jco  2 
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(9b) 


4.  CONCLUSION 


The  sampling  theorem  and  the  sampling  errors  in  the 
classical  theory  are  treated  without  signal  processing 
after  sampling. 

If  signal  processing  after  sampling  is  taken  into 
consideration  the  situation  is  another  one  as  shown  in 
detail  in  the  paper:  In  this  case  cut-off-errors  as  well  as 
aliasing-errors  are  depending  of  the  processing 
algorithm.  These  algorithms  mean  in  the  frequency 
domain  an  increasing  or  deaeasing  of  the  higher 
spectral  frequencies.  Due  to  this  fact  the  errors  -  both 
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the  cut-off  an  the  aliasing  parts  -  are  decreasing,  if  the 
processing  algorithm  includes  an  integral  (I-)part. 
On  the  other  hand  a  differential  (D-)part  leads  to 
increasing  errors.  In  the  extreme  case  of  a  mean- 
operation  (i.  e.  an  extreme  low-pass-filtering  after 
sampling)  the  sampling  frequency  is  allowed  to  be  half 
of  the' value  due  to  the  sampling  theorem. 

In  the  paper  at  least  illustrative  examples  show  the 
situation  in  typical  cases  of  signal-processing  as  P-, 
PD-  or  I-algorithms. 

Furthermore  it  has  to  be  taken  into  consideration  as 
well-known  the  condition  for  one-line  processing.  That 
means  the  computing  time  must  be  less  than  the 
sampling  period  T^. 

depends  on  the  algorithm:  In  general  Tq  is  the 
longer,  the  better  the  approximation.  For  the  example 
of  integration  the  trapezoidal  approximation  needs 
approximatialy  the  dubble  of  time  in  comparision  with 
the  rectangular  approximation.  For  minimizing  the 
overall  error  one  has  to  make  a  compromise  between 
sampling-period,  degree  of  approximation  and 
calculation  error-components.  Because  due  to  the  law 
that  the  computation  speed  increases  one  order  of 
magnitude  every  5... 7  years  in  future  the  possible 
critical  frequency  will  shift  to  higher  frequencies  that 
means  the  errors  will  be  decreasing  drastically. 

Last  not  least  it  may  be  pointed  out  that  the  use  of 
computers  with  higher  bit-words  (16  to  32  to  64  bit- 
computers)  is  only  effizient  if  the  errors  are  decreasing 
also  drastically. 
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ABSTRACT 

The  paper  describes  a  novel  approach  to  the  use  of 
direct-sequence  spread-spectrum  modulation  for  code 
division  multiple  access  (DS-CDMA)  applications  by 
replacing  the  normal  PN  spreading  code  with  an 
orthogonal  low-rate  convolutional  code.  This  achieves 
both  the  spreading  in  bandwidth  of  the  data  and  a 
powerful  coding  strategy  within  the  output  state  of  the 
convolutional  shift  register  system,  which  maximises  the 
number  of  simultaneous  CDMA  channels.  In  order  to 
achieve  orthogonality,  Hadamard  codes  are  incorporated 
within  the  convolutional  coding  strategy.  The  effects  of 
using  such  a  design  within  a  DS-CDMA  system  are 
studied  and  the  results  show  a  direct  increase  in  the 
capacity  of  the  cellular  system  compared  with  a  DS- 
CDMA  using  maximal  length  sequences.  The 
importance  of  randomising  the  output  symbols  from  the 
Hadamard  encoder  is  also  considered  with  respect  to  the 
spectral  characteristics  of  the  transmitted  signal  and  the 
isolation  between  different  simultaneously  transmitting 
users. 

1.  INTRODUCTION 

The  field  of  spread  spectrum  multiple  access  is  currently 
receiving  considerable  attention  as  a  contender  for  the 
next  generation  mobile  telephone  system.  DS-CDMA 
systems  are  at  present  looking  very  promising  compared 
to  advanced  time  division  multiplex  systems  (TDMA) 
[1],  as  these  systems  offer  increased  flexibility  in  the 
frequency  management  of  the  cellular  netwoik,  the 
prospect  of  temporary  capacity  overload  and  improved 
handover  flexibility.  The  ultimate  goal  is  to  maximise 
the  potential  number  of  simultaneous  users  within  a 
specific  cell  boundary. 

Conventional  DS-CDMA  systems  rely  on  an  internally 
generated  PN  sequence,  such  as  maximal  length 
sequences  or  Gold  codes,  to  spread  the  data  in  the 
transmitter  by  means  of  modulo-2  addition.  This  causes 


the  original  data  to  be  spread  in  bandwidth  by  a  factor 
dependant  on  the  PN  sequence  chip  rate.  If  the  PN  data 
is  of  repetition  length  L,  and  one  data  period  lasts  for  the 
whole  repetition  length,  then  the  data  will  experience  a 
spread  in  bandwidth  proportional  to  L.  The  factor  L,  by 
which  the  data  is  increased  in  bandwidth  can  be  termed 
the  spreading  ratio  of  the  base-station  transmitter. 

This  is  not  the  only  way  of  spreading  the  bandwidth  of 
the  data  in  the  transmitter  base-station,  however.  Using  a 
convolutional  encoder  of  rate  1/L,  as  shown  in  figure  1, 
also  introduces  a  redundancy  of  L  symbols,  hence  this 
also  provides  bandwidth  spreading. 


Figure  1  Spread-spectrum  system  using 
convolutional  coding  for  signal  spreading 

Figure  2  shows  the  schematic  of  a  CDMA  base-station 
transmitter  which  is  based  on  the  use  of  low-rate 
convolutional  codes  rather  than  the  more  usual  PN 
spreading  sequences.  The  orthogonality  of  ordinary 
convolutional  codes  is  very  poor,  and  this  would  lead  to  a 
small  number  of  simultaneous  users  when  used  in  a 
CDMA  system.  This  can  be  improved  significantly  by 
using  low-rate  orthogonal  codes  [2]  which  directly  code 
and  spread  the  data  signal.  This  system  provides  an 
encoded  bit  sequence  of  length  L  on  each  bit  transition 
from  the  data  source  and  in  effect  provides  complete 
spreading  of  bandwidth. 

It  is  well  known  that  orthogonal  functions  do  not  have 
good  auto-correlation  properties  and  therefore  the  direct 
use  of  orthogonal  functions  can  lead  to  poor  transmission 
spectral  characteristics  (e.g.  non-uniform  spreading)  and 
large  interference  signals. 
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To  reduce  this  problem,  a  second  stage  of  randomisation 
is  added  to  the  encoded  symbol  set  to  improve  the  auto¬ 
correlation  property  of  the  orthogonal  signals  [2].  The 
PN  sequence  serves  purely  as  a  randomiser  and 
introduces  no  further  spreading  of  bandwidth.  This 
encoded  and  spread  signal  is  then  modulated  onto  the 
carrier  (using  a  technique  such  as  QPSK)  prior  to 
transmission  over  the  channel. 


Figure  2  Simplified  diagram  of  the  proposed  BS 
transmitter 


The  receiver  structure  performs  the  inverse  operation  by 
demodulating  the  received  signal,  then  decoding  the 
convolutionally  encoded  signal  and  providing  an 
estimate  of  the  most  likely  transmitted  data  bit.  The 
decoding  algorithm  used  by  the  receiver  must  use  the 
maximum-likelihood  decoding  technique  since  the 
probability  of  any  codeword  being  transmitted  is  equally 
likely.  There  are  various  decoding  algorithms  such  as  the 
Viterbi  algorithm,  sequential  decoding  (which  is  nearly 
maximum-likelihood)  or  concatenated  coding  schemes. 
One  particular  feature  which  is  important  to  the 
decoding  algorithm  is  its  speed,  especially  if  it  is  to  be 
used  for  low  rate  codes  in  a  DS-CDMA  system  where  the 
throughput  (i.e.  the  speed  at  which  each  data  bit  is 
processed)  must  be  high.  It  will  become  apparent  that  a 
low  rate  code  will  inevitably  slow  down  the  speed  of 
operation  in  the  decoder  since  a  large  number  of  symbol 
comparisons  per  data  bit  will  be  made. 

The  Viterbi  algorithm  is  a  fast  and  efficient  way  of 
decoding  convolutionally  encoded  data.  It  works  by 
comparing  the  received  sequence  of  encoded  bits  with  the 
possible  combinations  of  most  likely  transmitted  bits  and 
makes  its  decision  after  it  has  reached  a  decision  on  the 
most  likely  path  through  the  trellis  of  the  convolutional 
code.  Thus  it  can  be  seen  that  this  technique  will  yield  an 
improvement  over  the  simple  correlating  process  in  a 
conventional  CDMA  since  a  decoding  decision  is  based 
not  only  on  the  complete  set  of  L  redundant  symbols  but 


also  takes  into  account  the  previous  data  bits  which  had 
been  present  in  the  shift  register  system.  Hence, 
improved  error  detection  is  provided  where  the  level  of 
improvement  depends  on  the  constraint  length  K  of  the 
code  (i.e.  number  of  shift  registers  used).  As  K  is 
increased,  the  decoding  method  becomes  more  historic 
since  the  number  of  possible  states  in  the  trellis  diagram 
will  increase  exponentially.  The  higher  the  number  of 
possible  states  in  the  trellis,  the  better  the  bit-error 
performance.  The  drawback,  of  course,  is  that  the 
decoding  procedure  will  slow  down  as  the  number  of 
states  in  the  trellis  increases,  as  the  decoder  has  to 
compare  many  more  possible  data  bit  transitions. 

The  level  of  improvement  that  a  convolutional  code  will 
give  compared  to  an  uncoded  system  is  given  by  the 
coding  gain.  The  Viterbi  algorithm  offers  the  advantage 
of  providing  a  high  coding  gain  at  low  bit-error  rates, 
which  can  range  from  1.5  dB  to  4  dB  at  a  BER  of  10'^ 
using  hard  decision  limiting  with  constraint  lengths  3 
and  8  respectively. 

2.  DS-CDMA  SIMULATIONS 

Computer  simulations  of  conventional  DS-CDMA 
systems  and  the  low-rate  orthogonal  code  system  were  set 
up  with  identical  basic  parameters  of  code  rate  and 
spreading  bandwidth  for  the  two  systems.  In  this  way  a 
fair  comparison  of  bits/s/Hz/cell  between  the  two  systems 
can  be  established,  hence  it  was  possible  to  obtain  a 
direct  comparison  of  the  number  of  simultaneous  users 
between  the  two  systems  for  a  given  bit-error  rate  (BER). 
For  the  case  of  the  low-rate  convolutional  Hadamard 
encoder,  shown  in  figure  3,  the  simulation  provides  an 
encoded  sequence  of  2*^  output  symbols  for  each 
consecutive  input  data  bit.  There  are  2^  different 
sequence  sets,  each  of  length  2*^  symbols  and  orthogonal 
to  any  other  sequence.  The  inherent  orthogonal 
properties  of  the  Hadama^'d  codes  are  based  on  the  fact 
that  any  two  sequences  of  length  2^  differ  in  2*^'^  bit 
positions,  thus  giving  a  64  symbol  length  Hadamard  code 
32  different  symbol  combinations. 


Figure  3  A  low-rate  convolutional  encoder 

To  decode  the  received  signal  sequence  with  a  Viterbi 
decoder  it  is  necessary  to  find  the  most  likely  path 
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through  the  trellis  state  diagram.  Since  each  user  has  a 
different  unique  mapping  of  state  outputs  in  the  shift 
register  system  to  the  inputs  of  the  Hadamard  block 
encoder,  the  trellis  for  each  user  will  be  considerably 
different,  hence  allowing  all  simultaneous  users  to  be 
isolated  from  one  another. 

It  is  well  known  [3]  that  the  BER  performance  of  a 
Viterbi  decoder  is  directly  related  to  the  free  distance  of 
the  convolutional  code,  consequently  the  BER  probability 
in  the  continuous  additive  white  Gaussian  noise  channel 
is  given  as; 


oo 

BER< 

x=dfree 


(1) 


where  p 

^  ^  \  N 


R  is  the  rate  of  the  code,  Eb/No  is 


the  required  SNR  at  the  output  of  the  decoder  and  C,  is 
determined  by  the  code  structure  of  the  Hadamard 
encoder.  Equation  (1)  is  the  key  to  determining  the 
structure  of  the  most  effective  code  performance  for  the 
continuous  output  additive  white  Gaussian  noise 
channel.  In  order  to  use  the  same  bandwidth  in  the  most 
efficient  way  it  is  desirable  to  use  a  coding  strategy 
which  yields  the  highest  free  distance,  ultimately 
governed  by  the  code  rate  and  constraint  length  of  the 
encoder.  The  new  code  design  featured  in  this  paper  will 
inevitably  give  a  superior  performance  to  a  simple  rate 
1/2  encoder  of  same  constraint  length  operating  in  a  very 
noisy  environment  (i.e.  high  number  of  simultaneous 
users).  At  low  user  numbers  the  performance  of  both 
systems  would  yield  an  equal  BER. 


To  obtain  a  result  for  the  performance  of  such  a  system 
in  “other-user”  noise  it  is  necessary  to  assume  that  a 
relatively  large  number  of  simultaneous  users  and  a  high 
spreading  ratio  will  approximate  the  interference  as 
normally  distributed  with  a  mean  of  zero  and  variance  of 


a 


2 


M-\ 

l/R 
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where  M  is  the  number  of  simultaneous  users.  This  will 
lead  to  a  new  expression  for  Px: 


number  of  paths  which  will  take  the  decoder  decision  to 
the  all-zero  state,  given  as  K+1.  For  the  Hadamard 
convolutional  encoder  used  in  this  simulation  the  free 
distance  is  7x32=224.  The  values  for  Cx  have  been 
obtained  using  a  full  search  of  the  trellis  structure  and  for 
a  K=6  Hadamard  code,  neglecting  the  higher  order 
terms,  yield  a  bit-error  probability  of: 

BER  <  P(224)  +  2P(256)  -h  6P(288)  16P(320)  -h 

35P(352)  60P(388) ...  (4) 

The  results  of  equations  3  and  4  are  displayed 
graphically  in  figure  4.  It  must  be  noted  that  BER  values 
of  greater  than  1  serve  only  on  a  theoretical  basis.  A 
comparison  between  the  uncoded  conventional  DS- 
CDMA  system  and  the  new  Hadamard  encoded  system 
shows  considerable  improvements  with  the  new 
implementation  at  a  low  number  of  simultaneous  users. 
These  results  are  also  compared  to  the  conventional  rate 
R=l/2  coded  DS-CDMA  implementation  and  show  that 
the  Hadamard  rate  R=l/64  implementation  performs 
better  at  a  relative  high  number  of  users. 


Figure  4  Theoretical  performance  of  different  coding 
schemes  in  other  user  noise 

The  resulting  theoretical  and  simulated  bit-error  rate 
curves  in  “other-user”  noise  can  be  seen  in  figure  5.  It  is 
apparent  that  the  theoretical  prediction  of  the  code 
matches  the  simulated  results  very  closely,  and  a 
significant  advantage  in  performance  can  be  found  when 
the  number  of  simultaneous  users  is  low. 


lx 
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(3) 


The  free  distance  of  the  K=6  Hadamard  convolutional 
code  is  governed  by  the  total  Hamming  distance  with 
respect  to  the  “all-zero”  path  in  the  trellis  and  the 


To  achieve  a  BER=10'^  a  maximum  number  of  7  users 
could  be  employed  in  a  conventional  DS-CDMA  system 
but  the  use  of  low-rate  convolutional  coding  can  lead  to 
an  increase  of  nearly  130%,  giving  a  new  maximum 
number  of  users  of  16. 
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Figure  5  Performance  of  low-rate  convolutional  codes 
in  other  user  noise 

The  use  of  PN  sequences  to  randomise  the  output  of  the 
low-rate  convolutional  encoder  is  to  improve  the  spectral 
characteristics  of  the  orthogonal  codes  and  provide  better 
isolation  between  users  without  causing  any  further 
spreading  of  the  encoded  signal.  This  is  done  by  using  a 
PN  sequence  of  the  same  rate  as  the  orthogonal  output 
sequence. 


Users  (M) 

Figure  6  Graph  displaying  the  advantages  of  using  a 
randomiser 

If  8  ^  0,  where  8  is  the  duration  of  one  output  symbol 
from  the  Hadamard  encoder,  the  spectral  density  of  a 
pseudo-random  signal  can  be  seen  [2]  to  approach  a 
constant  value  of  E/T,  where  Es  is  the  symbol  energy 
and  T  is  the  period  of  repeat  of  the  pseudo-random  noise 
sequence.  It  can  therefore  be  seen  that  such  a  signal  will 
give  an  almost  uniform  spectrum,  which  is  desired  in  a 
DS-CDMA  implementation.  The  sequence  of  output 
symbols  from  a  low-rate  orthogonal  convolutional 
Hadamard  encoder,  however,  have  a  periodic  nature  for 


each  sequence  set  of  length  2^.  Thus  the  auto-correlation 
properties  of  orthogonal  sequences  is  extremely  poor  and 
yields  a  spectrum  with  large  variations  in  amphtude, 
which  has  the  direct  effect  of  degrading  the  performance 
of  the  decoder  to  other  user  noise  since  the  signal  sets  are 
not  independent  random  processes.  To  eliminate  this 
problem  a  PN  sequence  of  same  rate  can  be  used  to 
smooth  out  the  spectrum  and  randomise  the  interference 
created  by  other  users  as  seen  in  figure  6. 

It  is  clearly  evident  that  the  second  stage  of 
randomisation  improves  the  performance  of  the  cellular 
DS-CDMA  system  by  at  least  60%,  giving  an  equivalent 
improvement  in  BER  performance  of  about  2dB. 

3.  CONCLUSIONS 

This  paper  has  shown  the  feasibility  of  using  low-rate 
orthogonal  convolutional  coding  for  DS-CDMA 
applications.  It  has  been  found  that  the  number  of 
simultaneous  users  per  cell  is  increased  by  at  least  70% 
and  there  is  an  additional  improvement  of  60%  when  a 
randomiser  is  used,  giving  a  total  improvement  of  130%. 
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ABSTRACT 

The  use  of  continuation  methods  and  various  homo- 
topies  used  for  globally  convergent  dc  operating  point 
simulation  are  discussed.  A  new  homotopy  that  gives 
the  shortest  possible  arclength  between  the  initial  solu¬ 
tion  and  the  desired  operating  point  is  introduced.  In 
addition,  it  is  shown  that  for  the  case  where  a  circuit 
possesses  multiple  operating  points,  this  homotopy  al¬ 
lows  the  user  to  choose  which  operating  point  is  found 
first  and  is  guaranteed  to  find  at  least  three  operating 
points  in  a  single  analysis. 

1.  BACKGROUND 

Finding  the  dc  operating  points  of  a  nonlinear  circuit 
is  one  of  the  most  important  and  difficult  tasks  in  elec¬ 
trical  circuit  simulation.  In  the  majority  of  circuit  sim¬ 
ulation  programs  the  dc  operating  point  is  found  (as 
in  SPICE,  for  example)  by  using  a  Newton- Rap hson- 
based  iterative  algorithm.  Such  algorithms  have  two 
shortcomings:  First,  convergence  is  in  general  not  guar¬ 
anteed;  it  is  guaranteed  only  if  the  initial  guess  (speci¬ 
fied  by  the  user  in  SPICE  by  the  .nodeset  statement) 
is  sufficiently  close  to  the  actual  solution.  Unfortu¬ 
nately  the  user  either  may  not  know  the  solution  or 
cannot  give  a  sufficiently  accurate  initial  guess.  Sec¬ 
ond,  only  one  operating  point  can  be  found  during  an 
analysis;  neither  location  nor  even  existence  of  other 
operating  points  are  known  once  the  algorithm  has  con¬ 
verged  to  a  particular  operating  point. 

A  better  approach  to  finding  the  solution  of  a  set 
of  nonlinear  equations  F{x)  =  0,  where  F  \ 
is  smooth,  is  the  use  of  a  continuation  method.  In  gen¬ 
eral,  this  method  entails  embedding  the  continuation 
parameter  A  into  a  set  of  nonlinear  equations  if(x,  A) 

This  work  was  supported  by  the  Natioucil  Science  Foundation 
under  Gr£uits  MIP  94-12779  and  MIP  94-57387. 


where  H  :  ^  7^”,  x  E  and  A  E  [0, 1],  which 

satisfies  the  following  conditions: 

1.  H{x,l)  =  F{x). 

2.  The  solution  x  =  aoi  H{x,  0)  —  0,  where  a  E 
is  trivial  or  easily  found. 

3.  There  exists  a  continuous  curve  a  (called  a  ‘‘zero 
curve”)  in  IZ^  x  [0, 1]  such  that  H{x,X)  =  0  for 
every  point  on  a. 

Since  H  gives  a  continuous  deformation  between  the 
“easy”  problem  H{x^0)  =  0  and  the  “hard”  problem 
H{x,  1)  =  0,  it  is  called  a  homotopy.  In  circuit  simula¬ 
tion  applications,  x  E  IZ^  is  the  vector  of  node  voltages, 
F{x)  gives  the  set  of  Kirchhoff  ^s  Current  Law  equations 
at  each  node,  and  the  continuation  parameter  A  can  be 
chosen  to  control  a  circuit  component,  such  as  a  voltage 
source,  a  resistor,  or  a  transistor  parameter. 

A  class  of  homotopies,  known  as  “probability-one,” 
has  been  found  that  can  be  applied  to  nonlinear  cir¬ 
cuits  [1] ,  These  homotopies  have  the  property  that  the 
resulting  zero  curve  will  not  exhibit  a  bifurcation,  will 
remain  bounded  for  A  E  [0, 1]  (achieved  by  keeping  all 
circuits  corresponding  to  A  E  [0, 1]  passive)  and  will  not 
return  to  A  =  0  (achieved  by  arranging  the  operating 
point  of  the  A  =  0  circuit  to  be  unique).  Hence  in  a 
probability-one  homotopy  the  simulation  is  guaranteed 
to  reach  a  solution  at  A  =  1. 

Another  important  advantage  of  continuation  meth¬ 
ods  using  a  probability-one  homotopy  is  the  possibility 
of  finding  more  than  one,  or  even  all  of  circuit's  operat¬ 
ing  points,  since  the  zero  curve  can  pass  through  A  =  1 
more  than  once.  Consider  the  Schmitt  trigger  shown 
in  Fig.  1(a),  biased  in  its  hysteresis  region.  Using  the 
circuit  simulator  SFRAME  [2],  developed  at  Bell  Labo¬ 
ratories,  a  continuation  method  was  used  that  employs 
the  Variable  Gain  Homotopy  to  find  the  solution  to  this 
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circuit.  The  Variable  Gain  Homotopy  is  defined  by 

H{x,  A)  =  (1  ~  X){x  -  a)  +  F(Aa,  x)  ,  (1) 

where  the  vector  a  determines  the  A  =  0  solution,  and 
F(Aq',  a:)  corresponds  to  the  nodal  equations  of  the  cir¬ 
cuit  that  results  when  all  of  the  transistor  a’s  are  mul¬ 
tiplied  by  A.  A  simulation  was  run  where  each  compo¬ 
nent  of  the  a  vector  was  set  to  0.  The  resulting  zero 
curve,  projected  onto  the  Vout  “  A  plane,  is  shown  in 
Fig  1(b).  Notice  that  the  curve  intersects  A  =  1  three 
times;  hence,  three  operating  points  have  been  found 
in  this  one  analysis.  However,  if  another  simulation  is 
run  where  all  components  of  a  are  set  to  1,  the  curve 
shown  in  Fig.  1(c)  results;  in  this  case,  only  one  op¬ 
erating  point  has  been  found.  A  third  simulation  was 
run  where  all  components  of  a  were  set  to  5,  with  the 
resulting  curve  shown  in  Fig.  1(d).  In  this  case  three 
operating  points  were  again  found,  but  in  the  opposite 
order  of  the  run  corresponding  to  Fig.  1(b).  Notice 
also  that  a  zero  curve  beginning  at  A  =  0  can  “wan¬ 
der”  before  reaching  A  =  1  for  the  first  time  -  that 
is,  the  arclength  can  be  long.  Such  a  wandering  zero 
curve  requires  more  points  to  be  calculated  due  to  con¬ 
straints  from  the  curve-tracking  algorithm  [3]  and  thus 
will  require  a  longer  simulation  time. 

2.  THE  ARCLENGTH  REDUCING 
HOMOTOPY 

The  above  examples  suggest  three  areas  for  improve¬ 
ment  of  current  continuation  methods  in  use: 

A.  The  user  should  be  able  to  specify  which  oper¬ 
ating  point  is  found  first  (i.e.,  equivalent  to  the 
SPICE  .nodeset  command). 

B.  The  zero  curve  connecting  A  =  0  to  A  =  1  should 
have  the  shortest  arclength  possible. 

C.  If  multiple  operating  points  exist,  they  should  al¬ 
ways  be  found. 

We  will  now  construct  a  homotopy  that  has  these 
properties.  We  begin  by  deriving  the  Augmentation 
Principle: 

Augmentation  Principle:  Consider  the  circuit  shown  in 
Fig.  2(a),  where  a  set  of  open-circuit  voltages  Vi , . . . ,  14 
and  short-circuit  currents  7^+1... have  been  mea¬ 
sured.  The  circuit  in  Fig.  2(b)  is  constructed  as  follows. 
Each  open  circuit  in  Fig.  2(a)  where  Vi,  i  = 
has  been  measured  is  replaced  with  a  voltage  source 
of  value  Vi;  likewise,  each  short  circuit  in  Fig.  2(a) 
where  /i,  i  =  fc  -f  1, . . . ,  n,  has  been  measured  is  re¬ 
placed  with  a  current  source  of  value  4*.  By  the  Sub¬ 
stitution  Principle,  the  Fig.  2(b)  circuit  must  possess 


an  operating  point  that  is  identical  to  the  operating 
point  measured  from  the  Fig.  2(a)  circuit.  Moreover, 
at  this  operating  point  the  current  through  each  volt¬ 
age  source  and  the  voltage  across  each  current  source 
must  be  zero.  But  if  this  is  true,  then  augmenting  each 
voltage  source  Vi  with  a  series  resistor  Ri  and  augment¬ 
ing  each  current  source  !{  with  a  shunt  conductance  G* , 
as  illustrated  in  Fig.  2(c),  will  not  change  this  operat¬ 
ing  point,  since  none  of  these  augmenting  resistors  and 
conductances  will  conduct  any  current.  Therefore,  the 
circuit  constructed  by  replacing  each  open  circuit  hav¬ 
ing  voltage  Vi  with  a  resistor  Ri  in  series  with  voltage 
source  Vi  and  replacing  each  short  circuit  having  cur¬ 
rent  li  with  a  conductance  Gi  in  parallel  with  current 
source  I*  will  possess  the  same  operating  point  for  any 
Riy  Gi  E  [0, 4*00]. 

The  Augmentation  Principle  can  be  used  to  con¬ 
struct  a  homotopy  as  follows.  If,  at  a  certain  operating 
point,  a  particular  transistor  is  expected  to  be  in  the 
off  state,  then  the  pseudo-off  model  shown  in  Fig.  3(a) 
is  used,  where  =  +00  at  A  =  0  and  goes  to  0  as  A 
goes  to  1;  if  a  transistor  is  expected  to  be  in  saturation, 
then  the  pseudo-saturated  model  shown  in  Fig.  3(b)  is 
used,  where  psat  =  0  at  A  =  0  and  goes  to  -j-oo  as  A  goes 
to  1;  if  a  transistor  is  expected  to  be  in  the  forward  ac¬ 
tive  state,  then  the  pseudo-forward  active  model  shown 
in  Fig.  3(c)  is  used,  where  =  0  at  A  =  0  and  goes 
to  -|-oo  as  A  goes  to  1.  In  this  case,  the  value  of  Vbe 
is  derived  from  the  user’s  estimate  of  the  collector  cur¬ 
rent  at  the  desired  operating  point.  The  resulting  ho¬ 
motopy,  called  the  Arclength  Reducing  Homotopy,  has 
the  following  properties: 

1.  It  is  based  on  simply  augmenting  the  existing 
BJT  model  with  appropriate  resistors;  no  change 
of  the  model  itself  is  necessary. 

2.  The  circuit  corresponding  to  A  =  0  must  possess 
a  unique  operating  point  by  the  Nielsen-Willson 
Theorem  [4]. 

3.  If  the  transistor  states  are  chosen  correctly,  then 
the  node  voltages  will  be  nearly  constant  with  A  G 
[0, 1];  hence,  the  arclength  of  the  zero  curve  will 
be  close  to  the  minimum  possible.  This  implies 
a  small  number  of  points  that  are  needed  to  be 
calculated  along  the  zero  curve  and  thus  a  very 
fast  simulation. 

4.  The  first  operating  point  encountered  will  always 
correspond  to  the  selected  transistor  states,  if 
such  an  operating  point  exists. 

5.  It  exploits  the  designer’s  knowledge  of  the  tran¬ 
sistor  states,  which  are  typically  known  before 


498 


simulation.  However,  even  if  the  expected  states 
are  incorrect  or  even  randomly  chosen,  the  global 
convergence  property  guarantees  that  an  operat- 
ing  point  will  still  eventually  be  found;  it  will  just 
take  longer  to  find. 

Hence  requirements  A,  and  B.  are  fulfilled  by  use  of 
the  Arclength  Reducing  Homotopy.  We  will  now  inves¬ 
tigate  how  multiple  operating  points  can  be  found. 

3.  GUARANTEEING  FINDING  MULTIPLE 
OPERATING  POINTS 

We  now  allow  A  to  vary  between  0  and  2  and  place 
the  following  additional  conditions  on  the  Arclength 
Reducing  Homotopy: 

1.  The  curve  is  bounded  for  A  G  [0, 2].  This  is  easily 
arranged  by  constraining  any  circuit  correspond¬ 
ing  to  A  G  [0,2]  to  be  passive. 

2.  There  exists  some  7]q  >  0  such  that  the  circuit 
corresponding  to  any  A  G  [0,  rjo]  possesses  a  unique 
operating  point. 

3.  There  exists  some  r]2  >  0  such  that  the  circuit 
corresponding  to  any  A  G  [2  —  772,2]  possesses  a 
unique  operating  point. 

4.  The  unique  operating  point  of  the  circuit  corre¬ 
sponding  to  A  =  0  is  ‘‘close”  to  an  operating  point 
of  the  original  (i.e.,  A  =  1)  circuit. 

5.  The  unique  operating  point  of  the  circuit  cor¬ 
responding  to  A  =  2  is  “close”  to  an  operating 
point,  different  from  the  one  in  Condition  4,  of 
the  original  circuit. 

Under  these  conditions,  the  only  possible  zero  curve 
that  can  exist  must  pass  through  A  =  1  at  least  three 
times,  as  illustrated  in  Fig.  4.  The  shaded  regions  in 
this  figure  correspond  to  the  homotopy^s  “regions  of 
uniqueness.”  Notice  that  if  if(A,x)  were  symmetric 
around  A  =  1  (i.e.,  if  /f(A,  a?)  =  H{2  —  A,  x))^  then  any 
zero  curve  with  one  endpoint  at  A  =  0  would  necessarily 
pass  through  A  =  1  only  once.  Thus,  any  homotopy 
that  passes  through  A  =  1  more  than  once  must  have 
some  degree  of  asymmetry  around  A  =  1.  The  five 
conditions  given  above  guarantee  this  asymmetry. 


To  realize  these  conditions  we  simply  extend  the  Ar¬ 
clength  Reducing  Homotopy  by  choosing  another  set 
of  transistor  states  for  to  A  =  2  that  corresponds  to 
a  second  expected  potentially  stable  operating  point. 
Hence  each  transistor  model  will  have  two  augmenting 
resistors;  for  example,  suppose  a  circuit  possesses  one 
operating  point  where  a  certain  transistor  is  expected 
to  be  off  and  another  operating  point  where  the  tran¬ 
sistor  is  expected  to  be  saturated.  Then  a  model  called 
off  jsat  would  be  used.  The  Fig.  1(a)  Schmitt  trigger 
was  run  again  using  the  Arclength  Reducing  Homotopy 
where  transistor  Qi  had  continuation  model  off _sat 
and  transistor  had  continuation  model  sat_off. 
The  resulting  zero  curve  for  Vout  is  shown  in  Fig.  5. 
Notice  that  the  operating  point  corresponding  to  the 
first  intersection  with  A  =  1  corresponds  to  Qi  off  and 
Q2  saturated.  Moreover,  the  variation  in  Vout  (and,  in 
fact,  in  all  other  node  voltages)  is  very  small  along  the 
zero  curve  between  A  =  0  and  A  =  1. 
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ABSTRACT 

A  learning  procedure  for  CNN  is  presented  and  applied 
in  order  to  find  the  parameters  of  networks  approximat¬ 
ing  the  dynamics  of  certain  nonlinear  systems  which  are 
characterized  by  partial  differential  equations  (PDE). 
Our  results  show  that  —  depending  on  the  training  pat¬ 
tern  —  solutions  of  various  PDE  can  be  approximated 
with  high  accuracy  by  a  simple  CNN  structure.  Re¬ 
sults  for  two  nonlinear  PDE,  Burgers’  equation  and  the 
Korteweg-de  Vries  equation,  are  discussed  in  detail. 


CNN  behaviour.  The  cell  output  function  is  usually  gi¬ 
ven  by  /out (2/)  =  c|  —  -  c|)  with  positive  c.  The 

function  defines  the  local  dynamics  of  a  cell. 

Up  to  now  a  large  number  of  learning  methods  for 
CNN  has  been  considered  [5],  most  of  them  are 
modifications  of  the  Recurrent  Backpropagation  and 
Backpropagation-through-Time  algorithms  for  standard 
recurrent  neural  networks.  But  these  methods  are  desi¬ 
gned  in  order  to  find  stable  fixpoints  of  a  network  with 
linear  weight  functions.  They  are  not  applicable  to  the 
approximation  of  the  dynamics  of  nonlinear  systems. 


1.  INTRODUCTION 


A  CNN  [2,3,4]  is  a  system  of  simple  nonlinear  proces¬ 
sors  (cells)  which  are  arranged  in  one  or  more  layers  on 
a  regular  grid.  Interactions  between  cells  are  local  and 
usually  translation  invariant,  i.e.  a  connection  from  a 
cell  j  towards  another  cell  i  only  exists  if  j  is  part  of 
I’s  neighborhood  //{i)  and  its  type  and  strength  de¬ 
pend  only  on  the  relative  position  of  j  with  respect  to  t. 
Thus  the  number  of  connections  increases  only  linearly 
with  the  number  of  cells,  a  feature  that  makes  hardwa¬ 
re  realization  of  CNN  relatively  simple  when  compared 
to  other  types  of  artificial  neural  networks.  State 
and  output  of  a  cell  i  in  layer  m  are  real  numbers, 
their  dynamics  being  determined  by  state  equations  of 
the  form 


dvZii) 


=  -p«.W) 


m'=l 


+  E  (1) 


with 

C  =  /out«0. 

The  feedback  connection  from  a  cell  i  +  /  in  layer  m'  to¬ 
wards  another  cell  i  in  layer  m  is  defined  by  the  weight 
function  while  the  functions  6^  define  the  connec¬ 
tion  from  an  input  node  towards  a  cell  z,  the  pa¬ 
rameter  vectors  and  have  to  be  determined 

(e.g.  by  a  learning  algorithm)  to  achieve  the  desired 


2.  SOLVING  NONLINEAR  PARTIAL 
DIFFERENTIAL  EQUATIONS  WITH 
CNN 

The  state  equations  (1)  of  a  CNN,  a  system  of  locally 
interconnected  ordinary  differential  equations,  can  be 
considered  as  a  spatially  discretized  representation  of  a 
given  PDE  where  the  functions  aj”  and  have  to 
be  chosen  appropriately.  Then  the  cell  states  (t)  are 
approximations  of  the  PDE’s  solution  u(x,<)  at  x  =  x* 
and  the  inputs  represent  external  forces. 

Let  the  cell  outputs  be  identical  with  the  cell  states, 
which  can  be  achieved  by  choosing  a  sufficiently  large 
parameter  value  c  of  the  piecewise  linear  output  func¬ 
tion.  Furthermore  we  set  =  Oj  ^.nd  —  as  the 

PDE  considered  here  are  homogeneous  —  the  network 
has  no  inputs.  Then  for  a  single  layer  network  with 
M{i)  =  {z  —  r, . . . ,  i  +  7’},  the  state  equations  are  given 

by 

l=-r 

2.1.  CNN  design  based  on  finite  difference  Me¬ 
thods 

An  approach  for  solving  PDE  with  CNN,  the  so-called 
direct  method,  which  has  been  used  so  far  [7,8,9]  is  ba¬ 
sed  on  standard  finite  difference  approximations  of  the 
spatial  derivatives.  This  method  is  introduced  for  the 
example  of  the  homogeneous  Burgers  equation  [1,10] 

du{x,t)  _  1  d^u{x,t)  ld{u{x,t)f 

dt  R  dx^  2  dx  ■  ^ 
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After  spatial  discretization  of  Eq.  (3)  using  three  points 
to  estimate  the  derivatives,  the  resulting  set  of  equations 

dM*i  (f )  1  «*i+,  (*)  -  2uxi  (<)  +  «!(_,(<) 

dt  ~  R  (Ai)2 

4Ax  ’ 

can  be  represented  by  a  single-layer  CNN  with  state 
equations  given  by  Eq.  (2)  with  r  =  1.  By  comparing 
Eq,  (2)  with  Eq.  (4)  we  obtadn  the  weight  functions 

I  2 

=  Pa^l.lVy^i^l  +  Pa,i, 2 

with 


1  1 


2 

Pa, 0,1  =  0, 

pa, 0,2  =  *  v2 

R{Axf 

1 

1 

“  4Ax  ’ 

We  have  shown  previously  [7,8]  that  for  several  PDE 
including  Burgers’  equation  precise  solutions  can  be  ob¬ 
tained  by  this  method. 


While  this  well-known  method  is  applicable  if  the  en¬ 
tire  PDE  is  given,  our  approach  requires  only  a  rough 
knowledge  of  the  PDE’s  mathematical  form  —  the  hig¬ 
hest  order  spatial  derivative  and  the  type  of  nonlinearity 
—  and  a  few  values  of  the  system  output.  The  learning 
method  outlined  in  the  following  section  is  able  to  de¬ 
termine  the  PDE  parameters  —  e.g.  the  coefficient  R 
in  Burgers’  equation  —  and  thus  the  complete  system 
output. 


2.2.  CNN  parameter  learning 

For  learning  the  dynamics  of  a  given  nonlinear  system, 
firstly  the  basic  CNN  structure  has  to  be  determined. 
This  includes  the  choice  of  the  neighborhood  A/*  and  of 
a  class  of  nonlinear  weight  functions  aj.  While  the  num¬ 
ber  of  cells  in  the  neighborhood  has  to  be  large  enough 
to  allow  an  estimation  of  the  highest  order  spatial  deri¬ 
vatives  in  the  considered  PDE,  it  is  necessary  that  the 
nonlinearity  of  the  PDE  can  be  represented  by  the  func¬ 
tions  ai. 

Then  let  assume  that  the  values  of  a  special  solution 
u*{xyt)  of  a  PDE  for  an  initial  condition  u*(x,0)  are 
known  at  the  cell  positions  x,-  for  a  few  times  tm-  The¬ 
se  values  are  presented  to  the  CNN  as  learning  pat¬ 
terns.  After  choosing  random  initial  values  for  the  com¬ 
ponents  of  the  parameter  vector  pa  =  (pa,-r,  •  * . ,  Pa,r)) 
the  mean  square  error 

1  M  JV 

Wn  EEt  «CNN(*>>*m,Pa)  -  U*{xi,tm)f 
m  =  l  t=:l 

of  the  obtained  CNN-solution  fm,  Pa)  is  mi¬ 

nimized  for  all  cell  positions  x*  at  times  im>  The  mi¬ 
nimization  procedure  is  performed  by  using  a  standard 
technique,  the  downhill-simplex-method  [6],  which  re¬ 
quires  no  explicit  gradient  information. 


3.  RESULTS 

The  learning  method  has  been  applied  to  determine  so¬ 
lutions  of  the  Burgers’  equation  (3)  and  the  Korteweg-de 
Vries  (KdV)  equation  [10] 


du{x,t)  _  d^u{x,t)  _d{u{x,t))‘^ 

dt  ~  dx^  dx  •  ^ 

A  CNN  with  a  neighborhood  radius  r  =  2  and  polyno¬ 
mial  weight  functions 


K 

Ctf  (Vu,»-f  j  5  Pa,l)  =  ^  ' 

k=l 


with  iT  =  2  has  been  used  in  all  cases.  Results  obtained 
with  the  direct  method  and  the  learning  method  have 
been  compared  to  reference  solutions  u*(x,  t)  which  were 
calculated  with  high  accuracy. 

In  order  to  determine  the  achieved  approximation  accu¬ 
racy,  the  mean  square  error 

1  ^ 

^  ^  ^  [^CNN(^*)  ^m)  ”  U  (xi,fTn)] 
i=l 

was  calculated  in  all  treated  cases.  A  stepsize  of  Ax  = 
0.5  has  been  used  for  both  CNN  solutions. 

Solutions  of  Burgers’  equation  with  R  =  10  are  shown 
in  Figs.  1  and  2,  The  values  of  the  reference  solution 
at  ti  =  5  and  t2  =  15  for  the  initial  condition  shown  in 
Fig.  1  have  been  taken  as  the  learning  pattern. 

The  CNN  with  the  same  parameters  has  been  applied 
to  various  other  initial  conditions  including  the  square 
function  shown  in  Fig.  2.  Although  the  square  func¬ 
tion,  due  to  its  discontinuity,  represents  a  numerically 
critical  case  and  functions  with  discontinuities  have  not 
been  taken  as  learning  patterns,  the  error  e(t)  of  the  so¬ 
lution  obtained  with  the  learning  method  is  significantly 
smaller  for  a  large  range  of  i  than  that  of  the  solution 
obtained  with  the  direct  method. 

Figs.  3  and  4  show  results  for  the  KdV-equation  with 
c  =  1/2.  The  two-soliton  solution  shown  in  Fig.  3,  which 
has  been  calculated  anadytically  [10]  at  =  1,  formed 
the  learning  pattern.  The  CNN  has  then  been  applied 
to  calculate  solutions  of  the  KdV-equation  for  a  gauss- 
ian  initial  condition,  which  cannot  be  solved  analytical¬ 
ly.  Here  both  methods  lead  to  the  same  approximation 
accuracy. 

The  learning  process  usually  took  about  1000  iterations 
in  the  case  of  Burgers’  equation  and  about  3000  iterati¬ 
ons  for  the  KdV-equation. 
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Figure  la;  Solutions  of  Burgers’  equation  at  t  =  0  and 
t  =  25;  values  at  ti  =  5  and  <2  =  15  have  been  taken  as 
the  learning  pattern 
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Figure  2b:  Mean  square  error  e(t)  of  the  solutions 
shown  in  Fig.  2a 


Figure  lb:  Mean  square  error  e(t)  of  the  solutions  Figure  3a:  Two-soliton  solution  of  the  KdV-equation; 
shown  in  Fig.  la  values  at  =  1  have  been  taken  as  the  learning  pattern 
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Figure  3b:  Mean  square  error  e(t)  of  the  solutions 
shown  in  Fig.  3a 


Figure  2a:  Solutions  of  Burgers’  equation  at  <  =  5  for 
a  square  function  as  initial  condition,  which  has  not  be¬ 
en  taken  in  the  learning  process 
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e(t) 


Figure  4a:  Solutions  of  the  KdV-equation  at  t  =  5  for 
a  gaussian  function  as  initial  condition,  which  has  not 
been  taken  for  the  learning  process 


Figure  4b:  Mean  square  error  e(<)  of  the  solutions 
shown  in  Fig.  4a 


4.  CONCLUSION 

Our  results  show  that  the  dynamics  of  two  nonlinear 
systems  which  are  characterized  by  PDE,  the  Burgers 
equation  and  the  KdV-equation,  can  be  approximately 
solved  by  CNN  with  parameters  optimized  by  a  learning 
method.  The  solutions  obtained  by  this  approach  were 
significantly  more  accurate  than  those  using  direct  dis¬ 
cretization.  The  learning  method  does  not  require  the 
knowledge  of  the  PDE’s  parameters. 
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ABSTRACT 

In  this  paper  the  authors  describe  the  results  of  their 
investigation  of  the  development  of  an  algorithm  for 
recognizing  and  identifying  numerals  which  may  be 
isolated  or  overlapped.  These  numerals  may  be 
continuous  or  fragmented.  The  features  that  the  digits 
have  in  common  and  the  intrinsic  features  of  each  one 
were  considered.  Such  characteristics  are  used  in  the 
hiunan  recognition  process  and  are  the  basis  of  the 
proposed  algorithm.  The  principal  idea  of  the 
algorithm  is  to  start  at  the  top  of  the  image  and  group 
the  pixels-on  in  horizontal  and  vertical  lines  that  will 
characterize  each  one  of  the  digits.  Numerous 
experimental  tests  were  carried  out  for  recognition  of 
handwritten  numerical  characters  of  bank  checks  and 
an  accuracy  of  99.88%  was  obtained. 

1.  INTRODUCTION 

The  recognition  of  human  handwriting  has  been  the 
subject  of  intensive  research  for  more  than  three 
decades. 

The  principal  motive  for  the  development  of 
recognition  systems  is  the  need  to  cope  with  the 
enormous  flood  of  paper  such  as  bank  checks, 
commercial  forms,  postal  address  reading,  etc. 

Several  high  accuracy  algorithms  have  recently  been 
proposed  for  the  recognition  of  handwritten  numerals. 
Among  the  several  techniques  that  have  been 
proposed,  the  works  of  Shridhar  et  al.  [1],  Kimura  et 
al.  [2]  and  Lan  et  al.  [3]  should  be  mentioned. 

The  character  segmentation  algorithm  used,  separates 
the  characters  when  there  are  blank  spaces  between 
them.  The  result  of  this  segmentation  may  be  isolated, 
overlapped  and  connected  characters.  After  each 
segmentation  between  the  characters,  they  are  stored 
in  a  matrix  (this  matrix  is  named  original-matrix  in 


this  paper).  Therefore,  the  original-matrix  may  have 
one,  two  or  more  characters  at  the  same  time,  which 
may  be  isolated,  overlaj^d  and  connected.  Below  we 
define  these  three  types  of  characters; 

Isolated  Characters  -  characters  in  which  the 
original-matrix  is  composed  of  only  one  character. 

Overlapped  Characters  -  characters  with  pixels-on 
(active  pixel  in  one)  in  the  same  column,  but  apart. 
Therefore,  the  original-matrix  may  be  composed  of 
two  or  more  characters  depending  on  the  way  they  are 
segmented. 

Connected  Characters  -  characters  with  pixels-on  in 
common,  i.e.,  one  or  more  pixels  belong  to  the  two 
distinct  characters,  so  the  original-matrix  may  be 
composed  of  two  or  more  characters  depending  on  the 
way  they  are  coimected. 

In  this  paper,  the  authors  focused  their  attention  on 
recognizing  handwritten  numerical  characters,  that 
are  isolated  or  overlapped,  which  may  be  continuous 
or  fragmented.  The  algorithm  is  presented  considering 
intrinsic  and  common  characteristics  used  by  human 
beings.  Then  there  are  some  comments  about  the  data 
used  to  test  the  algorithm  and  the  results  obtained  are 
shown.  Finally,  some  conclusions  concerning  these 
results  are  presented. 

2.  DESCRIPTION  OF  THE  ALGORITHM  TO 
RECOGNIZE  NUMERICAL  HANDWRITTEN 
CHARACTERS  ISOLATED  AND  OVERLAPPED 

The  main  features  utilized  by  the  program  to  define 
each  digit  are: 
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Superior  Horizontal  Line 

The  superior  horizontal  line  is  obtained  by  the 
program  scanning  the  bitmap  from  top-to-bottom  and 
left-to-right  until  the  first  pixel-on  is  found.  At  this 
point  the  initial  coordinate  of  the  superior  horizontal 
line  is  stored.  This  coordinate  is  denominated  initial 
coordinate  of  the  superior  horizontal  line  (ICSHL).The 
ICSHL  coordinate  consists  of:  the  row  of  the  initial 
coordinate  of  the  superior  horizontal  line  (RICSHL) 
and  the  column  of  the  initial  coordinate  of  the  superior 
horizontal  line  (CICSHL). 

Handwritten  numerals  have  various  imperfections.  As 
a  result  not  all  the  pixels-on  are  in  line  on  the  superior 
horizontal,  inferior  horizontal  or  left  vertical  lines. 
With  the  intention  of  creating  an  algorithm  with 
greater  flexibility  and  in  this  way  include  the  pixels-on 
that  are  not  in  line,  variables  were  defined  in  the 
program  to  represent  this  flexibility.  These  variables 
are  called  “advantage-condition”,  “vertical- 
condition”  and  “horizontal-condition”. 

From  the  ICSHL  coordinate  a  new  pixel-on  is 
searched  for  by  the  program  to  constitute  the 
horizontal  superior  line.  This  pixel-on  may  be  on  the 
same  row  as  the  RICSHL  or  one  row  above  or  below, 
the  pixel-on  being  one  column  to  the  right  of  the 
CICSHL  (this  condition  will  be  referred  to  from  now 
on  as  the  first  condition).  In  case  the  first  condition  is 
not  verified  then  a  pixel-on  which  may  be  in  the 
directions  2  or  6  (i.e,  being  in  the  same  column  and 
one  row  above  or  below  the  pixel-on  RICSHL)  in 
accordance  with  the  chain  code  of  8  directions  [4] 
(this  condition  will  be  referred  to  from  now  on  as  the 
second  condition).  If  the  second  condition  occurs  the 
program  increases,  by  one  unit,  the  “vertical- 
condition”  variable.  If  ^s  does  not  occur  the  program 
will  increase  by  one  the  “advantage-condition” 
variable  and  move  itself  on  the  same  row  increasing 
the  coliunn  by  one.  The  process  of  searching  for  the 
pixels-on  of  the  superior  horizontal  line  repeats  itself 
while  the  “vertical-condition”  variable  is  less  than  3 
or  the  “advantage-condition”  variable  is  less  than  2. 
At  the  end  of  this  search  the  final  coordinate  of  the 
superior  horizontal  line  denominated  final  coordinate 
of  the  superior  horizontal  line  (FCSHL)  is  stored.  The 
FCSHL  coordinate  consists  of:  the  row  of  the  final 
coordinate  of  the  superior  horizontal  line  (RFCSHL) 
and  the  column  of  the  final  coordinate  of  the  superior 
horizontal  line  (CFCSHL).  Figure  1  illustrates  the 
superior  horizontal  line  in  two  handwritten  numbers. 


Figure  1-  Superior  horizontal  line  in  two  handwritten 
numbers. 

Inferior  Horhmntal  Line 

The  inferior  horizontal  line  is  obtained  by  the  program 
scanning  the  bitmap  from  bottom-to-top  and  left-to- 
right  until  the  first  pixel-on  is  found.  At  this  point  the 
initial  coordinate  of  the  inferior  horizontal  line  is 
stored.  This  coordinate  is  denominated  initial 
coordinate  of  the  inferior  horizontal  line  (ICIHL).  The 
ICIHL  coordinate  consists  of:  the  row  of  the  initial 
coordinate  of  the  inferior  horizontal  line  (RICIHL) 
and  the  coltunn  of  the  initial  coordinate  of  the  inferior 
horizontal  line  (CICIHL).  From  this  point,  the  same 
process  is  utilized  as  in  the  superior  horizontal  line  for 
the  first  condition.  The  only  variation  is  the  position  of 
the  RICIHL  which  may  be  on  the  same  row  or  one  row 
below  or  above.  In  the  second  condition  the  same 
process  is  repeated,  therefore  the  first  pixel-on  should 
be  searched  for  in  direction  6  and  then  in  direction  2, 
the  rest  of  the  process  repeats  itself.  The  final 
coordinate  of  the  inferior  horizontal  line  is 
denominated  final  coordinate  of  the  inferior  horizontal 
line  (FCIHL).  The  FCIHL  coordinate  consists  of:  the 
row  of  the  final  coordinate  of  the  inferior  horizontal 
line  (RFCIHL)  and  the  column  of  the  final  coordinate 
of  the  inferior  horizontal  line  (CFCIHL).  Figure  2 
illustrates  the  inferior  horizontal  line  in  two 
handwritten  numbers. 


. u .  . 


O .  pcttl-oA  -  -  pnol-flff 

■  -  iafetior  hoiizontal  line  k  .  'advantage-condhion'’ 
u  -  ‘Verticel-conditioti''  (6  and  2  duecttons) 


Figure  2  -  Inferior  horizontal  line  in  two  handwritten 
numbers. 


Left  Vertical  Line 

The  left  vertical  line  always  begins  at  the  ICSHL 
coordinate.  From  this  point  a  new  pixel-on  of  the  left 
vertical  line  is  searched  for  by  the  program  and  may 
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be  in  the  same  column  of  the  pixel-on  CICSHL  or  one 
column  to  the  left  or  right,  this  pixel-on  must  be  one 
row  below  RICSHL.  When  no  neighbour  is  found,  a 
pixel-on  should  be  searched  for  which  may  be  in  the 
directions  4  or  0  (i.e,  be  on  the  same  row  and  in  the 
column  before  or  in  the  next  coliunn  of  the  pixel-on 
CICSHL)  in  accordance  with  the  chain  code.  If  this 
condition  occurs  the  program  will  increase,  one 
unit,  the  “horizontal-condition”  variable.  If  this  does 
not  occur  the  program  will  increase,  by  one  unit,  the 
“advantage-condition”  variable  and  move  itself  in 
the  same  column  increasing  the  line  by  one.  The 
process  to  meet  the  pixels-on  that  constitute  the  left 
vertical  line  is  repeated  while  the  “horizontal- 
condition”  variable  is  less  than  3  or  the  “advant^e- 
condition”  variable  is  less  than  1.  When  the 
“advant^e-condition”  variable  is  equal  to  1  or  the 
ICfflL  coordinate  is  found,  it  is  no  longer  necessary  to 
scan  the  bitmap  because  the  final  coordinate  of  the  left 
vertical  line  has  been  found.  At  the  end  of  this  search 
the  final  coordinate  of  the  left  vertical  line 
denominated  final  coordinate  of  the  left  vertical  line 
(FCLVL)  is  stored.  The  FCLVL  consists  of:  the  row  of 
the  final  coordinate  of  the  left  vertical  line  (RFCLVL) 
and  the  column  of  the  final  coordinate  of  the  left 
vertical  line  (CFCLVL).  Figure  3  illustrates  the  left 
vertical  line  in  two  handwritten  munbers. 


Figure  3  -  Left  vertical  line  in  two  handwritten 
numbers. 

Elimination  of  the  Comma  and  the  Period 

In  the  recognition  of  numerical  characters  of  bank 
checks,  the  authors  decided  to  eliminate  the  comma  (,) 
and  the  period  (.)  between  the  numbers,  since  a  lot  of 
people  do  not  use  them  when  writing  their  checks  and 
besides  which  the  comma  and  period  can  be  so  small 
that  the  thinning  algorithm  for  characters  [5] 
considers  them  to  be  “noise”.  At  times  the  comma  may 
be  of  such  a  considerable  size,  that  the  thinning 
algorithm  does  not  eliminate  it.  In  this  situation  this 
algorithm  is  able  to  recognize  it  and  eliminate  it.  In 
the  program,  the  fimction  which  carries  out  the 
elimination  of  the  comma  is  named  comma- 
eliminator. 


In  order  to  recognize  a  handwritten  numerical 
character  of  bank  checks  after  obtaining  its  skeleton  1^ 
the  thinning  algorithm,  the  program  scans  it  to  find 
the  coordinates  of  the  horizontal  superior  line  (ICSHL 
and  FCSHL);  the  coordinates  of  the  inferior  horizontal 
line  (ICIHL  and  FCIHL)  (this  step  will  be  referred  to 
as  STEP  1  from  now  on).  The  final  coordinate  of  the 
left  vertical  line  (FCLVL)  will  only  be  formd  after  the 
comma-eliminator  is  executed.  Later,  the  program 
performs  STEP  1,  executing  the  comma-eliminator 
fimction  and  then  performing  the  following  steps: 

•  If  (CFCSHL  -  CICSHL  <  2)  and  if  (CFCIHL  - 
CICIHL  ^  0)  the  fimction  named  verif-1-3-5-6-0  in 
the  program  is  executed.  The  objective  of  this 
fimction  is  to  recognize  the  digits  1,  3,  5,  6  and  0. 
Figure  4  illustrates  some  examples  of  numbers 
recognized  by  the  function  verif-1-3-5-6-0. 


Figure  4  -  Some  examples  of  munbers  recognized  by 
the  function  verif-1-3-5-6-0. 

•  If  the  number  is  not  recognized  and  if  (  CICIHL  ^ 
CFCIHL)  and  (CICSHL  ^  CFCSHL)  and 
((RFCLVL  -  RICIHL  ^  2)  and  (CFCLVL  - 
CICIHL  ^  2))  or  (RFCLVL  -  RICIHL  ^  2)  and 
(CFCLVL  -  CFCIHL  <  2))  the  function  named 
verif-0-8-6-3-5-1-2  in  the  program  is  executed. 
The  objective  of  this  function  is  to  recognize  the 
digits  0,  8,  6,  3,  5,  1  and  2.  Figure  5  illustrates 
some  examples  of  numbers  recognized  by  the 
function  verif-0-8-6-3-5-1-2. 


Figure  5  -  Some  examples  of  numbers  recognized  by 
the  function  verif-0-8-6-3-5-1-2. 

•  If  the  number  is  not  recognized  and  (CICSHL  ^ 
CFCSHL)  and  (CICIHL  ^  CFCIHL),  then  the 
function  named  verif-2-3-5-7-9-8-0  in  the  program 
is  executed.  The  objective  of  this  function  is  to 
recognize  the  digits  2,  3,  5,  7,  9,  8  and  0.  Figure  6 
illustrates  some  examples  of  numbers  recognized 
by  the  function  verif-2-3-5-7-9-8-0. 


Figure  6  -  Some  examples  of  numbers  recognized  by 
the  function  verif-2-3-5-7-9-8-0. 
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•  If  the  number  is  not  recognized  and  (CICSHL  ^ 
CFCSHL)  and  (CFCIHL  -  CICIHL  ^  2)  the 
function  named  verif-7-9-4-1-2  in  the  program  is 
executed.  The  objective  of  this  function  is  to 
recognize  the  digits  7,  9,  4,  1  and  2.  Figure  7 
illustrates  some  examples  of  numbers  recognized 
the  function  verif-7-9-4-1-2. 


Figure  7  -  Some  examples  of  numbers  recognized  by 
the  function  verif-7-9-4-1-2. 


be  recognized,  so  the  algorithm  carries  out  STEPS  1, 2 
and  3  again.  Figure  9  illustrates  some  overlapped 
numerals  recognized  by  the  algorithm  explained 
above. 
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Figure  9  -  Some  overlapped  numerals  recognized  Ity 
the  algorithm  explained  above. 


3.  RESULTS  OBTAINED 


•  If  the  number  is  not  recognized  and  (CFCSHL  - 
CICSHL  ^  2)  and  (CFCIHL  -  CICIHL  ^  2)  the 
hmction  named  verif-1-4-5-6-7  in  the  program  is 
executed.  The  objective  of  this  function  is  to 
recognize  the  digits  1,  4,  5,  6  and  7.  Figure  8 
illustrates  some  examples  of  numbers  recognized 
by  the  function  verif-1-4-5-6-7. 


Figure  8  -  Some  examples  of  numbers  recognized  by 
the  function  verif-1-4-5-6-7. 


Should  the  number  not  be  recognized  by  any  of  the 
mentioned  functions,  the  program  sends  the  message 
of  nonrecognition. 


The  algorithm  was  tested  with  over  820  handwritten 
numbers  obtained  from  164  checks  written  41 
different  people.  Th^  were  asked  to  write  random 
numerical  characters  on  4  checks.  This  algorithm  was 
unable  to  deal  with  connected  numbers.  The  check 
was  simulated  using  white  paper.  819  out  of  the  820 
numbers  handwritten  were  recognized  successfully 
and  one  was  recognized  inconectly.  Results  showed  a 
recognition  rate  of  99.88%. 

4.  CONCLUSIONS 

The  results  obtained  showed  that  the  algorithm 
utilized  for  the  recognition  of  isolated  or  overlapped 
handwritten  numerical  characters  presented  an 
excellent  recognition  rate. 


Overlapped  Characters  Segmentation 

To  separate  two  or  more  overlapped  characters,  the 
program  scans  the  whole  character  from  the  ICIHL 
coordinate  (this  step  will  be  referred  to  as  STEP  2 
from  now  on).  During  this  process,  each  pixel-on 
scanned  is  stored  on  a  new-matrix  and  is  deleted  from 
the  original-matrix.  By  the  end  of  this  procedure,  the 
new  matrix  contains  a  copy  of  the  scanned  character. 
By  scanning  the  whole  character  in  STEP  2,  the 
program  verifies  if  the  coordinates:  FCIHL,  ICSHL 
and  FCSHL,  found  in  the  character,  agree  with  the 
coordinates;  FCIHL,  ICSHL  and  FCSHL  foimd  in 
STEP  1.  If  this  coincidence  occurs,  the  algorithm 
recognizes  the  character  contained  in  the  new-matrix 
(this  step  will  be  referred  to  as  STEP  3  from  now  on). 
If  this  coincidence  does  not  occm,  the  program  goes 
back  to  STEP  1  in  order  to  find  the  coordinates  of  the 
superior  and  inferior  horizontal  lines  of  the  character 
in  the  new-matrix.  Then,  STEP  3  is  executed,  in 
order  to  recognize  the  character. 

After  recognizing  the  character,  the  program  verifies 
if  there  are  still  pixels-on  in  the  original-matrix.  If  so, 
it  means  that  there  are  still  one  or  more  characters  to 


5.  REFERENCES 

[1]  -  M.  Shridhar  and  A.  Badreldin,  Recognition  of 
isolated  and  simply  cotmected  handwritten  numerals. 
Pattern  Recognition  19,  pp.  1-12,  January  1986. 

[2]  L.  Lam  and  C.  Y.  Suen,  Structural  classification 
and  relaxation  matching  of  totally  unconstrained 
handwritten  Zip-code  numbers.  Pattern  Recognition 
21,  pp.  19-31,  January  1988 

[3]  -  S.  N.  Srihari,  E.  Cohen,  J.  J.  Hull  andL.  Kuan,  A 
system  to  locate  and  recognize  ZIP  codes  in 
handwritten  addresses,  IJRE  1,  i^).  3745,  January 
1989. 

[4]  -  A.  K.  Jain,  Fimdaments  of  Digital  Image 
Processing.  New  York.  Prentice  Hall,  Inc.,  USA, 
1989,  ch.  6,  pp.  363. 

[5]  -  E.  L.  Fibres,  E.  N.  Rezende,  G.  A.  Carrijo  and  J. 
B.  T.  Yabu-uti,  A  fast  thinning  algorithm  for 
characters,  in  Proceedings  of  the  Conference  1995 
IEEE  Workshop  on  Nonlinear  Signal  and  Image 
Processing,  Neos  Marmaras,  Halkidiki,  Greece,  June, 
1995,  pp.  166-169. 


508 


NEURAL  SYSTEM  FOR  TRACKING  AND  CLASSIFICATION  OF 
PRIMARY  RADAR  ECHO  SIGNALS 


S.  Cuomo,  P.F.Pellegrini,  E.  Piazza 

Dipartimento  di  Ingegneria  Elettronica,  University  of  Florence 
Via  di  S. Marta  3  -  50139  Florence,  Italy 
phone:  +39-55-4796267;  fax:  +39-55-494569 
e-mail:  Iabtel@ingfi1  .ing.unifi.it 


Abstract 

This  work  presents  a  system  designed  to  acquire 
primaiy  echo  radar  from  aircrafts  and  to  operate  a 
classification  based  upon  spectral  consideration  on 
the  retrieved  echo.  Fine  spectral  consideration  are 
carried  out  exploiting  the  Jet  Engine  Modulation 
(JEM)  phenomenon  due  to  the  modulation  on  the 
primary  echo  introduced  by  engines  rotating  parts. 
Classification  efforts  are  obtained  by  the  use  of  a 
Neural  Network  previously  trained,  put  on  the 
system  as  classification  sub  system. 

Target  parameters  are  investigated  analyzing 
effective  signals  obtained  by  two  operating  radars 
with  an  experimental  system  specifically  designed 
for  real  time  tracking  of  civilian  aircrafts  using  an 
acquisition  sub-system,  PC  based,  interfaced  with 
the  radar  circuitry. 

L  Introduction 

Nowadays,  because  of  the  ever-growing  number  of 
aircraft  in  the  areas  to  be  controlled,  the  raising 
demand  for  security  of  passengers  and  cargoes  and 
the  greater  care  about  the  environment,  the 
problem  of  improving  the  performances  of 
traditional  Airport  Control  systems  must  be  faced. 
Research  trend  in  future  years  is  oriented  towards 
the  development  of  integrated  systems,  which 
collect  information  coming  from  various  sensors, 
and  distribute  it  both  to  ground  air-traffic 
controllers  and  to  moving  vehicles.  The  realisation 
of  a  system  for  the  automatic  control  of  air  space 
with  non  co-operative  target  recognition  (NCTR) 
capability  can  play  an  important  role  in  this 
context. 

The  automatic  extraction  in  real  time  of  the  target 
features  suitable  to  ftilfil  the  requirements  stated 
above  can  be  carried  out  through  innovative 
approaches  to  the  processing  of  radar  data  based  on 
neural  network  processing  techniques. 

It  is  well  known  that  the  rotating  parts  of  aircraft 
engines  introduce  modulation  on  the  aircraft  echo 


spectral  response.  With  reference  to  jet  powered 
aircraft  these  phenomena  are  known,  in  related 
literature,  as  Jet  Engine  Modulation  (JEM)  [1]. 

It  has  been  observed  that  such  phenomena  are 
deterministic  and  supply  a  considerable 
contribution  to  the  total  energetic  content  of  the 
signal  and  it  is  reasonable  to  think  that  these  ones, 
being  related  to  the  particular  couple  "engine  - 
aircraft  body"  may  be  regarded  as  a  signature  that 
may  be  used  for  a  classification  of  the  targets  [2]. 

2.  Experimental  Measurements 

In  order  to  acquire  real  data,  a  flexible  registration 
tool  has  been  assembled.  At  each  scan  it  samples, 
with  a  high  data  rate,  the  in-phase  and  quadrature 
components  of  the  base-band  signal  in  prefixed 
window  of  the  radar  sky  and  transfer  it  from  the 
internal  buffer  into  a  mass  storage  for  off-line 
analysis  [2]. 

This  tool  has  been  designed  with  a  particular 
consideration  for  aspects  of  portability  and 
adaptivity  to  various  radar  systems.  It  is  based  upon 
a  PC  stnicture  and  a  fast  A/D  converter  which  is 
able  to  acquire  I,  Q  video  channels  (see  Fig.  1). 
Measurement  campaigns  were  carried  out  at 
Fiumicino  Airport  in  Rome  and  at  Linate  Airport 
in  Milan  (Italy).  In  Tab.  I  some  of  the  parameters 


of  the  two  radars 
setup  are  reported. 

involved  in 

the  experimental 

Name 

ATCR-33K 

ATCR-44K 

Location 

Fiumicino 

Linate 

Frequency 

2.76  GHz 

1.27  GHz 

Rotation  speed 

15RPM 

6RPM 

Beam  Width 

1.5  deg 

1.2  deg 

Pulse  length 

1  )XS 

1.5  ns 

PRF  (Average) 

1000  Hz 

446  Hz 

Sweep  on  Target 

17  Sweeps 

15  Sweeps 

Table  1  -  Radar  parameters 

Results  reported  here  are  mainly  concerned  with 
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the  S  band  radar  [3],  as  the  L  band  radar  campaign 
was  only  recently  performed  and  data  analysis  is 
still  in  progress.  As  previously  pointed  out,  various 
types  of  aircrafts  in  different  operative  conditions 
have  been  recorded.  In  the  analysis  of  spectral 
response,  the  recordings  have  been  mainly 
considered  in  Stagger  OFF  conditions  in  order  to 
perform  a  sampling  at  the  same  time  intervals. 

In  this  work  mainly  are  taken  into  account  analysis 
of  commercial  aircrafts  with  turbojet  engines  and 
especially  MD80,  DC9,  B747  types  for  which,  by 
courtesy  of  Alitalia  Airlines,  it  has  been  possible  to 
obtain  auxiliary  information  concerning 
constitutive  and  operative  data. 


Tort 


Fig,  1  -  Recording  tool  as  it  result  when  connected 
to  the  ATCR-33K  radar  system 

Few  words  about  the  recording  procedure  are 
useful.  Once  the  target  has  been  designated, 
automatically  the  recording  equipment,  using  the 
information  coming  from  the  tracking  PC,  centers 
the  window  on  the  aircraft  estimated  position  and 
records  the  I  and  Q  signals  belonging  to  the 
window.  The  process  of  centering  the  window  and 
recording  is  repeated  at  any  antenna  turn,  until  the 
operator  stops  it.  A  number  of  auxiliary 
informations  about  the  setting  of  the  recording 
equipment  and  environment  is  recorded  with  raw 
data. 

3.  Spectral  Analysis  of  ATC  Radar  echoes 
In  the  case  of  JET-powered  aircraft  the  rotating 


blades  of  fan  and  turbines  are  internal  to  the  engine 
ducts  and  thus  the  visibility  depends  on  the  aspect 
angle,  while  in  turboprop-powered  aircraft  the 
rotating  blades  are  always  exposed.  In  either  case 
the  spectral  signature  is  controlled  -  in  addition  to 
the  RCS  -  by  the  dimensions,  geometry,  number  of 
the  rotating  blades  and  by  the  rotation  speed  [1][3]. 
The  phenomena  called  Jet  Engine  Modulations 
(JEM),  have  a  general  validity  and  they  can  be 
observed  in  a  very  large  majority  of  measurement, 
at  least  for  the  S-band  systems  which  where  mainly 
considered.  In  any  case,  the  characteristics  of  the 
ATC  radar  give  rise  to  some  problems  for  a  correct 
interpretation  of  information  present  in  the  spectral 
response.  In  particular,  the  low  PRF  (about  1  KHz) 
typical  of  ATC  systems,  strongly  limits  the 
unambiguous  band  of  the  spectrum  and  cause 
aliasing  effects  which  are  present  both  for  main 
Doppler  line  and  for  the  JEM  lines.  Moreover  a 
dwell  time  (related  to  the  antenna  main  lobe)  of 
few  tens  of  milliseconds  cause  a  poor  spectral 
resolution. 


Fig.  2  -  Theoretical  spectrum  showing  the  main 
Doppler  line  and  side  JEM  lines 

4.  The  NCTR  problem 

The  problem  of  Non-Cooperative  Target 
Recognition  (NCTR)  is  not  elementary.  A  lot  of 
effort  is  spent  trying  to  achieve  NCTR  with  various 
types  of  sensors. 

The  problem  to  recognize  a  target  with  radar 
depends  basically  on  which  target  characteristics  to 
rely  on.  Some  important  target  characteristics  are: 
Radar  Cross  Section  (RCS);  target  velocity;  target 
fluctuation;  dominant  scatterers;  Doppler  spectrum. 
Probably  a  combination  criteria  based  on  a  number 
of  different  characteristics  will  form  a  reliable 
method  for  target  identification. 
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To  cariy  out  a  kind  of  automatic  classification 
some  parameters  characterizing  the  signal  are 
needed.  Particularly  those  parameters  that  they  are 
invariant  with  respect  to  a  number  of  variables 
characterizing  the  target  and  the  sight  geometry 
[4].  For  example  they  can  be  invariant  with  respect 
to  the  position,  speed  or  aspect  angle  of  the  target, 
or  with  respect  to  the  type  of  the  aircraft  revealed 
or  with  respect  to  its  engines[2]. 

The  application  of  neural  networks  in  signal 
classification  has  been  receiving  increased 
attention.  Neural  Network  classifiers  have  shown, 
in  a  number  of  occasions  [5],  superior 
performance  than  other  types  of  classifiers. 

One  of  the  most  popular  and  well  understood 
neural  network  paradigms  is  the  multilayer 
network  trained  with  the  back  propagation 
algorithm,  specially  in  recognition/classification 
problems  [6]. 

A  multilayer  neural  network  is  a  feed-forward 
network  consisting  of  some  layer  of  processing 
nodes  called  neurons  (see  Fig.  3)  or  adalines.  Each 
neuron  receives  its  inputs  from  the  previous  layer 
neurons  otuput  and  put  its  output  to  all  the  neurons 
of  the  successive  layer.  The  first  layer  is  the  input 
layer  which  receive  input  from  external  inputs,  the 
last  layer  is  the  output  layer  which  put  its  output  to 
the  external  output  and  the  intermediate  layers  are 
called  hidden  layers. 


Fig.  3  -  Single  adaptive  linear  element  (adaline) 
called  neuron. 


Although  it  has  been  demonstrated  that  only  one 
hidden  layer  can  realize  any  non  linear  mapping, 
two  hidden  layers  can  reduce  the  dimensions  of  the 
network  and  speed  up  the  training  process  [6]  [7]. 
Usually  the  input  layer  has  as  many  neurons  as 
available  data  for  each  record  to  classify,  for 
instance  signal  samples  of  a  signal,  and  the  output 
layer  has  as  many  neurons  as  target  classes. 

The  neuron  weights  are  automatically  calculated  by 


the  training  algorithm.  Once  trained  the  network 
should  be  able  to  produce  appropriate  responses 
when  presented  with  input  vectors  it  has  not  seen 
before 

When  learning  takes  place,  the  actual  output  vector 
is  compared  to  the  desired  output  vector  and  the 
error  between  the  two  vectors  is  calculated  and 
propagated  back  to  the  hidden  and  input  layers  to 
change  the  weights  values  in  order  to  minimize  the 
mean  squared  error. 

This  kind  of  learning  is  called  training  with 
supervision  and  requires  a  set  of  training  inputs  of 
which  the  expected  output  are  known  and  called 
target  output  [6]. 

In  this  work.  Neural  Networks  have  been  used  to 
distinguish  radar  signals  from  5  models  of 
commercial  aircrafts.  Some  preprocessing  has  been 
carried  out  on  the  raw  signal  to  extract  the  pattern 
sets  to  be  put  into  the  network  inputs.  Five  kinds  of 
patterns  have  been  tested: 

a)  complex  samples  of  the  raw  ATC  signal; 

b)  complex  sample  of  the  signal's  spectnim; 

c)  real  samples  of  the  spectnim  amplitude; 

d)  complex  samples  of  the  raw  ATC  signal  and 
kinematic  data; 

e)  complex  samples  of  the  spectrum  and  kinematic 
data. 

The  raw  ATC  signal  is  made  by  32  complex 
samples,  taken  with  the  sampling  frequency  of  the 
radar,  PRF,  that  is  1  KHz. 

Two  different  networks  have  been  tested,  the  first 
with  only  one  hidden  layer  and  the  second  with  two 
hidden  layers. 

The  whole  set  of  recorded  signal  includes  164 
records  of  turbofan  powered  aircraft  and  23  records 
of  turboprop  aircrafts  (see  Tab.  2.) 


Aircraft 

Available 

Training 

DC9 

76 

12 

MD80 

66 

12 

B747 

22 

5 

ATR42 

18 

5 

C130 

5 

3 

Tab.  2.  -  Available  data  set. 

and  Training  set 

composition. 

The  training  set  was  made  by  choosing  randomly  a 
proportional  number  of  records  for  each  class.  The 
composition  of  the  training  set  is  shown  in  Tab.2. 
The  training  set  was  the  same  for  every  test  carried 
out.  The  test  set  was  the  whole  set  of  available  data, 
despite  of  the  theoiy  says  to  purge  the  training  set 

from  the  test  set. 


For  the  software  simulation  has  been  used  the 
program  PDF  Software  included  in  the  book 
"Exploration  in  parallel  distributed  processing"  [8]. 
This  software  allow  to  simulate  several  neural 
network  including  Back  Propagation  networks  and 
is  available  on  both  MSDOS  and  UNIX  systems. 
The  two  network  simulated  are  summarized  in  Tab. 
4. 

The  classification  is  made  on  the  basis  of  networks 
output.  The  POP  software  produce  outputs  in  the 
interval  0-1.  If  just  an  output  is  greater  than  0.5, 
the  record  is  assigned  to  that  output  class,  if  none 
or  more  than  one  output  is  greater  than  0.5,  the 
record  is  not  classified. 


Parameter 

Net  1 

Net  2 

layers 

3 

4 

1st  hidden  layer  neurons 

16 

32 

2nd  hidden  layer  neurons 

- 

16 

Total  number  of  neurons 

85 

134 

Number  of  weights 

1104 

2640 

Tab.  4.  -  Parameters  summary 

of  the 

two  used 

networks. 

As  was  expected,  the  second  network  has  achieved 
better  result  but  with  a  much  larger  computational 
time,  mainly  in  the  training  phase,  while  the 
classification  time  is  almost  the  same. 


DC9 

MD80  B747 

AT42 

Cl  30 

N.C. 

DC9 

57 

14 

5 

4 

0 

20 

MD80 

26 

42 

5 

11 

2 

15 

B747 

36 

27 

22 

5 

0 

9 

AT42 

17 

6 

0 

67 

0 

11 

C130 

0 

0 

0 

40 

60 

0 

Tab.  5.  - 

Aircraft  classification  result  for  pattern  e). 

Net  2, 

values 

in 

%.  N.C. 

standing  for  Not 

Classified. 

a) 

b) 

c) 

d) 

e) 

DC9 

55 

43 

34 

43 

57 

MD80 

42 

48 

38 

35 

42 

B747 

22 

22 

27 

23 

22 

AT42 

11 

33 

39 

33 

67 

C130 

0 

60 

60 

60 

60 

Average 

41 

42 

36 

37 

51 

Tab.  6.  -  Probability  of  aircraft  correct 

classification  versus  the  parameters  taken  into 
account.  The  network  is  the  2nd.  Values  in  % 

The  best  classification  result  was  given  by  the 


patterns  a)  and  e),  which  represent  the  complex 
samples  of  the  raw  signal.  An  example  of 
classification  result  is  if  Tab.  5.  A  global  result, 
shown  in  Tab.  6.  is  the  probability  of  correct 
classification. 

The  misclassification  between  DC9  and  MD80 
have  to  be  remarked  because  that  aircrafts  have  a 
very  similar  fuselage  body  and  dilTer  only  for 
engines.  Maybe  the  network  uses  information  about 
the  target  RCS. 

It  is  evident  that  the  use  of  ancillary  data  or  target 
features  improve  the  classification  performance. 
See  the  difference  between  pattern  b)  and  e)  results. 
These  first  results  suggest  that,  with  a  further 
sophisticated  system,  a  major  increase  in  the 
capabilities  of  future  radar  systems  could  be 
developed. 
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ABSTRACT 

The  application  of  1.  3pm  Pr  ion  doped  optical  fiber  amplifier 
(PDFA)  for  CATV  subscriber  network  is  reviewed,  Design  con¬ 
sideration  and  theoretical  analysis  of  PDFA  is  conducted.  Based 
on  interaction  between  photon  and  electric  field  along  the  fiber, 
the  numerical  calculation  of  signal  gain,  effective  fiber  length  and 
pump  optimization  arc  also  made. 

1.  INTRODUCTION 

Optical  fiber  CATV  subscriber  networks  have  been  rapidly  devel¬ 
oped  reccntlyin  China.  Compared  to  coaxial  cable  transmission, 
optical  fiber  has  evident  advantages,  such  as  extreme  low  loss 
and  wide  frequency  band ,  therefore  it  saves  cascade  electrical  am¬ 
plifiers  with  equalizers  which  will  accumulate  noise,  signal  fre¬ 
quency  nonlinear  distortion  over  transmission  line  and  improves 
system  reliability.  Optical  CATV  network  will  gradually  become 
multi-media  terminal  network  with  development  of  people  ^s 
needs  to  integrated  service  communication,  data,  computer, 
%ideo  and  voice  etc. 

Generally  speaking,  the  transform  of  coaxial  cable  to  optical  fiber 
CATV  network  in  China  will  experience  the  following  four 
steps : 

Super-trunk  architecture  of  optical  cable  for  master  headend  to 
remote  headend  or  sub-headend. 

Fiber  optic  backbone  trunk  from  sub-headend  to  feeder  or  node. 
Fiber  to  the  feeder  through  optical  star  radiation. 

Fiber  to  the  hoom(FTTH),  a  future  CATV  transmission  style. 
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Among  the  architectrure  of  fiber  CATV  network ,  passive  optical 
network  (PON )  is  an  attractive  and  important  one  to  realize  the 
goal  of  FTTH.  It  is  evident  that  PON  shows  low  system  cost, 
high  stability  and  promise  the  further  developing  trends.  There 
are  two  kinds  of  typical  construction  in  PON  distribution 
scheme,  star-shape  and  tree  shape.  The  formmer  arranges  the 
optical  splitter  near  to  the  central  TV  station.  The  latter  uses 
trunk  optical  cable  to  optical  dirtribution  network.  In  order  to 
extending  transmission  distance  and  enlarging  number  of  sub¬ 
scribers,  appling  in-line  optical  fiber  amplifier  into  PON  is  an  ef¬ 
fective  way  as  shown  in  Fig.  1. 

Generally,  light  source  for  optical  fiber  CATV  system  may  use 
ISlOnm  DFB-LD,  1550nm  DFB-LD  and  soUd  state  YAG  Laser 
with  external  modulation.  It  is  clear  that  the  wavelength  at 
which  optical  signal  is  amplified  should  be  accordance  with  that  of 
light  source  of  optical  CATV  system.  Although  1550nm  is  the 
communication  window  of  low  loss  of  single  mode  optical  fiber 
and  1550nm  Er  ion  doped  optical  fiber  amplifiers  have  been  used 
to  practical  optical  systems,  1550nm  LD  in  CATV  will  bring  in 
the  second  term  distortion  resulted  from  frequency  chirp  during 
broadband  modulation  process.  YAG  laser  and  external  modula¬ 
tor  have  characteristics  of  strong  power  and  low  noise,  howe’s'cr, 
it  needs  complicated  linear  compensation  technology  to  overcome 
the  third  term  distortion.  At  the  same  time  YAG  laser  with  nar¬ 
row  line-width  will  produce  stimulated  Brillouin  scattering  and 
that  will  limit  the  injected  optical  power  to  optical  fiber  back¬ 
bone. 

Evidently,  131  Onm  DFB  laser  is  most  suitable  to  optical  CATV 
because  of  its  low  cost ,  weak  noise,  mature  technology  and  high 
reliability.  At  present  Pr-ion  doped  fiber  amplifier  (PDFA  ) 
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which  operated  at  wavelength  of  1300nm  with  signal  gain  of 
30dB  has  been  reported.  The  result  reveals  that  PDFA  will  be 
the  best  candidate  to  compensate  splitting  loss  in  passive  optical 
fiber  CATV  systems. 

2.  THEORETICAL  MODEL  AND 
CALCULATION  OF  PDFA 

2. 1  Theoretical  model 

PDFA  consists  of  a  segment  of  Pr  ion  doped  fluoride  glass  fiber, 
a  pump  laser  operating  at  wavelength  of  SOOnm  or  1016nm ,  a 
WDM  coupler  for  combining  1300nm  optical  signal  and  pump 
light  and  an  isolator  to  prevent  return  light  into  light  source.  The 
typical  construction  of  PDFA  and  the  energy  level  diagram  of  Pr 
ion  are  shown  in  Fig.  2.  The  model  of  the  four  level  transition  of 
Pr  ion  for  1300nm  amplification  is  denoted  in  the  figure.  Pump 
photon  ground  state  absotion  occurs  between  Eo  and  E3  energy 
level  with  pumping  efficiency  of  Wp.  Then  Pr  ions  transfer  swift¬ 
ly  to  E2  level  without  emission  and  some  of  them  return  to  Eo  lev¬ 
el  spontaneously  with  spontaneous  radiation  probabtility  AmCAjo 
<§^532)0  The  population  in\‘ersion  is  formed  and  stimulated  emis¬ 
sion  and  amplification  is  realized  between  upper  laser  level  E2and 
down  level  Ei  at  emission  rate  of  W21.  T2iis  lifetime  of  the  stimu¬ 
lated  ions.  During  above  process ,  ions  of  E2  level  can  also  be  ab¬ 
sorbed  to  the  higher  level  E^  by  excited  state  absorbtion  (ESA) 
and  transit  to  the  lower  level  by  the  amplified  spontaneous  emis¬ 
sion  (ASE).  These  reduce  the  pump  efficiency.  The  rate  equa¬ 
tion  of  ion  population  density  arc  given  by  the  following: 
dn(,(r,(p,z) 


dPp^Cz)^ 


dt 

dni(r,q:>,z) 

dt 


-  _  —  ( Wp -f  )no +Sioni  +  W*on2  +  Asons 


—  ~Sioni  +  (W^-(-W2i  H-1/t2i  )n2 


=  W,„n„  -  (W«  +W2,  +W«  4- 1  /t2,  )n2+S,2n3 


dn3(r,cp,z) 

dt 


— WpOo  —  (A3o“rSs2)n3 


According  to  interaction  between  atoms  and  electric  field ,  the  ba¬ 
sic  equation  of  space  distribution  for  light  intensity  in  a  fiber  can 
be  written  as  : 

I(r,(p,z,Y)=p(z)  I  E(r,p,Y)  |  ^ 

E  is  the  normalized  transvcrce  distribution  of  electric  field  for  def¬ 
inite  LP  mode  in  the  fiber. 

Thus,  the  power  propagation  equation  along  the  fiber  for  pump 
and  signal  can  be  written  as: 


dz 


=  ±gp(z)Pp'^(z) 


gp(z)— oosjojo  |E(r,<p,Yp)  I  ^no(r ,(p,z)rdrdq) 

^S^-±8.fa)P,fa) 


g.(z)==(o2i(Yi)“"cr2<(Yi))JoJo  I  E(r,cp,Yi)  I  ^n2(r ,q),z)rdrdqp 
where  Pp(z)  and  Ps(z)  are  light  intensity  for  pump  and  sgnal  re¬ 
spectively,  gp(z)  is  absorbance  of  pump  light  and  g,(z)  is  gain 
coefficient  of  signal  Hght.  b  is  doping  diameter  of  the  ion  in  the 
fiber.  ±  denotes  the  direction  of  +z  and  --z  respectively,  oij  is 
cross-section  area  between  i  and  j  level  that  denotes  photon  ab¬ 
sorbtion  or  emission  probability. 

2.  2  Numerical  calculation  of  the  gain 

According  to  the  above  theoretical  model  the’numerical  calcula¬ 
tion  of  gain  can  be  made.  Fig.  3  is  the  relation  between  signal 
gain  and  pump  power.  The  wavelength  of  pump  and  signal  are 
1017  nm  and  1300  nm  respectively.  The  core  diameter  of  the  Pr 
ion  doped  fiber  is  5pm.  The  numerical  aperture  is  0.  19.  Pr  ion 
concertration  is  500ppm.  The  gain  of  0.  025dB/mw  can  be  ob¬ 
tained  for  fiber  length  of  23  m.  The  calculation  results  is  in  ac¬ 
cordance  with  experimental  results  of  ref.  9.  Fig.  4  is  the  calcula¬ 
tion  results  of  the  gain  for  different  fiber  length  at  the  same  pump 
power  of  200  mw.  The  calculation  shows  that  the  signal  power 
no  longer  increase  while  fiber  length  being  20  m.  So  the  length  of 
20  m  can  be  treated  as’ effective  fiber  length. 

The  effect  of  pump  operation  on  gain  of  in-line  PDFA  is  also  e- 
valuatcd  by  the  above  theoretical  model.  Fig.  5  shows  signal  gain 
of  PDFA  at  different  power  intensity  distribution  of  fundamental 
mode  of  LPoi  for  pump.  The  100%  LPpi  denotes  pump  power 
propagates  completely  in  fundamental  mode.  The  0%  LPoi  de¬ 
notes  it  propagates  in  higher  order  modes.  Fig.  6  is  a  comparison 
between  uni-and  bi-direction  pumping  scheme.  The  results  shows 
gain  improvements  can  be  obtained  under  bi-direction  pumping 
scheme.  This  is  because  ESA  of  (^G^-^Po)  for  pump  photon  and 
co-operation  upconversion  of  (‘G<-^D2  )  —  — ^Hs)  for  signal 

photon  are  suppressed.  This  results  in  an  increase  of  population 
on  energy  level. 

3.  POTENTICAL  APPLICATION  SCHEME 

OF  PDFA  FOR  FTTH 
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Optical  fiber  CATV  network  are  rapidly  developing  in  China  and 
may  be  predicated  that  they-  wiU  transit  to  FTTH  by  using  new 
technology  and  facilities.  An  effective  way  to  compensate  split¬ 
ting  loss  is  use  of  PDF  A  which  operates  at  the  same  light  wave¬ 
length  of  1300  nm  in  current  fiber  CATV  systems  as  shown  in 
Fig.  7. 

4.  SUMMARY 

To  increase  the  numbers  of  subscriber  for  current  optical  fiber 
CATV  network  and  realize  future  FTTH  in  China.  The  applica¬ 
tion  of  in-line  PDFA  is  an  effective  way  to  compensate  spUtting 
insertion  loss.  The  design  consideration  and  theoretical  calcula¬ 
tion  of  PDFA  have  been  made  in  this  paper.  The  calculation  re¬ 
sults  denoted  that  the  pumping  efficiency  of  0.  025dB/mw  can  be 
obtained  for  fiber  length  of  20  m,  NA  of  0.  19  and  Pr  doping 
concentration  of  500  ppm.  The  theoretical  analysis  on  pumping 
scheme  confirmed  that  bi-direction  pump  is  superior  to  uni-direc¬ 
tion  pump  for  improving  signal  gain. 

REFERENCES 

[1]F.  Marcerou>  H.  A.Fe%Tier,  J.  Famos  ^‘General  theoretical 
approach  describing  the  complete  behavior  of  the  erbium- 


doped  fiber  amplifier"  SPEEVol.  1373  PP168-186 

[2] B.  Pedersen  et  al.  “Optimization  of  P?"*"  ZBLAN  fiber  amplifi¬ 
er"  Photon  Tech  Lett.  Vol.  4  No.  5  May  1992 

[3] S.  P.  Bastien  et  al.  “Theoretical  modeling  of  erbium-doped 
amplifier  pumped  by  the  800  nm  band"  SPIE  Vol.  1587, 
Fiber  Laser  Source  and  Amplifiers  I  1991. 

C4]B.  P^ersen  et  al. .  “The  design  of  erbium  doped  fiber  ampli¬ 
fier"  J.  L.  T.  Vol.  9  No.  9  Sept.  1991  PP1105- 

1112 

[5]  A.  Biarklev  et  al.  “Large  signal  modelling  of  an  Erbium 
doped  fiber  amplifier"  SPIE  Vol.  1171  Fiber  laser  source 
and  amplifiers  (1988)  PP 11 8- 129 

[6] Yasutake  chishi  et  al  “A  high  gain,  high  output  saturation 

power  Pr^"^  doped  fluoride  fiber  amplifiers"  IEEE  Photon 
Tech  Lett.  Vol.  6  No.  2  1994  PP195-198 

[7] Yasulake  Chishi  et  al.  “Investigation  of  efficient  pump 

scheme  for  Pr^'*'  doped  fluoride  fiber  amplifiers"  IEEE  Pho¬ 
ton  Tech  Lett.  Vol.  6  No.  2  1994  PP195-198 

[8] Whilhey  et  al-  “Towards  a  practical  1.  3pm  optical  fiber  am¬ 
plifier"  B.T.Tech.J  1993  No.  2  P115 

[9] Cakter.  S.  F.  Szebesta  “Amplification  at  1.  3pm  in  Pr^"^ 
doped  single-mode  fluorozirconate  fiber"  Electron  Lett 
1991  27  PP  628-629 


Fig.  1  Fiber  optic  backbone  trunk  with  PDFA 


515 


Fig.  2  Construction  and  principles  of  PDFA 
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Fig.  3  Signal  gain  versus  pump  power 
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Fig.  4  Signal  gain  at  different  length 


Fig.  5  The  effect  of  power  distribution  Fig.  6  The  comparison  of  gain  between 
of  LPoi  mode  on  signal  gain  uni— and  bi— direction  pump  scheme. 
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Fig.  7  The  integrated  service  subscriber  loop  with  PDFA 
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ABSTRACT 

From  sub-shot  noise  photonic  measurements  in 
oplocouplers  we  obtain  the  frequency  characteristics 
of  quantum  noise  reduction  using  the  relative  noise 
difference  method.  We  have  shown  that  the  RND 
method  enables  us  to  remove  the  dependence  of  noise 
measurements  on  the  noise  and  frequency  response  of 
the  instrumentation  amplifier.  We  have  also 
attempted  to  derive  the  same  frequency 
characteristics  using  a  circuit-based  model 
incorporating  photonic  noise  sources.  There  is  good 
agreement  between  measurement  and  modelling, 
hence  lending  experimental  support  to  the  use  of  the 
photonic  noise  model  to  other  sub-shot  noise 
optocoupler  structures. 

L  INTRODUCTION 

Conventionally,  the  small-signal  behaviour  of  an 
LED  is  modelled  in  the  same  way  as  for  a 
semiconductor  rectifier  by  a  parallel  RC  circuit  and 
its  noise  characteristics  is  determined  by  a  shot  noise 
current  source  connected  in  parallel  with  the  RC 
circuit.  While  this  representation  is  commonly  used 
for  describing  the  electrical  behaviour  of  the  device, 
it  has  been  found  inappropriate  when  dealing  with 
photonic  noise  behaviour  in  an  optocoupler.  It  is 
shown  in  this  paper  that  the  series  equivalent  noise 
voltage  proposed  by  Edwards[l]  yields  results  which 
agree  with  experimental  observations  whereas  the 
parallel  current  source  does  not. 

2.  MODULATION  RESPONSE 

Consider  the  optocoupler  as  shown  in  Fig.  1.  It  is 
assumed  that  all  photons  generated  by  the  LED  are 
collected  by  the  ideal  photo  detector  and  the  amount 
of  photo  current  generated  at  the  detector  is  equal  to 
the  current  responsible  for  generating  the  photons  at 
the  LED.  Since  electrons  and  holes  arc  injected  in 


pairs  and  recombine  in  pairs  in  the  LED,  it  is  enough 
to  consider  the  rate  equation  for  only  one  type  of 
charge  carriers.  Let  and  7=//^+/,  where  Ny 

and  are  the  mean  values,  and  //  and  i  are  the  small 
signal  quantities  of  the  carrier  number  and  junction 
modulation  current,  respectively.  Since  the  mean  rate 
of  change  of  the  carrier  number  is  zero,  the  rate 
equation  describing  the  excess  charge  number  is 
given  by  [2] 


where  q  is  the  electronic  charge,  and  is  the 
recombination  time  constant.  Taking  Laplace 
transform  of  eq.  (1)  and  rearranging  terms  give  the 
following: 


Clearly  the  left-hand  side  of  eq.  (2)  indicates  that  it  is 
a  measure  of  the  rate  of  carrier  recombination.  If  we 
assume  that  the  quantum  efficiency  is  100%  so  that 
recombination  is  entirely  radiative,  then  the  right- 
hand  side  of  (2),  denoted  by  lp(s),  may  be  regarded  as 
the  photon  generating  current.  Also,  eq.  (2) 
expresses  the  frequency  response  of  the  device 
relating  the  photon  current  Ip(s)  to  the  modulation 
i(s}.  We  note  that  the  frequency  response  is  identical 
to  that  of  a  single-pole  parallel  RC  circuit  and  the 
photon  current  is  represented  by  the  current  through 
the  resistance  of  (he  equivalent  circuit.  The 
modulation  bandwidth  is  evidently  given  by  IRlrn^. 

The  current  input  to  the  device,  /,  is  dependent  on  the 
modulating  voltage  the  source  impedance  Z^, 
and  the  electrical  characteristic  of  the  device.  From 
junction  diode  theory,  we  note  that  the  differential 
resistance  for  small  signals  at  low  frequencies  is 
given  by  R^f=]kVjlliyy  where  |.t  is  the  ideality  factor. 
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Vj  is  the  thermal  voltage  and  the  bias  current. 
We  then  arrive  at  the  commonly  used  equivalent 
circuit  of  the  LED  as  shown  in  Fig.  2  (omitting  the 
noise  for  the  moment),  where  Cj  is  given  by 
which  represents  the  storage  capacitance  of  the 
junction.  Any  parasitic  vSeries  inductance  and 
resistance  of  the  LED  may  be  lumped  with  Zy 


indicative  of  the  process  between  photonic  noise  and 
the  electronic  shot  noise  current  in  the  LED.  Using 
the  series  equivalent  circuit  of  Fig.  2(b)  and  with  Zy 
very  large,  can  be  expressed  as 

.  2  [<Vv^>/^,/^](coT,.)^ 


3.  SHOT  NOISE  CURRENT 

We  have  shown  above  that,  for  the  modulation  signal, 
photons  are  directly  related  to  the  current  component 
through  the  junction  resistance  If  we  now  follow 
convention  to  add  a  parallel  noi.se  current  as  shown 
in  Fig.  2(a),  we  would  find  our.sclves  in  the  dilemma 
that  when  the  device  is  externally  AC  shorted  (ie, 
Z^=0),  the  shot  noise  current  does  not  flow  through 
the  junction  resistance,  hence  implying  the  absence  of 
photonic  noise.  This  is  of  course  contrary  to  the  fact 
that  we  should  obtain  full  shot-noise  in  the  photon 
intensity.  This  difficulty  is  apparently  removed  if  we 
represent  the  shot  noise  by  a  voltage  source,  = 
in  series  with  as  shown  in  Fig.  2(b).  When 
the  device  is  now  AC  shorted,  we  obtain  the  full  shot- 
noise  as  can  be  verified  in  practice.  Although  one 
can  transform  the  parallel  combination  of  and  R^i 
into  the  series  connection  of  and  /?^/  using 
Thevenin's  Theorem,  their  equivalence  holds  only  as 
far  as  any  current  or  voltage  external  to  the  two 
equivalent  circuits  is  concerned.  Hence  any  external 
electrical  measurement  on  the  LED  will  not  show  the 
internal  difference  between  the  two  equivalent 
circuits.  However,  the  currents  through  the  internal 
source  resistance  need  not  be  the  same  in  the  two 
circuits.  This  apparent  ambiguity  can  be  resolved  by 
measuring  the  photon  number  fluctuations  generated 
by  the  currents  through  the  differential  resistance  as 
we  have  done  in  the  optocoupler. 

At  room  temperature,  the  quantum  efficiency  of  a 
practical  optocoupler  is  usually  much  lower  than 
unity  so  that  the  detector  noise  consists  of  two 
components,  namely,  the  transmitted  shot  noLse  and 
the  partition  noise  which  is  due  to  photon  number 
deletion.  That  is,  the  detector  noise  spectral  density 
is  given  by  [3]: 

(3) 

where  i]  is  the  transfer  efficiency,  q  is  the  electronic 
charge,  the  LED  bias  current,  and  is 

the  mean  square  shot  noise  current  spectral  density. 
The  partition  noise  is  frequency  independent.  The 
transmitted  noise,  on  the  other  hand,  is 


1  +  (COT^.)"^ 


(4) 


Putting  (4)  into  (3)  yields  the  output  noise  relative  to 
shot  noise: 


1  +  (0)T^)^  ^ 


(5) 


At  low  frequencies,  the  transmitted  noise  is  very 
small  and  the  output  noise  is  said  to  be  squeezed  to 
below  the  shot  noise  level.  The  squeezing  bandwidth 
can  be  defined  as  the  frequency  at  which  the  output 
noise  of  the  optocoupler  is  3  dB  below  the  shot  noise 
level  which  appears  at  high  frequencies.  This 
frequency  can  be  obtained  from  the  first  term  of  (5) 
and  is  given  by  l/(2nT^),  which  is  the  same  as  the 
modulation  bandwidth.  In  practice,  however,  because 
of  low  quantum  efficiency,  the  second  term  of  (5)  is 
quite  large  so  that  the  total  amount  of  squeezing  is 
very  small  at  room  temperatures.  One  way  to 
increase  quantum  efficiency  is  to  work  with  the 
system  at  low  temperatures. 

4.  PHOTONIC  NOISE 
MEASUREMENTS 


By  cooling  an  optocoupler  consisting  of  an  IRED  and 
high  efficiency  PIN  diode  detector  we  obtain 
quantum  transfer  efficiencies  about  30%  so  that  the 
output  noise  is  not  swamped  by  partition  noise. 
Typical  measurement  results  obtained  at  a 
temperature  of  150  K  and  for  different  bias  currents 
are  as  shown  in  Fig.  3.  We  consider  the  case  for  //,=2 
mA.  Curve  (a),  denoted  by  A(o)),  is  obtained  by 
illuminating  the  detector  with  Poi.ssonian  light  having 
a  constant  intensity  noise  spectral  density  over  the 
frequency  range  of  measurement.  The  roll-off  at  high 
frequencies  is  due  to  the  characteristics  of  the 
instrumentation  amplifier  response  Curve  (b) 

gives  the  detector  noise  when  the  LED  is  used  as  the 
light  source,  and  we  represent  it  by  /?((i)).  Both 
curves  include  a  noise  component,  due  to  the 
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instrumentation  amplifier.  Thus  the  functions  A(io) 
and  B((.o)  may  be  expressed  as  follows: 


(6) 

[<ino^>  +  =  B{m) 

(7) 

Taking  the  difference  between  (6)  and  (7) 
the  amplifier  noise  and  yields 

removes 

(8) 

At  very  high  frequencies,  the  detector  noise  reaches 
the  shot  noise  level  and  from  (3)  and  (4)  we  have 
Therefore  eq.  (8)  simplifies  to 

(9) 

Since  A((.o)  is  proportional  to  the  amplifier  gain,  or, 
=KA(o)),  we  may  rewrite  (9)  as 

(10) 

The  right-hand  side  is  referred  to  as  the  relative  noise 
difference  (RND).  If  the  circuit  model  of  Fig.  2(b)  is 
correct,  the  RND  characteristics  should  provide  us 
with  information  regarding  the  vSqueezing  bandwidth 
1/(2jtt^).  It  is  of  interest  to  note  that  the  squeezing 
bandwidth  can  also  be  thought  of  as  the  half-power 
point  of  the  relative  noise  difference,  as  implied  by 
(10). 

Since  both  A(u))  and  B(io)  are  spectral  plots  in  dBm, 
their  ratio  B(co)IA(io)  can  be  obtained  from  their  dB 
difference  and  then  linearised  before  the  RND  is 
obtained.  A  typical  plot  of  the  RND  is  as  shown  in 
Fig.  4  which  corresponds  to  the  case  when  /^=2  mA 
and  a  temperature  of  150  K.  By  fitting  a  1st  order 
low-pass  response  (solid  line)  to  the  RND  data,  we 
find  the  squeezing  bandwidth  to  be  650  kllz  ±15%, 
which  is  not  significantly  different  from  the 
modulation  bandwidth  of  616  kllz  ±5%  measured 
directly  at  the  output  of  the  PIN  detector. 


experimental  support  to  the  validity  of  the  series 
noise  equivalent  circuit  for  the  LED  in  optocoupler 
applications  [4-5]. 
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5,  CONCLUSION 

The  method  .of  relative  noise  difference  is  original 
and  its  use  removes  both  the  noise  and  frequency 
dependence  of  the  amplifier  used  in  the  noi.se 
measurement.  The  apparent  agreement  between  the 
squeezing  and  modulation  bandwidths  lends 
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Fig.  1  Oplocouplcr  circuit 


Zg  Ip  +in 


(b) 


Fig.  2  Electronic  small-signal  model  of  the 
LED  with  its  internal  shot-noise 
represented  as  (a)  a  current  source, 
(b)  a  voltage  source. 


Fig.  3  Measurements  of  photonic  noise 
spectral  density  at  the  output  of  the 
optocoupler  with  different  LED  bias 
currents. 


0.1  1 

Frequency,  MHz 


Fig.  4  RND  data  (dark  squares)  and  a  1st 
order  low-pass  response  (solid  line) 
using  circuit  model. 
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ABSTRACT 

A  possibility  to  control  parameters  of  microwave 
signals  by  pulsed  optical  heating  of  a  ferrite  film 
involved  in  a  magnetostatic  wave  transmission  line  was 
demonstrated.  Variations  in  the  signal  phase  shift  by 
IOtc  rad,  delay  time  by  35  ns,  and  insertion  loss  by  5  dB 
were  obtained  in  the  5.7  GHz  frequency  band  using 
5  ns  laser  pulses  with  the  energy  of  30  mJ. 

1.  INTRODUCTION 

Recently,  a  great  attention  was  paid  to  optically 
controlled  devices  for  microwave  signal  processing  and 
forming  [1],  Such  de\dces  are  considered  very 
promising  for  application  in  phase  array  antennas  and 
high-speed  opto-electronic  communication  systems  [2]. 
The  optically  controlled  K-band  oscillator  containing 
PIN  photodiode  [3],  optically  tuned  microwave  phase- 
shifter/attanuator  utilizing  liquid  cjystal  modulator  [4] 
and  optically  controlled  MESFET  structures  [5]  have 
been  designed  and  investigated. 

This  paper  demonstrates  for  the  first  time  an  optical 
control  of  microwave  signals  parameters  using  a 
magnetostatic  spin  wave  (MSW)  propagating  in  an 
epitaxial  yttrium-iron-gamet  (YIG)  film.  It  was  shown 
that  pulsed  laser  irradiation  of  the  film  resulted  in  a 
change  in  the  MSW  dispersion  characteristic  and  gave 
rise  to  a  considerable  variation  in  the  phase  shift  and 
delay  time  of  microwave  signal  passed  through  a  MSW 
transmission  line. 

2.  EXPERIMENTAL  SET-UP 

Fig.l  shows  a  schematic  view  of  the  MSW  transmksion 
line.  The  line  contained  an  YIG  film  placed  in  a  close 
contact  with  an  aluiiiiiia  subsLiaie.  The  fihii  of  the 
thickness  d=\AA  pm  and  magnetization  47cA/=1750  Gs 
(  at  the  room  temperature)  was  grown  on  one  side  of  a 
gallium-gadolinium-gamet  (GGG)  substrate  of  the 
thickness  0.5  mm  and  dimensions  of  3.5  mm  x  15  mm. 


Two  microstrip  transducers  of  the  width  of  50  pm 
deposited  on  the  alumina  substrate  at  the  distance  /=0.4 
cm  apart  were  used  to  excite  and  to  receive  MSWs.  An 
external  DC  magnetic  field  H=\216  Oe  was  applied 
tangentially  to  the  film  surface  along  the  transducers. 
An  input  microwave  signal  P{()  of  the  frequency  /=5-6 
GHz  feeded  to  the  first  transducer  excited  the  MSW 
propagating  in  the  film,  while  the  second  transducer 
converted  the  MSW  back  into  an  output  electromag¬ 
netic  signal  Pout(0-  At  the  same  time  the  film  was 
irradiated  (through  the  transparent  GGG  substrate) 
with  single  optical  pulses  of  the  energy  1F^2-100  mJ, 
duration  about  of  5  ns,  and  wavelength  of  0.53  pm 
firom  a  pulsed  Nd-laser. 


alumina  SGG -substrate  YIG  film 


Fig.  1  MSW  transmission  line 

Typical  frequency  response /(/)=ioig[P^^^(t)/P(t)j 
dispersion  characteristic  j{k)  of  the  MSW  transmission 
line  shown  in  Fig.  2  explain  the  nature  of  the 
microwave  interaction  in  the  YIG  film.  The  MSW  line 
dispersion  characteristic  is  well  described  by  the 
following  dispersion  equation  [6]: 

/"=/i +/»/«+ '^(1-6-“-')  (1) 


0-7803-2516-8/95  $4.00  ©  1995  IEEE 


521 


4.  CONCLUSION 


Time  ms 

Fig.  5  Time-dependence  of  the  YIG  film 
magnetization  AnM{t). 


Using  dispersion  equation  (1)  for  MSW  propagating  in 
the  ferrite  film,  measured  or  calculated  fi'equency 
response  L(f)  for  the  transmission  line,  and 
experimentally  found  time-dependence  47iM(/)  for  the 
film  magnetization,  one  can  calculate  all  characteristics 
of  the  microwave  signal  transformation  in  the  MSW 
transmission  hne  caused  by  pulsed  laser  irradiation  of 
the  YIG  film. 

For  example,  the  delay  time  for  microwave  signal  may 
be  calculated  using  the  following  relation: 

,-L.A _ L _  (3) 

(4 +/„/?)  2-/2 

For  our  case  the  dependence  t(/)  calculated  using 
Eq.(3)  and  experimentally  measured  time-dependence 
for  is  shown  in  Fig.6.  The  variation  in  the 

signal  delay  time  from  95  ns  up  to  60  ns  was 
accompanied  by  the  changes  in  the  transmission  line 
insertion  loss  by  5  dB. 

Dd^lime  t,  ns  Loss4  dB 


Tine  ^,nB 

Fig.  6  Time-dependence  of  the  microwave  signal 
delay  time  t(0  and  insertion  loss  L{t). 


It  was  demonstrated  that  pulsed  laser  irradiation 
provided  an  effective  control  of  the  MSW  transmission 
hne  characteristics.  Laser  pulses  of  the  energy  of  30  mJ 
and  duration  of  5  ns  resulted  in  heating  of  the  YIG 
film  by  Ar=17^C  and  decrease  in  its  magnetization  by 
A(47cM)=66  Gs.  For  microwave  signals  with  the 
frequency  j=51  GHz  the  optically  induced  variations 
in  the  phase  shift  by  IOtt  rad  and  in  the  delay  time  by 
decades  of  percent  have  been  obtained.  These  effects 
may  be  used  to  design  optically  controlled  microwave 
phase  shifters  and  delay  lines. 
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where  f}f=yH,  /m=y47iA/,  y=2.8  MHz/Oe  is  the 

giromagnetic  ratio,  is  the  ferrite  magnetization,  / 
and  k  is  the  MSW  frequency  and  wave  number, 
respectively. 


loss  L  (dB)  wove  number  Yl,  cm'^ 


Fxecpency 

/CK 


Fig.3  Dispersion  characteristic  of  the  MSW 
transmission  line. 


Fig.2  MSW  line  frequency  response  (a) 
and  dispersion  characteristic  (b). 


A  decrease  in  the  film  magnetization  (47cMi->47cM2) 
results  in  a  down  shift  of  both  the  frequency  response 
HJ)  and  the  dispersion  characteristic  j{k)  of  the  line. 
Under  pulsed  laser  irradiation  the  YIG  film  is  heated 
and  its  magnetization  is  decreased  with  the  coefficient 
5(47i:AO/9r^’’3.8  Gs/^C.  It  follows,  that  heating  of  the 
film  with  laser  pulses  have  to  change  the  phase  shift 
0=^7  as  well  as  the  insertion  loss  L  of  the  microwave 
signal  passed  through  the  MSW  transmission  line. 

3.  RESULTS  OF  INVESTIGATION 

The  dispersion  characteristic  of  MSW  transmission  line 
and  transformation  of  microwave  signal  under  pulsed 
laser  irradiation  of  the  ferrite  film  were  measured  by  the 
interferrometric  method.  The  input  signal  P{f)  was 
divided  and  part  of  it  was  then  mixed  with  the 
microwave  signal  from  the  transmission  line  output 
Pouxif)  having  the  phase  shift  0=A:/.  Measured  and 
calculated  dispersion  characteristics  of  the  line  under 
stationary  conditions  are  shown  in  Fig.3. 

The  time-dependence  of  the  phase  shift  0(7)  of  output 
microwave  signal  (shown  in  Fig.4)  under  pulsed  laser 
irradiation  was  found  by  measuring  the  period  of  the 
beats  at  the  line  output.  For  laser  pulse  energy  of 
30  mJ  and  the  signal  frequency  of  ^5713  MHz  the 
phase  shift  of  the  output  signal  was  increased  from 
Oi=107i  rad  .  up  to  02=1771  rad  during  the  time  of  the 
film  heating  (5  ns)  and  then  it  was  decreased  to  its 
initial  value  at  the  cooling  of  the  film. 


Fhase  shift 
C>  231  rad 


Tune  7,  ms 


Fig.  4  Time-dependence  of  the  signal  phase  shift. 


Using  dependence  of  the  ferrite  film  magnetization  on 
the  MSW  wave  number: 

and  experimentally  obtained  values  for  ^(7)=0(7)//  at 
fixed  /,  we  found  the  time-dependence  for  the  film 
magnetization  47t:M(7)  which  is  shown  in  Fig.  5.  In  our 
experiments  the  maximum  decrease  of  magnetization 
was  equal  to  A(47cA^niax=64  Gs  that  corresponded  to 
the  maximum  heating  of  the  film  by  Ar«17®C.  After 
the  end  of  the  laser  pulse  the  magnetization  was 
increased  to  its  initial  value  approximately  exponen¬ 
tially  with  a  characteristic  time  about  of  0.5  ms. 
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ABSTRACT 

A  self-consistent  model  of  coupling  in  an  active  twin- 
ridge  waveguide  coupler  is  presented,  based  on  the  beam 
propagation  method.  The  coupler  structure  is  assumed  to 
be  febricated  in  double-heterostructure  material  which  is 
electrically  pumped  to  provide  distributed  gain.  Stimulated 
recombination  in  the  active  region  perturbs  the  complex 
refractive  index  of  the  structure  and  this  results  in  non¬ 
linear  coupling. 

1.  INTRODUCTION 

There  have  been  a  number  of  significant  recent  develop¬ 
ments  in  a  range  of  advanced  waveguide  components  such 
as:  integrated  laser/waveguide/detector  structures,  wave¬ 
guide  coupled  DFB  sources  for  WDM  applications  and 
switchable  optical  routing  components  for  optical 
networks.  A  particular  problem  with  integrated-optic 
waveguide  components,  particularly  power  splitters  and 
couplers,  is  the  accumulation  of  losses  which  invariably 
reduces  the  bit  error  rate  performance  of  the  system.  A 
possible  solution  to  this  problem  is  to  incorporate 
amplifying  sections  within  the  integrated-optic  structure 
by  the  use  of  a  conventional  double  heterostructure 
substrate  (with  a  bulk  or  quantum  well  active  layer)  which 
can  be  locally  pumped  with  injection  current  to  provide  a 
region  of  net  gain.  One  consequence  of  this  is  that  there  is 
an  interaction  between  the  photons  and  the  carriers  in  the 
active  layer  which  gives  rise  to  lateral  and  longitudinal 


variations  in  the  complex  refractive  index  of  the  active 
layer  of  the  waveguide  and  this  affects  the  evolution  of  the 
guided  modes.  This  effect  is  particularly  strong  at 
wavelengths  close  to  the  emission  wavelength  of  the 
active  layer.  In  certain  applications,  such  as  optical  or 
current-controlled  spatial  switching,  this  interaction  can  be 
exploited. 

Jensen  [1]  and  Thylen  et  al  [2]  have  examined  the  effect  of 
non-linear  coupling  of  the  refractive  index  to  the  photon 
density  in  an  otherwise  passive  twin -guide  structure.  In 
this  paper,  a  self-consistent  model  of  a  twin-ridge  coupler 
is  presented  in  which  there  is  distributed  gain  in  each 
waveguide  and  the  effects  of  lateral  carrier  diffusion  and  the 
interaction  of  the  photons  with  the  carriers  are  included. 

2.  THE  MODEL 

Figure  1  shows  the  structure  of  the  waveguide.  Each  guide 
is  independently  pumped,  with  current  confinement  being 
provided  by  the  etched  ridges.  It  is  assumed  that  there  is 
good  current  confinement  in  the  p-type  cladding  layer  but 
there  is  electrical  coupling  between  the  guides  due  to 
lateral  carrier  diffusion  in  the  active  layer.  This  leads  to  a 
lateral  variation  in  the  real  part  of  the  effective  refractive 
index  due  the  ridge  structure  and  an  additional  variation  in 
both  real  and  imaginary  parts  due  to  the  carriers.  In  this 
model,  good  transverse  optical  confinement  is  assumed 
within  the  active  layer  of  the  DH  substrate  and  only  the 
lateral  variation  of  the  TE  mode  propagating  along  the 
guide  is  considered. 

In  common  with  many  other  authors,  the  effect  of  the 
ridge  geometry  is  modelled  using  the  effective  index 
method  [3]  to  reduce  the  computational  overhead,  and  the 
longitudinal  variation  of  the  lateral  mode  is  obtained  by 
the  beam  propagation  method  [4-7]. 

The  longitudinal  variation  in  photon  density  gives  rise  to 
a  longitudinal  variation  of  the  lateral  carrier  density 
distribution  leading  to  possible  gain  saturation.  This  effect 
is  specifically  included  in  the  model.  The  active  waveguide 
problem  may  be  split  into  two  coupled  sub-models,  an 
electrical  model  and  an  optical  model. 
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2.1,  Electrical  Model 

The  lateral  carrier  density  distribution  at  some  point  z 
along  the  waveguide  structure,  n(x,z)  can  be  found  from  a 
solution  of  the  diffusion  equation: 

d^n(x,z)  J(x)  _ 

dx'  e.d  (1) 

Kim  («'  z)  +  (« -  z)  +  (n ,  z) 

where  J(x)  is  the  cuirent  density  distribution  in  the  active 
layer,  Damb  is  the  ambipolar  diffusion  coefficient,  d  is  the 
thickness  of  the  active  layer.  Carrier  diffusion  in  the 
longitudinal  direction  has  been  neglected.  The  terms  on 
the  right  hand  side  of  the  equation  are  recombination  terms 
representing  stimulated,  spontaneous  and  non-radiative 
recombination.  These  are  assumed  to  be  of  the  form: 

Kmi'^>^)  =  —S{X,z)S{X,z) 

% 

Kon(»>^)=BA»  +  Po)  (2) 

Kn-rajC't’  z)=--+Cn{n  +  p„f 
r 

where  is  the  bimolecular  recombination  coefficient,  C 
is  the  Auger  recombination  term,  po  is  the  active  layer 
doping  density  and  T  is  a  characteristic  lifetime.  In  the 
stimulated  recombination  term,  g(x»z)  =  an(x,z)  -  b  is  the 
optical  gain  of  the  material,  c/tj^  is  the  group  velocity  of 
the  light  in  the  waveguide,  S(x,z)  is  the  photon  density 
distribution  and  r]o  is  the  background  refractive  index  of 
the  active  layer. 

2.2  Optical  Model 

For  non-linear  multi-guide  structures,  where  the  fields  in 
each  guide  may  not  be  phase  matched,  it  is  difficult  to 
apply  coupled  mode  theory.  The  beam  propagation  method 
is  particularly  appropriate  to  this  type  of  problem  because 
it  provides  a  unified  treatment  of  the  various  guided  and 
radiation  fields.  It  is  assumed  that  the  lateral  field,  ^(x,z), 
satisfies  the  paraxial  wave  equation: 

-2jk^^+^-^  +  [e^ff{x,z)kl-k^\'F  =  Q  (3) 

dz  dx^  ^  ^  ’ 

where  /tq  =  and  that  the  effective  complex  dielectric 
constant  of  the  active  layer  in  the  lateral  direction  has  been 
modified  to  take  into  account  the  effect  of  the  carriers  to 
yield: 

eefj(x,z)  =  ^ eact(x)  +  (-R  +  j)  J  (4) 


where  Sacdx)  is  the  effective  background  dielectric 
constant  of  the  active  layer  found  by  the  El  method  and  B 
is  the  ratio  of  the  change  of  the  real  part  of  the  refractive 
index  due  to  the  carrier  density.  The  evolution  of  the 
lateral  field  can  now  be  found  using  the  beam  propagation 
method,  with  appropriate  correction  for  the  gain,  as 
described  qualitatively  by  Lagasse  and  Baets  [5]  and  in 
greater  detail  in  references  [8]  and  [9].  The  procedure  is 
well  known  and  involves  splitting  the  guide  longitudi¬ 
nally  into  a  number  of  segments  of  length  25.  In  each  of 
these  segments,  distributed  gain  is  replaced  by  an 
equivalent  value  of  lumped  gain  in  the  centre  of  the 
segment.  The  paraxial  equation  (3)  is  rewritten  as 
J'P/c/z  =  (A  +  B(z))‘Fwhere  A  and  B  are  non-commut¬ 
ating  diffraction  and  complex  gain  operators,  respectively. 
If  is  the  lateral  field  at  the  i^^  station,  then  the  error  in 
computing  'Fy+y  is  minimised  [7]  when: 


- 

(zs 

SA 

e  exp 

J  B{z*)dz' 

SA 

e 

_ 

This  equation  is  solved  by  first  free  propagating  the  field 
over  a  distance  S  to  the  edge  of  the  gain  sheet.  This  is 
achieved  most  conveniently  by  first  expanding  ‘P/  as  its 
spectrum  of  plane  waves.  The  resulting  field  at  the  centre 
of  the  segment  is  then  operated  on  by  a  ‘lens’  of  complex 
refractive  index  prior  to  further  free  propagation  through  a 
distance  5  to  the  edge  of  the  segment. 

2.3  Solution  method 

A  multi-layer  effective  index  method  was  used  to  calculate 
the  effective  background  indices  of  the  guiding  and 
cladding  regions  of  the  lateral  active  guide  at  the 
wavelength  of  interest.  Longitudinally,  the  substrate  was 
split  into  short  [5]  segments  of  typically  O.lpm  length. 
The  arbitrary  optical  field  at  the  input  to  the  guide  was 
propagated  to  the  mid-point  of  the  segment  using  the 
BPM  algorithm  with  a  256  point  FFT,  and  the  photon 
density  distribution,  S(x,z),  obtained. 

The  carrier  diffusion  equation  was  then  solved  numerically 
at  this  point  using  S(x,z)  and  an  efficient  iterative  method 
with  derivative  boundary  conditions,  dn/dx  =  0  was 
employed.  From  the  solution  of  n(x,z),  the  gain  and 
complex  refractive  index  profiles  were  then  obtained.  The 
optical  field  was  then  propagated  to  the  end  of  the  first 
segment  using  the  new  value  of  complex  refractive  index 
in  the  phase  correction  term  of  the  BPM  algorithm,  as 
described  above. 

To  prevent  reflection  of  the  propagating  field  at  the  outer 
edges  of  the  structure,  optical  absorbing  regions  of  graded 
absorbency  were  used  in  the  usual  way. 
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3.  RESULTS 

For  the  purpose  of  this  paper,  the  method  was  applied  to 
an  active  twin-ridge  waveguide  coupler  assumed  to  be 
fabricated  from  GaAs/GaAlAs  dh  material.  Table  1  lists 
the  principal  parameter  values  [10]. 


guide  width,  wj  =  W2 

3|i.m 

guide  spacing,  D 

Spin 

active  layer  thickness,  d 

0.2pm  GaAs 

cladding  layer  thicknesses 

2pm  GaAlAs 

ridge  etch  depth,  t 

1.8pm 

percentage  A1  content  in  cladding  layer 

25% 

non-radiative  lifetime,  t 

10-»s 

ambipolar  diffusion  coefficient,  Damb 

68  cm^s"^ 

bimolecular  recombination  rate, 

10'^^  m^s"^ 

gain  coefficient,  a 

8.9  X  10'^^  m^ 

gain  coefficient,  b 

7.8  X  10^  in^ 

transverse  optical  confinement  factor  F 

0.75 

wavelength,  X 

0.88pm 

Table  1  Parameters  used  in  the  simulation 


Figure  2  shows  the  development  of  the  optical  field 
intensity  of  a  passive  twin-ridge  waveguide  structure,  of 
dimensions  given  in  table  1. 


X  axis  (pm) 

Figure  2.  Evolution  of  the  field  intensity  in  a  passive 
coupler,  dimensions  as  Table  1  and  effective  refractive 
index  step  0.0045 

In  this  example,  all  the  layers  are  assumed  to  be  lossless. 
Optical  injection  is  into  guide  1  and  classic  coupling 
behaviour  is  observed.  The  coupling  length  is  close  to 
that  obtained  from  coupled  mode  theory.  This  behaviour 
can  now  be  contrasted  with  the  case  for  the  symmetrically 
pumped  twin-ridge  coupler  electrically  pumped  just  above 
its  transparency  value  (measured  between  the  input  and 
output  ports  for  an  input  power  of  ImW),  shown  in  figure 
3.  In  this  case,  light  is  coupled  from  guide  1  into  guide  2 


within  iOOjrm.  There  is  a  rapid  increase  in  intensity  in 
guide  2  because  the  guide  has  excess  gain  for  the  initial 
levels  of  light  coupled  into  it  from  guide  1,  and  there  is 
amplification.  As  more  light  is  coupled  along  the  guide, 
the  intensity  builds  up  and  causes  significant  stimulated 
recombination  and  the  guide  starts  to  saturate.  Light 
couples  back  into  guide  1,  and  this  too  is  amplified  and 
starts  to  saturate. 


Figure  3.  Beam  propagation  in  a  symmetrically  pumped 
active  twin -ridge  coupler  at  input  power  of  ImW  and 
Jguide  “  1.1x10^  Am  ^ 

Figure  4  shows  the  same  structure  extended  over  3mm  but 
with  an  optical  input  power  of  lOmW,  where  the  two 
travelling  wave  amplifiers  are  now  very  heavily  gain 
saturated. 


Figure  4.  Evolution  of  field  along  symmetrically 
pumped  active  twin-ridge  waveguide  coupler.  Jguide  = 
l.5x  10^  Am'^, Optical  input  power  =  lOmW 
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Apart  from  the  initial  transient  behaviour,  shown  in  figure 
3,  once  the  light  has  coupled  into  guide  2,  the  light  is 
effectively  confined  to  both  guides,  and  the  intensity  stays 
reasonably  constant  due  to  the  effect  of  gain  saturation. 

By  modifying  the  relative  injection  currents  between  the 
guides,  a  number  of  different  coupling  modes  are  observed, 
indicating  that  the  device  could  be  used  as  an  electronically 
controllable  spatial  switch  in  OOK  optical  system.  It  is  of 
practical  benefit  to  note  that  the  length  of  the  active 
coupler  is  now  no  longer  critical  in  obtaining  the  required 
coupled  optical  power  into  each  guide. 

Figure  5,  shows  how  the  coupler  can  be  configured  to 
couple  from  guide  1  to  predominantly  guide  2.  In  this 
case,  guide  2  is  relatively  heavily  pumped  to  provide  gain 
whilst  guide  1  is  not  pumped  sufficiently  to  overcome  the 
inherent  loss  in  the  active  layer  of  the  guide.  Because  of 
the  strong  coupling  between  the  guides  due  to  the  effects 
of  carrier  diffusion  on  the  complex  refractive  index,  a 
significant  proportion  of  the  light  has  coupled  across  to 
guide  2  before  it  is  totally  absorbed  in  guide  1.  The  light 
that  has  coupled  to  guide  2  is  then  maintained  by  the  gain 
in  that  guide. 


Figure  5.  Evolution  of  the  field  in  an  asymmetrically 
pumped  active  coupler  at  2mW  input  power 
(Jguidei  =  0-8x10^ Acm-2  and  Jguide2=  13x10^  Am"^) 

4.  CONCLUSIONS 

A  self-consistent  model  of  the  active  twin-ridge  coupler 
has  been  presented  which  predicts  the  longitudinal 
evolution  of  the  optical  field.  It  is  found  that  the  strong 
electrical  coupling  between  the  guides  significantly 
modifies  the  coupling  length,  compared  with  the  passive 
case.  Current  injection  into  the  two  guides  controls  the 
intensity  of  the  field  beneath  each  guide  extremely 
effectively  by  adjusting  the  complex  refractive  index 
profile  and  this  can  be  exploited  for  optical  routing  appli¬ 
cations.  Gain  saturation,  however,  provides  a  feedback 


mechanism  and  this  damps  the  transfer  of  optical  power 
between  the  two  guides,  which  implies  that  the  physical 
length  of  the  active  component  is  no  longer  critical. 
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ABSTRACT 

The  realization  of  spectral  encoding  of  low-cost  broad¬ 
band  light  sources  using  acoustically  tunable  optical 
filters  is  discussed.  We  show  the  feasibility  of  source 
equalization  and  spectral  code  generation  by  some  ex¬ 
periments.  Finally,  for  the  first  time  a  polarization  in¬ 
dependent  single-chip  solution  for  transmitter  and  re¬ 
ceiver  is  proposed. 

1.  INTRODUCTION 

The  first  optical  code  division  multiple  access  (CDMA) 
systems  were  proposed  in  the  middle  of  the  eighties  by 
Hui  [1],  Prucnal  et  al.  [2]  and  Tamura  et  al.  [3]  for 
third-generation  communication  networks.  The  basic 
idea  was  to  employ  the  well  known  spread  spectrum 
technic  from  radio  frequency  in  an  optical  communica¬ 
tion  system  to  take  advantage  of  the  huge  bandwidth 
of  the  transmission  medium.  Afterwards  many  new  ap¬ 
proaches  were  proposed  [4]  which  make  a  better  use  of 
the  special  properties  of  light  and  fiber-optics. 

In  contrast  to  optical  time  division  multiple  ac¬ 
cess  (TDMA)  and  wavelength  division  multiple  access 
(WDMA)  in  CDMA  systems  no  time  synchronization 
or  frequency  stabilization  and  no  preparatory  infor¬ 
mation/coordination  (tell-and-go)  is  necessary.  While 
in  a  TDMA  and  WDMA  environment  the  channel- 
allocation  is  static  and  has  to  be  orientated  at  the  max¬ 
imum  number  of  possible  users,  in  CDMA  a  dynamic 
channel  allocation  depending  on  the  number  of  active 
users  is  implemented.  This  could  be  an  advantage  at 
bursty  traffic  [5]  and  an  easy  network  expansion  is  pos¬ 
sible.  Optical  CDMA  systems  offer  a  greater  immuni¬ 
ty  against  fibre  nonlinearities  and  have  a  high  safety 
against  interception.  Moreover  the  use  of  optical  stan¬ 
dard  components  is  possible. 

Optical  CDMA  communication  networks  employing 
spectral  encoded  low-cost  broadband  light  sources  like 


luminescence  diodes  (LED)  have  already  been  present¬ 
ed  [6-8].  The  concept  is  shown  in  Fig.  1.  To  every  sub¬ 
scriber  a  cyclic  shifted  m-sequence  is  assigned.  Other 
code  families  like  Hadamard  sequences  are  also  possi¬ 
ble,  If  the  desired  sequence  (coded  spectrum)  arrives 
at  the  receiver  the  upper  diode  detects  the  energy  of 
all  ones,  at  the  lower  diode  no  energy  is  received  be¬ 
cause  of  the  bandstop  case  (Fig.  1).  If  an  undesired 
sequence  arrives  at  the  receiver  both  diodes  detect  the 
half  of  the  one^s  and  after  the  balanced  receiver  no 
signal  is  observed.  Complete  orthogonal  transmission 
could  be  achieved,  providing  that  the  spectrum  is  prop¬ 
erly  equalized. 

Drawbacks  of  the  known  system  are  the  rather  com¬ 
plicated  construction  of  the  bulk  optic  encoder/decoder 
as  well  as  the  address  reconfiguration  based  either  on 
fixed  amplitude  mask  or  LCD-arrangement.  In  [9]  we 
show  the  feasibility  of  spectral  encoding  with  acous¬ 
tically  tunable  optical  filters  (ATOF).  In  the  following 
the  realization  of  an  all-optical  communication  network 
is  discussed. 
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Figure  1:  Block  diagram  of  one  transmitter  and  receiv¬ 
er,  example  for  m-sequences  of  period  7 
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2.  SPECTRAL  ENCODING  WITH 

ACOUSTICALLY  TUNABLE  OPTICAL 
FILTERS 

2.1.  Why  using  ATOF  ? 

The  basic  concept  of  ATOF  is  the  interaction  of  an 
acoustic  and  an  optic  wave.  In  a  LiNbOa-Chip  an 
acoustic  waveguide  (with  a  width  of  about  100/zm)  is 
integrated  around  an  optical  single  mode  waveguide. 
The  propagation  constants  of  the  optical  polarization 
modes  (TE  and  TM)  differs  because  of  the  birefrin¬ 
gence,  By  applying  of  an  acoustic  wave  with  the  same 
wavelength  as  the  beat  wavelength  of  the  two  polar¬ 
ization  modes  an  interaction  between  these  modes  can 
occur  and  light  will  be  converted  from  TE  to  TM  for  ex- 
ample.The  birefringence  is  wavelength  dependent.  By 
applying  a  single  electrical  frequency  to  such  an  de¬ 
vice,  only  for  one  optical  wavelength  conversion  can 
be  observed.  If  we  assume  a  constant  acoustic  pow¬ 
er  between  the  transducer  (transmitter  of  the  acoustic 
wave)  and  the  absorber,  the  filter  characteristic  is  like 
a  si^-function  (according  to  the  fourier  transformation 
of  a  rectangle).  The  power  of  the  acoustic  wave,  and 
therefore  the  voltage  of  the  electrical  drive  signal  deter¬ 
mines  the  degree  of  conversion.  For  100%  conversion 
voltages  of  some  lOOmV  will  be  enough  (available  fil¬ 
ters  for  the  first  telecommunication  window).  Increase 
of  the  voltage  over  the  100%  conversion  point  gives  a 
back  conversion  into  the  incoming  polarization  mode. 

By  the  use  of  TE  polarized  light  at  the  input  of  an 
ATOF  a  passband  filter  or  a  band  rejection  filter  can 
be  realized,  depending  on  the  use  of  an  TE  or  a  TM  po¬ 
larization  filter  at  the  ATOF  output.  The  linear  char¬ 
acteristic  of  the  acoustic  wave  transmitter  allows  the 
use  of  several  drive  frequencies  (more  than  100)  at  the 
same  time.  The  limit  is  given  by  the  maximum  applied 
voltage  at  the  transducer.  By  the  choice  of  suitable 
drive  frequencies  and  voltages,  nearly  every  thinkable 
filter  function  can  be  realized.  The  spectral  resolution 
is  determined  by  the  interaction  length  of  the  acoustic 
wave  on  the  ATOF  (some  cm  of  length  will  give  a  res¬ 
olution  of  some  tens  of  nm  in  the  first  telecommunica¬ 
tion  window).  This  paper  represents  the  realization  of 
spectral  encoding  of  a  20nm  wide  SLD-spectrum  with  a 
m-sequence  of  period  31,  With  variation  of  the  voltage 
of  the  several  drive  frequencies  a  spectral  flattening  of 
the  gaussian  SLD-power  spectrum  was  realized.  So  an 
ATOF  has  some  important  properties  for  the  using  as 
spectral  encoder: 

♦  the  filter  structure  can  be  determined  with  the 
choice  of  the  spectrum  of  the  electrical  drive  sig¬ 
nal, 


•  very  long  sequences  are  possible  by  the  use  of 
about  100  frequencies  at  the  same  time, 

•  only  low  drive  powers  (in  the  range  of  mW)  are 
necessary, 

•  spectral  flattening  of  the  SLD  spectrum  is  simple 
to  made  with  variation  of  the  drive  voltages  of 
the  several  lines, 

•  passband  and  rejection  filters  can  be  realized  with 
different  polarization  filters  (or  more  simple:  a 
polarization  dividing  splitter  gives  inverse  func¬ 
tion  at  two  outputs), 

•  because  of  using  of  only  one  waveguide  without 
any  free  space  optics  the  insertion  loss  of  the  el¬ 
ement  can  be  minimized, 

•  the  size  of  the  filter  is  in  the  cm  range, 

•  the  filter  function  can  be  changed  in  some  //s  only. 
Disadvantages  of  the  ATOF  are: 

•  necessity  of  permanent  drive  signal, 

•  the  temperature  dependence  of  the  birefringence. 

2.2.  Source  and  filter-parameters 

For  the  described  experiment  we  have  used  SLD  with  a 
spectral  width  of  about  20nm  and  a  center  wavelength 
of  825nm.  The  spectral  ripple  was  lower  than  5%  with 
a  measurement  resolution  of  O.lnm.  No  spectral  distor¬ 
tions  are  observable  with  modulation  up  to  100  Mbit/s. 
The  used  ATOF  have  an  interaction  length  of  18mm 
and  therefore  a  spectral  resolution  of  1/2  nm.  Total 
conversion  occurs  with  a  driving  voltage  of  400mVe//. 
The  tuning  rate  (dependence  of  optical  filter  wave¬ 
length  from  driving  frequency)  is  (2ib0.1)nm/MHz  over 
the  used  spectral  range. 

2.3.  Spectral  code  generation 

Fig.  2  shows  the  measured  spectrum  after  the  ATOF 
encoder,  applying  a  m-sequence  of  period  31.  For  the 
generation  of  the  16  different  frequencies  we  used  an 
arbitrary  function  generator.  The  frequency  shift  of 
two  direct  adjacent  frequencies  are  270kHz.  This  gives 
a  periodicity  of  the  driving  signal  of  3.704/xs,  matching 
to  the  propagation  time  of  the  acoustic  wave  on  the 
chip.  The  whole  driving  spectrum  has  a  bandwidth  of 
8.1MHz  and  was  first  generated  in  the  60  to  70  MHz 
region  an  than  shifted  in  the  required  region  of  350  to 
360  MHz  with  a  passive  mixer.  In  order  to  minimize  in¬ 
terference  phenomena  between  the  driving  frequencies, 
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Figure  2:  Measured  spectrum  after  the  ATOF  encoder 
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Figure  3:  Two  spectra  and  the  inverted  case 

there  phase  positions  were  optimized.  In  Fig.  2  two 
spectra  are  shown.  The  first  was  get  with  the  same 
driving  voltage  for  every  frequency.  The  envelope  rep¬ 
resents  the  spectrum  of  the  SLD,  For  the  second  spec¬ 
trum,  the  voltages  of  the  outer  lines  were  increased.  By 
this  we  achieve  a  nearly  flattened  spectrum  for  the  se¬ 
quence.  Fig.  3  shows  two  spectra  for  the  same  sequence 
and  the  inverted  case. 

2.4.  Correlation  properties 

For  a  complete  transmission  system  with  spectral  en¬ 
coding,  the  processes  of  encoding  and  decoding  have 
to  be  carried  out  afterwards.  Fig.  4  shows  a  spectrum 
after  an  ATOF-pair.  At  both  filters  the  same  sequence 
was  applied  (without  equalizing).  The  sequence  can 
be  identified  by  comparing  with  the  inserted  code  in 
Fig.  4.  The  correlation  properties  of  the  use  ATOF- 
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Figure  4:  Spectrum  after  an  ATOF-pair 

pair  were  investigated  by  applying  the  original  and  the 
inverted  sequence  at  the  second  filter.  Than  both  driv¬ 
ing  signals  were  frequency  shifted  against  the  matched 
position  (Fig.  5). 
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Figure  5:  Frequency  shifted  m-sequence  against  the 
matched  position.  Y- values  represents  total  optical 
power  after  the  ATOF-pair 


3.  FURTHER  INVESTIGATIONS 

The  disadvantage  of  the  described  solution  is  there  po¬ 
larization  dependence.  Fig.  6  shows  a  proposed  con¬ 
ception  with  polarization  splitter  at  the  input  and  out¬ 
put.  This  filter  works  independent  of  polarization  at 
the  input,  the  two  output  ports  give  the  converted  and 
non  converted  fraction  of  the  passed  light,  respectively. 
That’s  why  the  whole  system  has  only  the  loss  of  the 
sequence  (3dB  because  of  the  50%  share  of  “T’-chips  in 
the  sequence)  and  twice  the  additional  insertion  loss  of 


531 


Acoustical  Tunable  Optical  Filter  (Transmitter) 


6.  REFERENCES 


Figure  6:  Proposed  device  with  a  polarization  splitter 
at  the  input  and  output 

the  filters.  The  same  device  can  be  used  in  the  trans¬ 
mitter  and  the  receiver. 

An  important  advantage  of  the  ATOF  is  its  simple 
tunability.  The  desired  user  can  be  selected  in  some 
/is.  For  applications  with  fixed  channel  arrangement 
no  change  of  the  filter  function  is  necessary.  Here  the 
use  of  passive  optical  devices  is  preferred.  We  propose 
the  use  of  arrayed  waveguide  structures  or  cascaded 
fibre  bragg  gratings.  With  arrayed  waveguide  the  mul¬ 
tiplexing  of  64  channels  [10]  and  a  channel  separation 
of  lOGHz  [11]  was  achieved.  Fibre  bragg  grating  can  be 
realized  as  band  rejection  filters  with  a  very  low  band¬ 
width,  very  high  attenuation  at  the  filter  wavelength  an 
very  low  insertion  loss  far  away  from  these  wavelength 
[12]. 

4.  CONCLUSIONS 

The  advantages  and  the  feasibility  of  spectral  encod¬ 
ing  with  ATOF  has  been  shown.  The  proposed  con¬ 
cept  offers  an  interesting  alternative  to  future  densed 
WDM-Sy stems.  It  appears  that  more  users  with  higher 
signal-to-interference  ratio  are  possible  in  comparison 
to  a  WDM-System  with  ATOF  and  low-cost  broadband 
source.  Moreover  the  CDMA  concept  allows  a  higher 
system  stability. 
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ABSTRACT 

Quasiballistic  longitudinal  electron  transport  in  the  2D 
and  ID  heterostuctures  (or  electron  waveguides  (EW)) 
can  be  effectively  controlled  by  varying  transverse 
potential  distribution,  dimensions  or  shape  of  an  EW 
crossection,  since  any  such  deformation  appears  as  a 
"quasipotential"  barrier  or  a  “quasipotential"  well. 
Specific  properties  of  these  quasipotential  barriers  and 
wells  as  well  as  their  device  applications  are  analyzed. 

The  examples  include: 

1.  Control  of  electron  transport  through  a  short  section  of 
a  2D  EW  by  varying  its  transverse  dimensions  or 
transverse  potential  distribution. 

2.  Formation  of  population  inversion  of  the  electron 
energy  subbands  in  EWs  with  step-like  discontinuities. 

3.  Interference  of  different  electron  "modes"  in  non 
regular  EWs  and  application  of  this  effect  for  design  of: 
single  channel  interference  transistors  and  electron  energy 
filters. 

1.  INTRODUCTION 

It  is  well  known,  that  longitudinal  ballistic  electron 
transport  in  low  dimensional  (ID,  2D)  semiconductor 
structures  may  be  described  as  propagation  of  electron 
waves  in  EWs.  Recently  there  was  shown  effectiveness  of 
the  analogy  between  electron  and  electromagnetic  waves 
as  a  tool  for  calculation  of  electron  waves  propagation 
along  EWs  of  different  crossections.  But  as  far  as  we 
know,  all  published  works  were  devoted  to  analyses  of 
regular  EWs.  Meanwhile  one  can  suppose,  that  future 
quasiballistic  nanoelectronics  devices  will  be  based  on 
effects  arising  in  non  regular  EWs,  particularly,  in  the 
EWs  with  step-like  discontinuities.  Typical  examples  of 
these  effects  and  devices  will  be  presented  in  this  report 
Main  effects  to  be  considered  are  the  quantum 
mechanical  electron  reflection  and  tunneling  at 
quasipotential  barriers  and  wells,  electron  waves 
interference  and  transformation  in  nonregular  EWs.  The 
use  of  the  electron-electromagnetic  waves  analogy 
allowed  us  to  apply  software  created  earlier  for  dielectric 
or  metallic  waveguides.  In  the  simplest  way  this 


procedure  may  be  applied  to  the  plain  2D  EW  with 
infinitely  high  lateral  barriers,  where  electron  waves  may 
be  simulated  by  the  electromagnetic  TEo,n  (n=l,2...) 
wave  modes  in  a  plain  waveguides  with  metallic  walls. 

2.  QUASIPOTENTIAL  BARRIERS  AND  QUANTUM 
WELLS. 

The  high  speed  performance  of  electron  devices  is 
determined  by  the  effectiveness  of  methods  for  electron 
current  control.  In  common  semiconductor  devices  this 
control  is  made  by  the  help  of  potential  barriers,  formed 
by  p-n  junctions,  metal-semiconductor  junctions  and  so 
on.  Quantum  sized  structures  with  electron  gas  of  reduced 
dimensionality  —ID  and  2D  electron  waveguids  open  new 
possibilities.  We  will  show  that  electron  transport  in  the 
electron  waveguides  can  be  controlled  by  varying 
dimensions  or  form  of  its  crossection,  because  of  any 
deformation  of  the  waveguide  crossection  is  equivalent  to 
a  potential  barrier  or  a  quantum  well  appearance. 

Let  us  consider  the  ballistic  electron  motion  along  the  axis 
Z  of  rectilinear  electron  waveguide  restricted  at  its 
boundaries  by  a  potential  barrier.  At  the  absence  of  the 
longitudinal  electric  field  a  solution  of  the  time 
independent  Shrodinger  equation  can  be  written  as 

y,  z,  t)  =  \|/^  (x,  y)exp[i(©t  -  yz)]  (1) 

The  transverse  wave  function  \|4  (x,  y  satisfies  to 
Helmholtz  equation 

^x.yM4  +PVi  =0, 

2m  (2) 

^^=e-y\e=^{E-Y). 

■p  / 

Here  0  =  ,  E-the  total  electron  energy,  V  =  V(x,y)  - 

potential  electron  energy,  m-  electron  effective  mass,  271^ 
-  Plank  constant.  If  the  boundary  barriers  are  thick 
enough,  the  spectrum  of  the  transverse  wave  numbers  p 
would  consist  of  the  finite  number  of  discrete  values 
1  .(n,l=,l,2,...,N,L).  for  ID  structures  and  (n  = 

1,2,...,N)  for  2D  structures.  The  values  of  the  wave 
number  depend  on  the  boundary  barrier  height  and  on  the 
waveguide  crossection  dimensions.  Corresponding  wave 
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functions  determine  the  electron  waves  (wave  modes) 
spectrum. 

Vn.i  =  exp[-i(a)t  -  Yz)}i4„ , . .  (3), 

Each  wave  mode  describes  an  electron  state  in  an  energy 
subband  with  a  fixed  transverse  energy 

T.  02 

^In,!  “  2ni  ^ 

and  with  the  longitudinal  propagation  constant  equal,  at 
V(x,y)  =  V  =  const,  to 


Yn,l  = 


V+E, 


to  the  waveguide  axis. 


Only  lowest  wave  modes  (having  small  n,l)  propagate 
along  waveguide  without  damping.  They  correspond  to 
electrons  having  kinetic  energy  E  -  V  >  Ej^  .  These 
electrons  move  in  the  waveguide  along  zigzag  traectories 

fP  ,/  ^ 

at  angles  j  =  arctg  to  the  waveguide  axis. 

Higher,  exponentially  damping  wave  modes  are  excited  at 
waveguide  discontinuities,  they  correspond  to  electrons 
scattered  at  angles  other  than  j..  In  equation  (5)  item 

Ej^n,l  is  additive  to  the  internal  waveguide  potential  V 
and  may  be  called  "quasipotential”.  One  can  introduce 
into  the  waveguide  a  quasipotential  barrier  (QPB)  or  a 
quasipotential  well  (OPW)  by  varying  at  some  place  a 
form  or  dimensions  of  a  waveguide  crossection  and 
control  by  this  way  an  electron  transport.  As  example  we 
will  consider  a  2D  electron  waveguide  formed  by  two 
infinitely  high  potential  barriers  which  are  parallel  one  to 
another.  Let  us  suppose  that  a  waveguide  section  II 
having  width  W  and  length  1  is  inserted  in  a  regular 
waveguide  I  with  width  w.  The  inner  potential  V  is 
supposed  to  be  constant  and  equal  to  null.  Section  II 
forms  for  electrons  in  n-th  subband  a  quasi  potential 
barrier  (if  W  <  w  )  or  quasipotential  well  (if  W  >w).  For 
the  evaluation  of  an  electron  transmission  coefficient  T  of 
the  section  II  it  is  necessary  to  "sew  together",  the  wave 
functions  of  all  wave  modes  at  it's  boundaries.  This 
problem  simplifies  itself  if  the  electron  energy  E  lies  in 

the  interval  i  ^  ^  Er  ^  (Eh)  so  that  only  one  lowest 

wave  mode  propagate  through  the  input  and  output 
sections  of  the  waveguide  I.  At  this  condition,  one  has, 
neglecting  excitation  of  the  higher  modes: 

2exp(-iY.il) 

T= - 7 - -  (6) 

[  Y  iW  Y  iiW 

2cosy  iil-i  — ^ —  +  ~ —  sinY  nl 
■"  Iy,uW  Y,wj 

where. Yf ii  =  indexes  i  and  ii  relate  to 

waveguides  I  and  II  correspondingly.  Formula  (6)  will 


2cosY_iil 


sinY,iil 


coincide  with  the  well  known  expression  for  a  flat 
potential  barrier  or  potential  well  [1],  if  one  put  W  =  w, 

Pj  j  =  0,  pj ;;  =  .  We  have  investigated  the 

features  of  the  QPB  in  electron  waveguides  by  the 
numerical  simulation  using  algorithm  derived  earlier  for 
electromagnetic  wave  waveguides,  taking  into  account  all 
wave  modes  [2].  There  was  found  that  electron  scattering 
at  QPB  influences  the  electron  transmission  probability 
decreasing  the  steepness  of  its  dependence  on  the  electron 
energy  because  of  higher  modes  excitation. 

3.  POPULATION  INVERSION. 

New  effects  arise  in  an  enlarged  waveguide  section  where 
two  or  more  electron  wave  modes  can  propagate.  One  of 
the  those  effects  is  the  population  inversion  between 
discrete  energy  levels  (energy  subbands)  at  longitudinal 
electron  transport  through  nonregular  quantum  channels. 
Let  us  consider,  for  example,  electron  transport  along  a 
plane  2D  electron  waveguide,  that  width  changes  stepwise 
from  ai  to  an  >  ai  at  Z  =  Z1  .  Suppose  that  the  electron 
energy  E  lies  between  the  first  two  discrete  transverse 
energy  levels  in  the  first  waveguide  section  I  with  the 
width  ai: 

E„  s  E, :  <  E  <  El  i .where  E- ;  =  — ttt  1^  . 


E  >  Ei ;;  where  E--. 


-j  >  (7) 


then  the  higher  modes  with  j  =  2,3, ...can  be  excited  in  the 
second  section  II.  In  other  words  at  these  conditions 
electrons  distribute  themselves  between  several  energy 
subbands  that  lower  band-edges  are  determined  by  (7). 
Alignment  of  the  electron  numbers  at  different  subbands 
is  conditioned  by  the  overlapping  of  the  corresponding 

lYj'lT/  ,  r— L. 


Fig.  1  Relative  electron  populations  in  the  first  four  energy 
subbands  of  the  waveguide  section  II  in  dependence  on 
the  relative  electron  energy;  an  /  ai=  2.2. 

transverse  wave  functions  %j^ii(x)  and  V|/ii^i(x),  namely 
by  the  integral: 


The  more  is  a  value  of  this  integral,  the  more  is  a  number 

of  electrons  in  a  corresponding  subband,  proportional  to 

2 

\|/j  .  The  population  inversion  is  determined  as 

2 

In  Fig.l  are  shown,  for  example,  functions  \[f--  in 

dependence  on  the  ratio  of  electron  energy  E  to  the 
critical  energy  =  E|^  for  2D  flat  waveguide  with 
asymmetrical  rectangular  step  (see  inset  in  Fig.l).. 
There  are  seen  several  regularities. 

1.  Probability  of  the  electron  reflection  from  a  waveguide 

step  |\|/j.  I  ,  which  is  near  one  at  electron  energy  E  close  to 
the  critical  energy  E^,  decreases  sharply  with  increase  of 

energy:  |v|/j |^  <  1 0“^,  if  >  1.4 . 

2.  After  a  symmetrical  step  only  odd  subbands  are 
populated.  The  population  inversion  between  these 
subbands  is  absent  at  any  value  of  electron  energy.  At  the 
same  time  population  inversion  exists  between 
neighboring  (odd  and  even)  subbands. 

3.  The  effectiveness  of  the  electron  transfer  to  the  higher 
subbands  is  much  greater  for  an  asymmetrical  step  than 
for  symmetrical  one.  The  electron  population  at  higher 
subbands  is  the  larger  the  larger  is  a  step  height  and  an 
electron  energy. 

4.  A  ratio  of  the  electron  concentration  in  the  higher 
subbands  to  that  in  the  lowest  one  increases  with  electron 
energy.  The  population  inversion  occurs  begiiming  from  a 
certain  threshold  values  of  the  electron  energy.  These 
threshold  energies  are  the  lower  the  larger  is  a  step 
height. 

5.  If  an  asymmetrical  step  is  large  enough,  the  electron 
concentration  in  the  second  subband  goes  through  a 
maximum  at  the  electron  energy,  at  which  the  occupation 
of  the  third  subband  begins.  Degree  of  a  population 
inversion  can  be  greater  than  2  (Fig.  1). 

Population  inversion  can  also  take  place  at  electron 
transport  along  a  quantum  channel  with  constant 
crossection  if  a  profile  of  the  transverse  potential  V(x,y) 
changes  abruptly  in  a  certain  plain. 

A  possibility  of  a  population  inversion  formation  at 
longitudinal  electron  transport  opens  a  way  to  design 
lateral  lasers  as  well  as  single  channel  interference 
transistors  and  other  devices  based  on  the  interference  of 
electron  waves  traveling  along  EWs. 


4.  SINGLE  CHANNEL  INTERFERENCE 
TRANSISTORS  AND  ELECTRON  FILTERS. 

The  ballistic  interference  transistors  were  proposed  about 
10  years  ago  by  S.  Data  et.  al.  [3].  Their  idea  was  to  use 
two  separated  quantum  channels  along  which  two 
electron  waves  propagate.  This  waves  interfere  at  the 
common  output  of  the  channels.  One  can  change  an 
output  electron  current  by  the  help  of  a  transverse 
magnetic  or  electric  fields  that  change  an  interference 
pattern  (so  called  Aaronov-Bohm  effect).  But,  as  far  as  I 
know,  this  elegant  idea  didn't  realized  up  to  day  because 
mainly  of  technological  difficulties. 

Meanwhile,  as  we  have  seen,  two  or  more  independent 
electron  waves  can  be  excited  in  a  single  quantum 
chaimel.  This  wave  modes  correspond  to  electrons,  which 
move  ballistically  along  a  channel  being  in  the  different 
energy  subbands.  These  electrons  have  different 
longitudinal  velocities  and  the  corresponding  electron 
waves  have  different  phase  velocities.  Their  interference 
gives  electron  wave  pattern,  which  depends  on  the  relative 
wave  amplitudes  and  phases.  There  was  shown  above, 
that  the  relative  amplitudes  of  the  electron  waves,  excited 
at  the  abrupt  junction  of  two  different  electron  waveguide 
sections,  as  well  as  their  phase  velocities  can  be  changed 
by  the  changing  the  junction  dimensions  or  a  transverse 
electric  field  in  one  of  these  sections.  This  mechanism  can 
be  used  for  implementation  of  the  single  channel 
transistors,  electron  energy  filters  and  other  devices. 

For  better  understanding  of  the  waves  interference 
picture  we  have  analyzed  the  simple  model  which  could 
be  evaluated  exactly  by  the  help  of  algorithms  that  had 
been  found  earlier  for  electromagnetic  waveguides 
simulations  [4].  We  have  considered  a  2D  EW  with 
asymmetrical  quasipotential  quantum  well  formed  by  a 
waveguide  section  Ilof  length  L  and  width  W  larger  then 
width  w  of  the  input  and  output  waveguide  sections  (see 
in  set  in  Fig.  2).  If  the  conditions 

E„-E,<E<4E.;’%<(”  +  >^  (9) 

are  fulfilled  only  one  wave  mode  can  propagate  along  a 
regular  EW,  but  in  enlarged  section  may  propagate  n 
modes  with  different  propagation  constants  (5)There  can 
exist  different  interference  effects  in  the  section  II, 
aroused  at  repeated  reflections  and  wave  mode 
ransformations  at  the  input  and  output  width  steps.  At 
each  reflection  one  wave  mode  excites  (n  +  1)  propagating 
waves  (excepting  infinite  number  of  nonpropagating 
waves)  -  one  wave  which  propagates  in  waveguide  I,  the 
second  is  the  reflected  wave  of  the  same  wave  mode  and 
besides  -  (n-1)  reflected  waves  corresponding  to  other 
wave  modes.  Therefore  an  interference  pattern  becomes 
rather  intricate  and  can  not  be  described  analytically. 
Some  results  of  numerical  calculations  accomplished  on 
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the  bases  of  the  electromagnetic  -  electron  wave  analogy 
are  presented  on  Fig.  2,  where  transmission  coefficient  is 
given  in  dependence  on  the  relative  electron  energy  for 
two  values  of  section  II  width. 


|T|* 


Fig  2  Dependence  of  electron  transmission  coefficient  on 
relative  electron  energy:  L/w=2.2;  a)  W/w=2.0,  b) 
W/w=2.2. 

According  to  (9)  up  to  four  subbands  (wave  modes)  can 
exist  in  the  section  II  at  2  <  W  /w  <2,5,  Distinctive 
peculiarity  of  this  picture  is  the  presence  of  more  or  less 
sharp  resonances  which  number  increases  with  the 
increasing  of  the  propagating  wave  modes  number.  The 
extremal  values  of  the  transmission  coefficient  are,  as  a 
rule,  near  to  one  or  to  null.  Resonance  electron  energies 
decrease  gradually  with  the  increasing  of  the  EW  width.  It 
is  difficult  to  identify  in  general  case  all  extremums  with 
the  interference  of  the  determined  wave  modes.  However, 
two  extremal  electron  energies  which  are  shown  at  Fig.  2 
by  the  arrows  ,are  apparently  due  to  interference  of  two 
first  forward  wave  modes.  As  is  known,  two  plain  waves 
propagating  in  the  one  and  the  same  direction  with  equal 
amplitudes  and  different  propagation  constants 
zz  +  5y  conceal  at  distance  L  determined  by  the 
condition:  cosSyL  =  -1.  This  condition  gives  a  series  of 
single  extremal  energy  values  where  transmission 
coefficient  goes  to  null.  But  in  considered  case  where  two 
essentially  non  plain  electron  waves  interfere  at  the 
junction  of  two  EWs,  the  wave  fadings  could  take  place  at 
pairs  of  two  neighboring  electron  energies  located  near 
the  same  extremal  energy  values  as  it  really  takes  place  in 
Fig.2.  Fig.2  testifies  a  principal  possibility  of  the  use  of 
electron  wave  interference  in  a  single  quantum  channel 


for  the  effective  control  of  electron  transfer  through  this 
channel  by  changing  a  quasipotential  well  width.  The 
similar  result  gives  a  change  of  the  transverse  potential 
distribution  in  a  short  section  of  a  quantum  channel  with 
constant  crossection.  Considered  effects  could  be  verified 
on  the  basis  of  a  HEMT-like  heterostructures  with  lateral 
hot  electron  injection  and  nanometer  gate  length  [5].  As 
Fig.2  shows,  on  this  way  one  can  implement  transistors 
with  positive  as  well  as  with  negative  transconductance 
which  value  remains  high  enough  at  average  electron 
injection  energies  about  0.2  -  0.3  eV  dispersed  over  0.03  - 
0.05  eV.  From  the  other  side,  the  same  effects  may  be 
applied  for  design  of  electrically  controllable  band  pass 
and  rejecting  electron  energy  filters.  A  possibility  of  the 
frequency  band  tuning  is  important  advantage  of  these 
filters  in  comparison  with  superlattice  ones. 

5.  CONCLUSION 

It  is  shown  that  there  are  at  least  two  different  way  for 
effective  control  of  the  ballistic  electron  transport  in 
quantum  sized  heterostructures:  a  creation  of  the 
quasipotential  barriers  by  a  local  deformation  of  the 
quantum  channel  (electron  waveguide)  crossection,  as 
well  as  by  its  inner  transverse  electric  field  change,  and 
the  use  of  interference  between  several  electron  wave 
modes  excited  in  a  quantum  channel  at  step-like 
discontinuities. 

It  is  possible  to  make  electron  population  inversion 
between  electron  energy  subbands  at  the  longitudinal 
electron  transport  along  essentially  non  regular  quantum 
channels. 

It  is  possible  to  design  single  chaimel  interference 
transistors  and  electrically  tunable  electron  energy  filters. 
These  and  similar  devices  should  be  suitable  for  the 
theraherz  frequency  band  mastering. 
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ABSTRACT 

A  detailed  analysis  of  monolithic  surface 
acoustic  wave  (SAW)  Schottky  diode 
memory  correlator  is  presented.  Based  on 
Green's  function  concept,  an  efficient 
formalism  for  the  calculation  of  the  spatial 
charge  density  distribution  of  the  diode  is 
obtained.  An  equivalent  circuit  model  has 
been  proposed  for  determining  the  efifective 
potential  induced  in  the  diode  by  the  electric 
field  associated  with  tlie  SAW.  The  attenu¬ 
ation  as  well  as  the  device  efficiency  arc  also 
calculated.The  calculated  results  are  verified 
by  comparison  with  measured  results. 

1.  INTRODUCTION 

The  concept  of  an  acoustoelectric  memory 
was  first  implemented  with  gap  coupled  device 
utilizing  storage  in  surface  states  at  the  Si02/Si 
interface  [1].  The  surface  states  are  difficult  to 

control  and  they  were  soon  replaced  by  Schottky 
barrier  diode  [2]  and  p-n  diode  [3-4].  Schottky 

diodes  are  majority  carrier  devices  and  exhib- 
itvery  fast  write  time,  suitable  for  wideband 
applications.  The  Green's  function  [5]  technique 
has  been  used  for  studying  the  Schottky  diode 
gap-coupled  as  well  as  strip  coupled  devices  [6]. 


In  this  paper  a  detailed  analysis  of  monolithic 

surface  acoustic  wave  (SAW)  Schottky  diode 
memory  correlator  is  presented.  The  Schottky 
diode  charges  are  estimated  through  the  solution 
ofPossion's  equation  by  using  Green's  function. 
The  depletion  layer  capacitance  is  determined 
by  usiiig  the  estimated  charges.  An  equivalent 
circuit  model  has  been  used  for  determining  the 
effective  potential  induced  in  the  diode  from 

the  SAW  electric  field.  The  device  efficiency 
as  well  as  the  output  power  delivered  to  50 
ohm  load  are  calculated  by  solving  the  equiv¬ 
alent  circuit.  Good  agreement  between  the 
obtained  results  and  published  data  is  achieved. 

2.  THEORY 

The  boundary  conditions  for  the  given  configu¬ 
ration  of  Fig.  1  are  assumed  to  be; 

Magnetic  wall:  5  d>/6x  =  0  for  x  =  +  P/2 
Elech'ic  wall:  0  =  0  at  the  depletion  layer 

boundary  .  The  Green's  function  which  satisfy 
the  previous  boundary  condition  has  been  found 
by  using  Fourier  series  as: 

G(x/xo,y/yo)  =  -(1/2K)  Logfl-2  cosh  B(x-xo). 

Exp(-B(y-yo))+Exp(-2B(y-yo))]  +(j/2K) 
Log[l-2  sinh  B(x-xo).  Exp(-B(y-yo)) 
+Exp(-2B(y-yo))]  .  (1) 
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and  the  surface  impedance  Zs  can  be  estimated  as 

Z.=jKi  V/SL. . (5) 

on 

The  proposed  equivalent  circuit  is  shown  in 
Fig.2  ;where  Rj  is  load  resistance,  R^.  is  the 
sheet  resistance  of  the  metal-contact  and  Zg  is  the 
surface  impedance  as  calculated  in  eq.  5 
The  Vo.c  of  fig.  2  is  given  by: 

Fig.l  Periodic  structure. 

Vo.c.=VVr..Vs . (6) 

The  Green’s  function  has  been  used  for  solving  Where  Vj-^and  are  given  by  [7]. 

the  Posson's  equation  in  the  following  plain  _ _ 

D=[-p/2,p/2],[0,  y(x)]  Vr.=.V  2.(Pr./w).k%.eo.(l+ep) 

and  the  following  espression  is  obtained 


Vs.=.V  2.(Ps./w).k%.eo.(l+ep) 

w  is  the  beamwidth  of  the  acoustic  signal 

k  isthe  wavenumber 

Pj.  is  the  power  of  reading  signal 

Pg  is  the  power  of  writting  signal 

The  output  power  dilvered  to  the  load  resistance 
is  calculated  by  solving  the  circuit  of  fig.  2. 

The  efficiency  as  well  as  the  attenuation  due  to 
the  presence  of  Zs  have  been  also  calculated. 


By  solving  the  previous  equation  the  quasi 
static  charge  has  been  determined.  Also,  eq.  2 

can  be  write  at  the  boundary  of  the  depletion 
layer  in  the  foim: 

SiX.  =1 

5n  £ 


I  p(xo, 
jD’' 


5  G(x/xo,y/yo) 
5  n 


dxo  dyo+-^. . (4) 

5n 


Fig.2  Equivalent  circuit. 


538 


3.  RESULTS  AND  DISCUSSION 

The  relation  between  real  and  imaginary  part  of 
Zs  is  illustrated  in  fig.  3.  The  resistive  part  is 

increased  to  reach  the  maximum  value  at  5UT 
for  doping  concentration  of  1.  E17  cm*^  and  at  10 
UT  for  doping  concentraion  of  2.5  E17  cm'^. 


The  variation  of  attenuation  with  peak  value 
of  storage  impulse  is  illustrated  in  fig.  4.  The 
attenuation  can  be  minimized  by  varying  the  diode 
biase  potential  with  doping  density.  It  is  greatly 
effected  by  varying  the  peak  of  the  storage  impu¬ 
lse  in  range  less  than  15UT  in  case  of  ND=1.E17 
cm*3  and  in  range  less  than  18UT  in  case  of 
ND=  2.5E17  cm'3-  Over  the  previous  range  the 
storage  impulse  has  littel  effect  on  the  attenuation. 


Fig.  5  indicate  the  relation  between  the  effici¬ 
ency  and  peak  value  of  the  storage  impulse. 

This  relation  is  plotted  at  doping  density  of  ND= 
1.6E17  cm-3  as  proposed  by  Wager  [8].  It  is  can 

be  noticed  the  value  of  the  correlation  response 
is  greatly  effected  by  the  lower  values  of  storage 

impulse  while  higher  values  has  littel  effects. 
The  figure  includes  also  the  experimental  data  of 
Weger  [8].  Our  results  gives  good  agremment  at 
lower  values  of  storage  impulse  while  at  values 
greater  than  lOOUT  there  is  divargence  between  our 
results  and  experimentals. 


V/UT 

Fig.  5  Retition  between  the  cocreliUon  reiponse  and  diode  lurface  potential 


4.  CONCLUSION 

A  new  technique  for  calculating  the  output 
power  of  the  monolithic  SAW  Schottky  diode 
memory  correlator  has  been  presented,  in  this 
technique,  an  equivalent  circuit  model  is  proposed . 
The  surface  impedance  has  been  calculated  by 
using  Green's  function  technique  through  new 
formula.  It  is  found  that  the  output  power  as  well 
as  the  attenuation  are  greatly  effected  by  small 
values  of  storage  impulse  while  higher  values 
has  littel  effect.  The  predication  of  this  technique 
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given  good  results  in  compai'ison  with  published 

data  specially  at  lower  values  of  storage  impulse. 
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ABSTRACT 

In  this  paper,  a  new  integratable  oscillator  is 
presented  through  the  use  of  a  new  current-tunable  R- 
L  filter.  The  oscillation  fi-equency  is  current-tunable 
in  excess  of  three  orders  of  magnitude.  Existing 
methods  for  on-chip  inductors  (L)  are  encouraged  in 
implementing  for  this  oscillator  so  as  to  enable  very 
high  oscillation  fi-equencies.  The  paper  suggests  an 
alternative  to  integratable  oscillalors  of  "inductorless" 
types. 

1.  INTRODUCTION 

High-fi-equency  discrete-component  oscillators  use 
inductors  (L)  routinely.  As  inductors  were  not 
previously  possible  to  &bricate  in  monolithic  form 
[1],  integratable  oscillators  were  commonly  of  " 
inductorless"  types  [2,3].  Recently,  methods  for 
implementing  inductors  on-chip  have  been  reported 
[4-6]  and  integratable  high-fi'equency  oscillators 
using  inductors  on-chip  are  increasingly  of  interest 
[1,5,7].  In  this  paper,  a  new  integratable  oscillator  is 
presented  through  the  use  of  a  new  current-tunable  R- 
L  filter  (where  R  represents  resistance).  The  existing 
methods  [4-6]  can  also  be  suggested  for 
implementing  the  on-chip  inductors  (L)  of  this 
oscillator. 


The  resonant  fi-equency  of  the  filter  is  tunable  using  a 
current-tunable  resistance  R  =  2re,  where  re  is  die 
small-signal  dynamic  resistance  of  a  forward-biased 
base-emitter  junction  of  a  bipolar  transistor.  Such  a 
filter  forms  a  counterpart  of  the  current-tunable  RC 
filter  previously  described  in  Pookaiyaudom  et  al  [2]. 
T  le  pair  of  QIO  and  Qll  forms  a  differential 
amplifier  that  provides  an  appropriate  gain  for 
cscillation  to  be  initiated  and  sustained.  The  matched 
npn  transistors  Q5  to  Q9  form  the  Wilson  current 
mirrors  and  convert  the  frequency  setting  current  If  to 
bias  transistors  Ql,  Q2,  Q3  and  Q4.  The  loop-gain 
setting  current  Ig  biases  transistors  QIO  and  Qll. 
The  output  waveforms  of  the  oscillator  are  taken 
across  the  collectors  of  QIO  and  Qll,  and  can  be 
amplified  to  appropriate  levels  for  applications. 

3.  IDEAL  ANALYSIS 

Referring  to  Figure  1,  assuming  that  the  bases  of  Q3 
and  Q4  are  temporarily  disconnected  with  the 
collectors  of  QIO  and  Ql  1.  Let  the  small-signal  input 
voltage  across  the  bases  of  Q3  and  Q4  be  v^,  and  the 
resulting  small-signal  output  voltage  across  the 
collectors  of  QIO  and  Qll  be  Vo.  One  can  show  that 
the  transfer  fimction  Vo/Vin  can  be  expressed  as 


2.  CIRCUIT  DESCRIPTIONS 

Figure  1  shows  the  basic  circuit  configuration  of  the 
integratable  ciurent-tunable  R-L  oscillator.  The  four 
matched  npn  transistors  Ql  to  Q4  and  the  two 
matched  inductors  LI  and  L2  of  value  L  form  a  new 
R-L  bandpass  filter,  where  the  input  voltage  is 
apphed  to  the  bases  of  Q3  and  Q4,  and  the  output 
voltage  is  taken  across  the  emitters  of  Ql  and  Q2. 


where  r  =  — =— 7  ^  >  =—  f— 1  (2) 

R  2(r.w)  2''''-'^ 

G  is  equal  to  R3/reio,ii ,  reio.ii  is  r*  of  QIO  or  Qll, 
re3.4  is  re  of  Q3  or  Q4,  R3  is  the  load  resistance  of  the 
differential  amplifier,  and  Vt  is  the  usual  thermal 
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voltage  accociated  with  a  pn  junction  and  is 
approximately  equal  to  25  mV  at  room  temperature. 


By  reconnecting  the  feedback  loop,  the  steady-state 
sinusoidal  oscillations  can  be  sustained  if  the  ratio 
Vo/vin  shown  in  (1)  becomes  unity.  Therefore,  one 
obtains 


R3 


fi-1 

UvJ 


=  2 


(3) 


1 


= 


r 


ifV 

LUf  J 


(4) 


where  (3)  gives  the  required  value  of  G  to  sustain 
steady-state  sinusoidal  oscillation,  and  (4)  provides 
the  angular  frequency  of  oscillation. 

4.  SIMULATEli  RESULTS 


of  the  output  waveforms  versus  If  at  Ig  =  4.4  mA  and 
R3  =  25CI.  It  can  be  seen  from  Figure  2  that  the 
amplitude  of  the  oscillation  is  well  maintained  at 
constant  values  for  If  >100  |xA.  Figure  3  depicts 
comparisons,  between  ideal  and  simulation  cases,  of 
plots  of  oscillation  frequency  versus  the  bias  current 
If.  It  can  be  seen  that  oscillation  frequencies  obtained 
from  both  cases  are  almost  similar  and  linearly 
tunable,  through  If,  in  excess  of  three  orders  of 
magnitude. 

The  spectrum  of  the  output  waveform  at  60  KHz,  for 
example,  is  shown  in  Figure  4.  It  can  be  seen  that  the 
second  and  third  harmonics  are  approximately  at  48 
and  38  dB  down,  respectively,  from  the  fundamental 
frequency.  Figure  5  depicts  comparisons,  between 
iui'al  and  simulation  cases,  of  plots  of  oscillation 
f  -quency  versus  the  inductance  value  L  at  If  =  1  mA. 

5.  DISCUSSION  AND  CONCLUSIONS 


The  performance  of  the  circuit  shown  in  Figure  1  has 
been  simulated  using  a  circuit  simulator  package 
ViewSpice  [8],  running  on  a  32-bit  personal 
computer.  The  npn  and  pnp  transistors  are  modelled 
by  QMPS2222A  and  QMPS3640,  whose  transition 
frequencies  fr  are  at  300  and  500  MHz,  respectively 
[8].  For  the  simulation  purposes,  the  value  L  is  equal 
to  159  uH.  Figure  2  illustrates  the  simulated  results 


A  new  current-tunable  R-L  Oscillator  has  been 
described.  Oscillation  frequencies  are  linearly  tunable 
in  excess  of  three  orders  of  magnitude.  By  using 
better  transistors  with  very  high  fj  (e.g.  in  the  region 
of  GHz)  and  much  smaller  value  of  L  (e.g.  in  the 
region  of  nH  using  integratable  bondwire  inductance 
[5]),  high  oscillation  frequencies  (e.g.  in  the  region  of 
GHz  using  eq.  4)  can  be  expected. 


Figure  1:  Schematic  diagram  of  the  integratable  current-tunable  R-L  oscillator. 
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Frequency  (HZ) 


Figure  2:  Oscillograms  at  various  frequencies  versus  bias  currents  If 


Current  If  (mA) 

Theoretical  -a-  Simulation 


Figure  3:  Plots  of  oscillation  frequencies  versus  bias  currents  If 
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Figure  5  :  Plots  of  oscillation  frequencies  versus  values  of  inductance 
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ABSTRACT 

Two  kinds  of  power  splitter 
have  been  designed.  The  designed  circuits 
have  been  simulated  by  using  an  efficient 
computer  aided  analysis  program.  The 
program  is  based  on  state  space  approach 
with  no  restriction  on  the  network 
topology.  The  two  circuits  have  been 
fabricated  by  using  microstrip  technology. 
The  practical  measurements  and  the 

simulated  results  are  with  good  agreements 

1-Introduction 

Power  dividers  are  used  to  divide  the 
input  power  into  a  number  of  smaller 
amounts  of  powers  f  or  exciting  the 
radiating  elements  in  any  array  antenna. 
They  are  also  used  in  multi-channel 
receivers  to  divide  the  input  power  into 
an  equal  values  of  powers  f  or  each 
channel.  The  power  dividers  can  be  used  in 
balanced  power  amplifiers  both  as  power 
dividers  and  power  combiners[l]. 

A  Wilkinson  coupler  offers  broadband 
width  and  equal  phase  characteristic  at 
each  of  its  output  ports.  Fig.  (1)  shows 
the  schematic  diagram  of  Wilkinson 
coupler[2].  Wilkinson  developed  an  N-way 
power  divider  that  would  split  the  input 
power  into  output  powers  at  N-ports  that 
would  also  provide  isolation  between  the 
output  ports(2]. 

The  objective  of  this  paper  is  to 
introduce  the  analytical  design  of  power 
splitters,  compensated  and  uncompensated 
power  splitters.  A  computer-aided  program 
based  on  state  space  approach  has  been 
developed  and  applied  f or  analysis  of  the 
designed  circuits  in  both  time  and 
f  requency  domains.  The  two  circuits  are 
practically  fabricated  by  using  microstrip 
technology.  The  measured  and  simulated 
results  are  compared. 


Zc 


Fig.  2.  A  compensated  3-dB  power  divider 

2-  Analytical  Design 

If  we  want  to  split  the  input 
power  PI  into  output  power  P2  and  P3  so 


that  P3  =  K'^PZand  also  maintain 
current  in  R,  Wilkinson!  1]  express 

zero 

„  Z2  Z3 
^  "  Zc  ’ 

(1) 

Z2  = 

■  K  -1  R  Zc  , 

(2) 

ZL2  = 

k"  R 

(3) 

2  ’ 

+1 

Z3  = 

A 

R  Zc 

(4) 

1  K 

and 

ZL3 

R 

■  .  1 

(5) 
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Uncompensated  3-dB  power  splitters 
A  3-dB  power  divider  is  obtained  by 

choosing  =1  and  Zl2  =  Zl3  =  Zc=  50  Q. 
From  equation  (3)  and  (5),  R  =  2  Zc  and 
from  equation  (2)  and  (4),  we  get'  Zz  =  Z3= 
^2^  Zc 


Compensated  power  splitters. 

A  modified  design  that  incorporates 
an  input  quarter- wave  transformer  is  shown 
in  Fig(2).  For  good  performance,  we  must 
choose[3]: 


Z 

C 


^  _  (1  ^ 

3  j^5/4  c 


(6) 

(7) 

(8) 


R  =  Z  (9) 

K  c 

The  performance  of  a  Wilkinson  power 
splitter  is  improved  by  the  modified 
design. 

3-  Simulation  Program 

A  general  computer  program  for  the 
analysis  of  microwave  circuits  has  been 
applied.  The  program  is  based  upon  the 
f  ormulation  of  the  state  and  output 
equations  using  ein  efficient  topological 
method  with  no  restriction  on  the 
topology.  The  state  and  output  equations 
for  any  network  can  be  represented  as  the 
f  ollowing. 


(3a)  The  state  and  output  equations 

It  has  been  shown  [12]  that  the  state 
variables  of  a  distributed  network  are 
the  reflected  parameters  at  all  the 
transmission  line  terminals  or  ports.  The 
state  variables  of  a  lumped/distributed 
network  are  the  voltages  on  all  the 
independent  capacitors,  the  currents  in 
all  the  independent  inductors  and  the 
reflected  parameters  (or  the  reflected 
voltages)  at  all  the  transmission-  line 
ports. 

The  state  equation  is  the  differential 
-  difference  equation  of  the  form: 


X  (t) 

1 

A 

_  u 

A 

_12 

Xj(t) 

+ 

B 

_i 

X  (t+T) 
2 

A  , 
2l 

A 

22 

X  (t) 

2 

B 

2 

u(t)  (10) 


Where  ^  (t)  and  (t)  are  the  state 

vectors  of  the  lumped  and  distributed 
elements  respectively  and  u(t)  is  the 
input  vector.  The  output  equation  is  given 
by 


y(t)= 


Cl 


C2 


■x^(t) 


+  E  u(t) 


(11) 


L  i  JLx2(t)J 

Equations  (10)  and  (11)  are  derived 
using  topological  methods  with  no 
restriction  on  the  network  topology.  Where 
A,B,C  and  E  are  the  matrices  of  the 
state  and  output  equation  of  the  systems. 

When  Laplace  transform  is  applied  to 
equations  (10)  and  (11)  we  obtain  the 
frequency-domain  equations. 


-1 


■X^(s)‘ 

"^12 

^21 

^’'2n"  ^22- 

2-1 


U(s)(12) 


Y(s,z)=- 


-1 


'si  -  A, , 
m  11 

1 - 

(M 

<r 

-  ■'^21 

^^2n  ■  '^22-1 

■^1 

+E  ■ 

.®2- 

(13) 


where  z=  exp  sT  ,I  is  the  i(ientity 
matrix,  m,n  are  the  number  of  lumped-state 
elements  and  transmission  lines, 
respectively. 

(3b)  Network  Analysis  Using  the  State 
Equations 

The  previous  results  are  used  to 
analyze  lumped/distributed  networks  in 
either  the  frequency  or  time  domain. 

(DFrequency  domain  analysis  :is  Based  on 
the  expansion  of  the  equation  (13)  to 
obtain  the  transf  er  f  unction  of  the 
f  orm. 
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3.5 


2n  m 

I 

1=0  j=0 


F(s,z)=" 


B..SJZ‘ 


1=0  j=0 


(14) 


This  expansion  is  achieved  by  using  a 
modified  faddeeva  algorithm[10]  for  the 
inversion  of  the  two  variable  matrix 
in  equation  (3). 

{2)Time  domain  analysis:  The  output  in 
the  time  domain  is  calculated  from 
equation  (11)  _  when  the  state  vector 


is  known.  The  state  vector 


is  first  (determined  from  the  solution  of 
(iO).  Equation  (10)  represents  two 
simultaneous  differential  and  difference 
equation  : 


X^(t)=  A^^X^(t)+A^^X2(t)+B^u(t)  (15) 

and 

X2(t)=A2lXi(t-T)+A22X2(t-T)+B2u(t-T)  (16) 

T 

The  state  vector  [X  (t)  i  X^  (T)]  is 
assumed  to  be  zero  at  t^O.The  vector  X2(t) 
is  calculated  by  substituting  the  values 
of  5^  (t-T),  ^  (t-T)  and  u(t-T)  in 

equation(6).  The  vector  X^  (t)  is  then 
calculated  by  solving  the  differential 
equation!  15)  numerically  in  each  time 
period  kt  <  t  <  (k+l)T,  where  k  = 

0,1,2 . 

The  state  and  output  vectors  have 
discontinues  at  t=KTand  the  initial  values 
of  the  state  vector  at  the  start  of 

each  time  period  are  obtained  from 
the  condition  that  the  integral  of  the 
state  vector  is  continuous.  The  insertion 
loss  for  the  compensated  power  splitter 
predicted  by  simulation  program  are  given 
in  Fig.(3). 


Fig. 3. Insertion  Loss  Versus  Frequency(Ghz) 
of  a  Compensated  type 
4-  Practical  Design 


The  practical  design  of  two  kinds  of 
power  splitters  is  performed  according  to 
the  following  considerations 

1- Choice  of  materials: 

Teflon  of  er  =  10.2  and  h  =  0.635  mm 
is  chosen  for  our  design.  The 
substrate  is  copper  clad  which  is 
widely  used  in  microwave  IC’S.  The 
dimension  of  the  substrate  is 
50.8  mm  x  50.8  mm  . 

2- Microstrip  synthesis  is  carried 

out  to  calculate  the  width  and 
length  of  the  designed  circuits 
according  to  the  design  formula 
obtained  in  [5].  The  layout  of  the 
circuits  are  prepared  keep  in  mind 
the  size  of  the  chip  resister  and  the 
mic  r  ostrip  discontinuity.  The  layout 
of  1.33  Ghz,  3-dB  power  divider  is 
shown  in  Fig.  (4) 

3- Fabrication  Procedures 

The  practical  design  is  carried  out 
using  thin  film  technique.  The  chip 
resister  R  =100  Q  is  connected  to 
each  circuit. 

4- Practical  measurements 

The  practical  measurements  are 
perf ormed  on  the  designed  power 
splitters  after  fixing  it  by  a 
substrate  holder.  Input  signal 
ranging  from  IGhz  to  1.5  Ghz  is 
applied  to  the  input  port  of  the 
power  divider.  The  output  is 
measured  at  the  two  outputs  by 
using  spectrum  analyzer.  The  IL 
versus  frequency  characteristic  is 
shown  in  Fig.  5.  for  the  compensated 
type. 
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Fig.4.  The  layout  of  (a)  Compensated  type 
'  (b)  Uncompensated  type 


:n««r«.ten  !«,  ^ 


.  taM'rtian  :ea>  si  v«rt  2  »* . 


Fig. 5. The  insertion  loss  versus  frequency 
(Ghz)  of  compensated  type 

5-Conclusion 

The  design  procedure  of  microstrip 
directional  couplers  are  presented.  The 
computer  aided  analysis  adopted  in  this 
paper  is  with  good  stability  and  suitable 
computation  time.  The  results  show  good 
agreements  between  predicted  simulated 
values  was  due  to  imperfection  associated 
with  the  connectors  used  and  the 
fabrication  tolerances. 
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ABSTRACT 

In  the  paper  the  problems  connected  with 
the  intensive  noise  oscillation  excitation  in  mil¬ 
limeter  wave  band  are  studied.  With  this  aim 
in  mind  the  0-type  Backward  Wave  Oscillators, 
operating  in  chaotic  oscillation  regime,  are  pro¬ 
posed  to  nse.  The  large  advantages  of  the  method 
suggested  are  the  high  efficiency,  universal  power 
spectrums  and  feasibility  of  electronic  tuning  of 
the  noise  oscillation  frequency.  Low  voltage  pack¬ 
aged  noise  sources,  operating  over  the  band  from 
30  to  SOGHzy  make  possible  output  power  of 
8  —  25W  and  4W  respectively  and  3  —  dB  power 
spectrum  width  more  than  300MHz. 

1.  INTRODUCTION 

Noise  oscillations  play  an  important  role  in 
modem  microwave  and  millimeter  environment. 
Noise  oscillations,  first  and  foremost,  are  the  op¬ 
timal  kind  of  signals  for  radars,  having  bigb  ac¬ 
curacy  of  measurement  both  distance  and  ve¬ 
locity  simultaneously.  Noise  oscillation  sources 
with  output  power  of  about  some  watts  are  used 
in  jamming  systems  to  confuse  radars  or  com¬ 
munications,  active  radiometry  and  in  measure¬ 
ments  as  a  source  of  imitate  signals.  This  raises 
the  problem  of  creation  of  the  compact  and  ef¬ 
fective  noise  oscillation  sources  with  power  spec¬ 
trum  width  and  output  power  needed.  The  prob¬ 
lem  is  especially  urgent  for  millimeter  wave  band. 

The  various  modulators  are  generally  used 
for  excitation  of  the  oscUlations  with  complicated 
power  spectrum.  The  modulators  act  on  the 
single-frequency  sources,  by  random  manner,  or 


on  the  passive  super  high  frequency  circuit  el¬ 
ements  (such  as  a  pin-structures  or  its  combi¬ 
nation  with  circulator).  The  noise  oscillation 
sources  are  known  too,  based  on  the  Travelling 
Wave  Tube  with  external  delay  feed  back,  or  on 
the  series  network  of  solid  noise  source  and  Trav¬ 
elling  Wave  Tube  Amplifier  .  The  methods  of 
the  noise  oscillations  excitations,  listed  here,  are 
of  complex  design  and  operating.  An  additional 
point  to  emphasize  is  that  the  wideband  modu¬ 
lator  realizing,  for  middle-power  sources,  is  indi¬ 
vidual  complex  problem  and  the  Travelling  Wave 
Tube  application  encounters  a  difficulties  in  mid¬ 
dle  and  higher  frequency  part  of  millimeter  wave 
band. 

In  the  papers  [1  —  3]  the  automodulation  pro¬ 
cesses  in  weak-resonant  0-type  Backward  Wave 
Oscillators  have  been  investigated  and  the  new 
method  of  the  millimeter  noise  oscillation  exci¬ 
tation  have  been  proposed.  The  method  sug¬ 
gested  uses  dynamical  properties  of  the  such  dis¬ 
tributed  autooscillation  system  as  BWO  and  al¬ 
lows  to  excite  with  high  efficiency  intensive  noise 
oscillations  with  power  spectrum  width  of  about 
i00  —  %00M Hz.  The  approach  makes  possible  to 
extend  spectrum  of  roles  of  the  well-known  oscil¬ 
lator.  BW O  has  the  universal  power  spectrums, 
depending  on  the  working  current  values. 

In  the  paper  the  peculiarities  of  the  design 
and  operation  of  millimeter  packaged  noise  os¬ 
cillation  sources  are  considered.  It  is  studied 
the  properties  of  the  low- volt  age  sources  {U  < 
4,5^11^)  with  two  type  of  the  slow- wave  struc¬ 
tures.  The  characteristics  of  the  noise  oscilla¬ 
tion  sources,  operating  in  over  the  band  from 
30  to  SOGHzy  are  presented. 


0-7803-2516-8/95  $4.00  ©  1995  IEEE 


549 


2.  SOURCE  DESIGN 

Two  types  of  the  slow-wave  structures,  such 
as  vaue-type  circuit  and  double  vane-  type  cir¬ 
cuit,  are  used,  in  accordance  with  characteris¬ 
tics  needed.  The  structures  pointed  have  rela¬ 
tively  large  geometrical  dimensions,  sufficiently 
easy  in  fabrication  and  allows  to  use  high  power 
electron  beams.  Double  vane-type  circuit  have  a 
higher  interaction  efficiency  of  the  non-uniform 
electric  field  with  electron  beams  of  finite  thick¬ 
ness  and  its  dispersion  is  lower  that  allows  to  de¬ 
vise  more  powerful  and  wide- band  devices.  Slow- 
wave  structures  of  the  noise  oscillation  sources 
have  a  greater  electrical  length  of  the  interac¬ 
tion  space  in  comparison  with  ones  of  the  single- 
frequency  sources.  The  vanes  height,  on  the  both 
terminals  of  the  structure,  is  progressively  de¬ 
creased  in  size  to  zero,  providing  the  surface  waves 
transformation  to  waveguide  ones.  The  needed 
mismatch  is  thereby  introduced  in  order  to  form 
the  weak-resonant  oscillatory  system  with  de¬ 
sired  quality-factor.  The  diode  electron  gun  with 
magnetic  limitation  on  the  electron  beam  is  used 
as  a  source  of  electrons.  Electron  gun  forms 
the  strip  beam  with  maximum  current  density 
of  about  iQAfcw? .  Electron  beam  is  introduced 
in  interaction  space  through  the  anode  opening, 
made  in  waveguide  matching  wedge.  Electron 
beam  is  focused  by  CoSa  system  with  piano- 
parallel  gap.  Inasmuch  as  the  length  of  magnetic 
gap  is  too  large,  the  weight  of  the  system  reaches 
to  lO^G. 

The  peculiarity  of  the  design  under  consid¬ 
eration  is  the  presence  of  the  additional  output 
waveguide  at  the  collector  end  of  noise  oscillation 
sources.  This  fact  allows  to  extend  the  branches 
of  source  application  and  to  decrease  the  axial 
dimensions  of  the  slow- wave  structure.  Output 
waveguides  are  on  a  perpendicular  line  to  device 
axis  and  are  matched  with  oscillatory  system  by 
using  of  the  waveguides  E  -  6eni.  Transition  to 
output  waveguides  of  standard  size  is  performed 
by  matching  wedges.  Collector  waveguide,  dur¬ 
ing  experiments,  is  usually  connected  to  matched 
load.  The  oscillations  are  radiated  from  the  gun 
output  waveguide,  that  is  principal  one  for  con¬ 
ventional  BWO, 


The  frequency  dependencies  V  SW  R  of  out¬ 
put  waveguides  of  millimeter  uoise  signal  source 
are  the  result  of  interfereuce  of  the  waves,  re- 
fiected  from  the  slow-wave  structure  terminals. 
Relatively  great  values  of  VSWR  are  compeu- 
sated  with  distributed  losses  in  the  system.  The 
losses  achieve  to  -lOdJB  in  higher  frequency  re¬ 
gion.  Electrodynamic  circuit  with  characteristics 
listed  cau  be  considered  as  distributed  oscilla¬ 
tory  system  with  spaced  spectrum  of  the  eigen 
frequencies  (minimal  values  of  V  SW  R  according 
to  eigen  ftequencies). 

3,  OPERATION  PECULIARITIES  AND 
CHARACTERISTICS  OP  THE  NOISE 
OSCILLATION  SOURCES 

Sources  under  consideration  have  the  impor¬ 
tant  special  feature.  The  power  spectrum  shape 
of  the  source  oscillations  is  determined  by  the 
values  of  working  current,  This  can  best  be 
done  ID  the  lovrer  frequency  part  of  millimeter 
band,  where  the  needed  match  quality  and  strin¬ 
gent  magnetic  focusing  of  electron  beam  is  more 
easy  provided.  Thus,  there  is  good  reason  to 
consider  operation  peculiarities  of  source  of  Ka 
band.  It  will  be  made  in  order  of  the  working 
current  increase, 

3.1.  Single-frequency  oscillations 

The  single-ftequency  operating  conditions  is 
well  known  regime  of  BWO.  The  maximum  val¬ 
ues  of  threshold  current  lih  of  source,  in  oper¬ 
ating  wave  band,  reaches  to  Ifh  mam  =  4, 5m4. 
At  the  working  current  values  >  Ith  mam  noise 
oscdlatioB  source  operates  as  a  source  of  single¬ 
frequency  oscillations  with  wideband  electronic 
tuning  of  the  frequency.  Output  power  is  lin¬ 
ear  built  up  with  the  enhancement  of  working 
current.  The  accelerating  voltage  dependencies 
of  the  output  power  and  oscillation  frequency 
of  sources  with  vane-type  and  double  vane-type 
slow-  wave  structures  have  essentially  different 
aspects.  Noise  oscillation  sources  based  on  vane- 
type  circuit  provides  output  power  to  2^0mW 
and  relative  electronic  tuning  range  of  about  10%. 
The  application  of  the  another  type  of  circuit 
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allows  to  raise  tbe  output  power  up  to  P  = 
600m at  the  lesser  working  current,  and  to  ex¬ 
tend  the  electronic  tuning  range  up  to  15  —  20%. 
Power  spectrum  of  the  single-frequency  oscilla¬ 
tions  radiated  is  similar  to  conventional  BWO 
one.  In  the  low  voltage  part  of  the  sources  work¬ 
ing  range  the  oscillations  of  the  first  upper  pass- 
band  of  slow-wave  structure  on  the  bulk  waves 
can  be  observed.  This  effect  leads  to  the  pro¬ 
duction  of  the  discontinuity  of  electronic  tuning 
even  with  good  matching. 

3.2.  Miiltifrequency  oscillations 

At  the  values  of  working  current  /«,  =  22mj4 
the  amplitude  of  oscillations  is  taking  a  periodic 
modulation  and  in  the  power  spectrum  the  ad¬ 
ditional  components  are  appeared.  The  modula¬ 
tion  is  observed  in  over  the  operating  frequency 
band  of  noise  oscillation  sources.  The  modula¬ 
tion  frequency  or  the  separation  between  neigh¬ 
bor  spectral  components  can  be  varied  in  the 
range  from  IdOMHz  to  300MHz  with  the  ac¬ 
celerating  voltage  U.  With  increasing  of  work¬ 
ing  current  the  power  spectrum  automodulation 
becomes  more  complicate  and  at  /«,  =  33mA 
presents  a  set  of  equidistant  spectral  components. 
The  power  of  the  mnltifrequency  oscillations  is 
larger  than  the  single-frequency  one.  At  the  = 
45mA  the  output  power  is  P  =  2,8W.  The  ac¬ 
celerating  voltage  dependence  of  P,  according  to 
mnltifrequency  oscillations,  has  essential  nonnni- 
form  character.  As  a  rule,  the  more  complicate 
power  spectrum  oscillations  have  a  larger  power. 
The  needed  shape  of  the  power  spectrum  can  be 
received  by  choosing  a  corresponding  values  of  U 
and  /«,. 

3.3.  Noise  oscillations 

Excitation  of  the  noise  oscillations  is  observed 
at  /w  >  50mA.  In  this  case  the  power  spec¬ 
trum  is  strongly  nonuniform  and  is  placed  near 
to  earlier  spectral  line.  At  the  1^,  >  110mA  the 
excitation  of  the  noise  oscillations  with  contin¬ 
uous  and  wide  spectrum  is  a  taking  place.  The 
3  —  dB  bandwidth  of  power  spectrum  reaches  to 
250  —  300MHz  at  the  maximum  values  of  work¬ 
ing  current  (/«,  =  180mA).  The  overall  power 


spectrum  bandwidth  in  this  situation  is  up  to 
iOOMHz. 

The  distinguishing  feature  of  the  sources  un¬ 
der  discussion  is  the  linear  increase  of  the  out¬ 
put  power  P  with  working  current  Interac¬ 
tion  efficiency  don’t  decrease  for  more  compli¬ 
cate  autoosciUation  regimes.  Larger  values  of  P 
are  accomplished  by  the  use  of  double  vane-  type 
circuit.  The  overall  efficiency  of  the  source  men¬ 
tioned  is  larger  too  and  reaches  to  5%. 

Noise  oscillation  source  based  on  the  BWO 
allows  to  realize  electronic  tuning  of  the  noise  os¬ 
cillation  frequency.  On  account  of  operating  pe¬ 
culiarities  [3],  the  electronic  tuning  is  not  contin¬ 
uous.  However,  the  band  of  the  working  frequen¬ 
cies  is  fuUy  covered  by  means  of  electronic  tuning 
as  far  as  voltage  intervals,  according  to  wideband 
noise  oscillation  absence,  are  small  (AP  <  40V’) 
enough.  Whereas  the  width  of  a  power  spectrum 
is  too  large.  Relative  band  width  of  electronic 
tuning  is  commonly  of  about  4  —  10%  and  can 
be  larger. 

In  order  to  provide  the  continuous  electronic 
tuning  of  noise  oscillation  frequency  it  is  neces¬ 
sary  to  improve  the  slow-wave  structure  match¬ 
ing  with  output  arrangements.  However,  the  es¬ 
sentially  great  values  of  1^,  are  needed,  in  this 
case,  for  excitation  of  the  oscillations  with  com¬ 
plicate  power  spectrum.  The  special-purpose  noise 
oscillation  sources  can  provide  the  narrowband 
noise  oscillations  with  power  spectrum  width  of 
about  lOMHz  and  frequency  electronic  tuning 
over  the  range  from  3A.2GHz  to  31,10 Hz  at  the 
working  current  =  90mA. 

3.4.  Operating  peculiarities  of  the  higher 
frequency  sources 

Packaged  constructions  of  the  sources  designed 
allow  to  excite  the  noise  oscillations  with  fre¬ 
quencies  up  to  QOGHz.  The  sources  have  a  larger 
electrical  length  of  the  vane-type  slow-wave  struc¬ 
tures.  For  excitation  of  the  noise  oscillations  it  is 
necessary  to  apply  the  electron  beam  with  high 
density,  that  causes  the  necessity  in  larger  values 
of  focusing  magnetic  field. 

Noise  oscillations  in  sources  of  that  range  are 
excited  at  the  isolated  frequencies  of  the  working 
band,  that  is  caused  by  complexity  of  the  good 


551 


matching  of  vane-type  circnit  with  output  ar¬ 
rangements.  Output  power  and  overall  elRciency 
of  the  higher  frequency  sources  are  smaller,  whe¬ 
reas  the  power  spectrum  width  can  be  more  than 
IGHz. 

4.  THE  MAIN  SOURCES 
CHARACTERISTICS 

At  present  time  the  packaged  noise  oscilla¬ 
tion  sources  have  been  designed,  operating  in  the 
needed  band  over  the  range  from  30  to  BQQHz. 
The  main  diaracteristics  of  sources  are  presented 
beUow.  The  characteristics  pointed  is  not  limit. 
There  ate  the  possibility  of  the  output  power 
significant  increasing  and  the  extending  of  the 
working  band  as  in  higher  and  lowest  frequency 
range. 


Technical  Specification 


Mean  level  of  output  power,  W 

4-  25 

Power  spectrum  density,  im,W/M Hz  2  —  10 

Spectrum  width,  MHz 
at  power  level  —MB 

300 

at  power  level  —  lOciS 

800 

Electronic  tuning,  % 

3-10 

Voltage-frequency  seneii. M HzjV 

2-15 

Cathode  voltage,  kV 

2  -  4.5 

Cathode  current,  A 

0.1  -0.18 

Consumption  of  coolant,  l/h 

100 

Mass  of  device,  kg 

9-12 

5.  CONCLUSION 

Application  of  the  dynamical  properties  of 
BWO  allows  to  create  the  efficient  and  compact 

noise  oscillation  sources  of  inillimetei  wave  band. 
The  sources  combine,  in  essence,  the  functions  of 
the  modulator  and  oscillation  source.  It  should 
be  particularly  emphasised,  that  the  source  is 
powered  by  dc  voltage  only.  Automodulation  be¬ 
haviors  extend  significantly  the  potentialities  of 
BWO,  A  single  source  can  provide,  in  depen¬ 
dence  on  the  working  current  values,  the  single- 
frequency,  multifreqnency  and  noise  oscillations. 
The  electronic  tuning  of  the  oscillations  radiated 
is  possible  in  the  all  operating  behaviors. 


The  noise  oscillation  source  characteristics  lis¬ 
ted  above  are  only  partially  shown  the  possibil¬ 
ities  of  the  method  under  consideration.  Its  ap¬ 
plication  allows  to  excite  the  noise  oscillations 
of  microwave  and  higher  frequency  range  of  nul- 
limetei  wave  baud.  Improvement  of  the  electron 
gun  and  magnet  makes  posiiible  the  decreasing  of 
the  oreiaii  dimensions  and  weight  of  the  sources, 
increasing  of  the  power  spectrum  bandwidth  and 
output  power. 

At  present  time  the  noise  sonices  designed 
have  been  applied  for  development  of  the  noise 
radars  /4,  5/  and  antodyne  measurement  sys¬ 
tems  and  sensors  /6,  7/  with  high  information 
capacitance. 
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ABSTRACT 

This  paper  discusses  about  the 
M.O.S.  Device  Simulator  Developed  by 
us,  which  simulates  the  device  char¬ 
acteristic  upto  submicron  channel 
length.  It  includes  long  channel.  Short 
channel,  subthreshold,  field  dependent 
mobility  and  punch  through  models. 

1.  INTRODUCTION: 

MOSSIM  includes  long  channel 
effect  based  on  Gradual  Channel  Ap¬ 
proximation,  Short  Channel  effect  based 
on  two  dimensional  effect  of  applied 
terminal  voltages,  mobility  degradation, 
and  punch  through  effect.  It  computes 
the  flat  band  voltage,  zero  biased 
threshold  implantation  adjustment  dose 
and  capacitances.  It  calculates  the 
charge  inside  the  bulk,  channel  thick¬ 
ness,  body  effect  and  channel  length 
modulation  parameters.  It  uses  MOS 
modelling  by  abaly  tical  approximations 
ref  (3)  and  (1)  for  surface  potential, 
subthreshsold  current  for  short  chan¬ 
nel  and  long  channel  devices. 
Subthreshsold  region  is  important  for 
low  voltage  and  power  application. 
While  calculating  current,  it  uses  field 
dependent  mobility  model  for  two  di¬ 
mensional  analysis  ref  (2)  and  ref.  (4). 
While  calculating  the  mobilsity,  it 
computes  both  lateral  and  longitudnal 
electric  fields,  and  then  it  computes  the 
electron  and  hole  mobility,  because 
mobility  effects  directly  effect  the  drains 
current,  MOSSIM  uses  the  two 
dimensisonal  model  to  calculate  the 
inversion  layer  mobility.  It  divides  the 
drain  characteristic  in  two  different 
regions,  these  regions  are  linear  and 
saturation  regions  in  1-V  characteristic 
of  MOS  transistor.  We  have  considered 


strong,  moderate  and  weak  inversion 
condition,  because  analog  devices  work 
in  weak  and  moderate  inversion.  We 
have  also  included  punch  through  ef¬ 
fect,  because  for  short  channel  devices 
punch  through  drain  current  will  be 
dominated  by  space-charge  limited 
current.  Provision  has  been  also  added 
for  interpolation  of  field  independent 
mobility,  which  is  based  on  substrate 
doping,  and  it  can  be  used  for  drain 
current  calculation.  Subthreshsold 
conduction  occurs  in  weak  inversion 
condition,  and  we  can  find  very  small 
changes  in  subthreshold  current,  while 
increasing  the  drain  voltage. 

It  determines  the  Isong  or  short 
channel  behaviour  of  device  on  the  ba¬ 
sis  of  calculated  effective  and  minimum 
channel  lengths.  Junction  depth  can 
be  specified  either  by  user,  dr  the  pro¬ 
gram  calculates  it  on  the  basis  of  dop¬ 
ing.  The  user  has  to  specify  when  he 
wants  to  drive  the  impurity  deeper  us¬ 
ing  subsequent  thermal  processes.  It 
provides  the  choice  for  NPOLY,  PPOLY 
or  A1  gate  technology.  We  can  use  few 
of  the  calculated  MOS  Transistor  pa¬ 
rameters  as  an  input  to  the  circuit 
simulation,  e.g.  (Threshold  Voltage, 
overlap  and  inter  node  capacitances, 
channel  length  modulation  and  body 
effect  paramaeters  for  the  same  chan¬ 
nel  length,  width  and  oxide  thickness  of 
MOS  Transistors).  We  have  simulated 
the  MOS  transistor's  characteristics 
from  15  micron  to  0.5  micron  channel 
length  devices.  Device  may  be  either  N 
channel  or  P  channel  transistor,  and 
oxide  thickness  used  can  be  upto  130 
Ang  for  submicron  channel  length  de¬ 
vices.  MOSSIM  calculaters  MOS  tran¬ 
sistor  paramaters  for  zero  biased  and 
bias  dependent  conditions. 
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MOSSIM  uses  following  expressions  in 
the  model  for  calculating  various 
parameters  of  MOS  transistor,  and  the 
currents  following  into  the  device. 

Threshold  voltage  calculation: 

sort 

(2*EPSi,‘^N,U,)/C,  (1) 

here  U3=V^  log  (N^n,)  (2) 

Ug  :  Fermal  potential 
Charge  calculation  in  inversion  layer: 

Q„  (y)  =  9.  (y)  -  9„  (Y)  0) 

Charge  calculation  inside  the  bulk; 

Q  (y)  =  -  N  W 

=  -  SQRT  (2qN^  *  EPSI^,  (V„+2  U„)  (4) 

Source  and  Drain  depletion  length  calcu¬ 


lation: 

y,  =  X,,S9RT{V3,+  2U,)  (5) 

yD  =  X,3SQRT(V,„  +  2U3+VJ  (6) 

where 

V3,=  0.0259  log  (7) 

Left  =  L  -  -  y,,  (8) 

Transconductance  calculation  : 

gn,  =  wmW/MC^(V^3-V.J/L  (9) 

where  m  =  1/2  at  low  dopings. 

Drain  current  calculation  : 


1ds=  (W/L)  (V,3-V^,)  V33-V33/2(10) 

Expression  for  subthreshold  current 
calculation 

L...h=Pn  (W/L)  •  (aC  y2  (  BETA^)* 

(n,/NJ2  *  (i-e  -beta-v)  ^  beta  *  2  U 

*  (BETA  *  2U3) (11) 
a  =  SQRT  (2)  *  EPSI/L„  (12) 

BETA  =  q/KT 
Capacitance  calculation  ; 

Coo=Cs<,=  (1/3)C„,WL  (13) 

Punch  Through  Voltage  Calculation: 

=  qN^{L-yJV2  *EPSI  (14) 

Substrate  Current  Calculation: 

ls..,  =  2Iasexp(-1.7E6/EJ  (15) 


Since  a  MOSFETT  fundamentally  operates 
under  the  isnfluence  of  two  external  forces, 
s  One  is  the  field  which  induces  carriers  in 
the  inversison  layer  and  constructs  a 
narrow  channel,  the  gate  field.*  The  other 
is  the  field  which  transports  carriers  from 
the  source  to  the  drain.  Mobility  which 
depends  on  both  fields,  is  assumed  to  be 
expressed  by  the  following  formulat  (2)  : 

P(N3,  E.  E^)  =p^f{N3,.E)  g(E^  .  (16) 
where  is  the  ismpurity  concentration, 
and  is  constant.  The  term  of  represent 
the  property  of  bulk  mobility.  The  bulk 
property  in  Si  is  approximated  by  the 
expression: 

f(N,,  E)=  1  +  (N^/fN^/S+N)  + 

(E/A)2  /  (E  /  A  +F)  +  (E  /B)2  1/2  (17) 

where 


SN(cm-3)A(V/cm) 

FB(V/cm) 

Electrons 

350  3*10's  3.5*103 

8.8  7.4  *  103 

Holes 

81  4*10'«  6.1*103 

1.6  2.5*  10* 

The  g  term  is  introduced  so  as  to  represent 
the  property  of  surface  mobility.  An  ap¬ 
proximate  formula,  which  simsulates  the 
property  can  be  expressed  as  follows: 

g  =  (l+kE^)  (18) 

The  parameter  k  is  determined  to  fit  the 
experimental  data.  It  is  expressed  as  : 

k  =  1.539  ♦10"®  cms/v,  for 
n-channel  device 
=  5.35  *10  ®  cm/v,  for 
p-channel  device  (19) 

Here,  f  and  g  are  normalized  to  unity  when 
Nj^,  E.  and  E^are  zero.  Therefore  is  low 
field  mobility  of  an  sintrinsic  crystal  and 
given  as 

p^  =  1600  cm^/v-s,  for  electrons 

=  600  cm^/V-s  for  holes  (20) 

Field  can  be  expressed  by  the  following 
expressions, 

E  =  (Vo  -  )  /  SQRT  (3T  (21) 

E  =(9b  +  9„/2)EPSI,  (22) 

Other  MOS  parameters  are  expressed  as: 
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GAMMA  =  SQRT  (2  *  EPSl^,  qN^)/C„^(23) 

LAMDa.=  (L/L^^  *  SQRT  (EPSI^.,  (\^pg  + 
2*PERMI)/  2qNJ  (24) 

FERMI  stands  for  Fermi  Potential 
EPSl  stands  for  Epsilon 
p  stands  for  Mobility 

3,  CONCLUSION 

The  program  has  been  develsoped  in 
Fortran-77,  and  it  can  be  used  osn  any 
personal  computer.  We  have  used  this 
software  to  simulate  the  various  char¬ 
acteristics  of  MOS  transistor  as  shown 
ins  Figs. 2-5.  The  parameters  of  the 
device  are  also  mentioned  in  the  figures. 
The  input  to  MOSSIM  simulator  is 
impurity  concentration  data  and  bias¬ 
ing  condition  of  the  transistor.  Fig.  2- 
4  shows  the  drain  characteristic  of 
device  at  2,3,4  and  5V  gate  voltages, 
while  Fig.  5  shows  the  substrate  current 
characteristic  of  NMOS  transistor  at 
various' gate  voltages. 
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5.  INPUT  PARAMETERS 

NA:  Substrate  Concentration 
(/cm^)  =  5.  Oel6 

ND  :  Source/Drain  Concentration 
(/cm®)  =  5.0el9 

VGSF=2.0V  VGSL  =  5.0  V  VGSTEP  =  1  .OV 
VDSF=0  VDSL  =  7.0V  VDSTEP  =  0. 1  V 
NSS  (/cm®)  =  l.Oell 

Tox  =  130  Ang  L  =  1.5  pm  W  =  30.0  pm 
Vth  =  1 .0  V;  Gate  is  NPOLY 
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ABSTRACT 

The  operational  amplifier  for  the  creation  of 
dependable  analog  systems  and  for  median 
filtration  is  described.  These  OpAmps  can  be  used 
in  parallel  connections  with  common  or  separate 
loads  and  Inputs  to  achieve  fault-tolerance,  noise 
reduction,  self-checkability  and  error  indication, 
output  power  and  accuracy  rise  of  the  analog 
systems. 

1.  INTRODUCTION 

Median  choice  ("voting")  of  signals  of  subsystems 
was  examined  in  [1,  2,  8]  to  reduce  the  influence 
of  interferences,  impaimients  and  unpredictable 
faults  of  components  in  analog  systems  (AS)  of 
measurement,  communication  and  control.  This 
algorithm  was  realized  in  the  1960’s  in  a  nonlinear 
feedback  structure  for  ship  and  plane  control 


Fig.  1.  Quorum-element 


systems  [2,3],  particularly,  using  voting  or 
"quorum-element"  (QuE)  (Fig.  1),  recently 
reinvented  for  the  image  and  speech  median 
filtering  in  [4,  5].  But  the  median  choice  in  the 
direct  signal  path  is  a  non-linear  operation  of 
majority  logic  and  therefore,  because  of  deviations 
of  signals  and  parameters  of  elements,  has  mutual 
drawbacks,  such  as  [6,  pp.  47-50]: 

•  it  does  not  allow  the  implementation  of  a 

redundant  open-loop  gain  or  integrator 

element; 

•  it  introduces  nonlinear  distortions  in  accurate 
systems; 

•  it  is  not  effective  in  power  system  parts  (power 
amplifiers  or  supplies); 

•  such  ASs  contain  common  non-redundant 
components  and  do  not  have  fault  diagnoses. 

2.  THE  REDUNDANT  ANALOG 
CONNECTION 

The  simplest  operation  to  decrease  input  signal 
deviation  is  an  arithmetical  or  weighted  mean  [8]. 
For  input  voltage  sources  V/,  this  operation  can  be 
implemented  in  parallel  connection  of  V,  with 
weghting  resistors  R/  (Fig.  2).  For  equal  resistors 
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R,  output  voltage  Vs  is: 

ys=H — 

Rf' 

i=l 

To  avoid  the  effect  of  large  deviations  caused  by 
system  faults,  resistors  R,  can  be  made  non-linear, 
increasing  with  the  rise  of  their  currents.  Diode 
bridges  (Fig.  1)  have  this  non-linear  resistance. 
Also,  it  is  nessesary  to  limit  possible  signal  levels 
in  the  circuit. 

To  unplement  a  dependable  structure  and  avoid 
mentioned  drawbacks,  authors  suggest  the 
redundant  system  structure  (Fig.  3)  and  an  example 
of  the  implementation  of  its  subsystem  in  the  fonn 
of  an  operational  amphfier  (Fig.  4). 

Every  subsystem  consists  of  the  input  amplifier  Ali 
with  the  output  current  limitation  (Ql,  Q2  and 
current  source  II)  and  the  output  amplifier  A2i  also 
with  a  current  limitation  (it  can  be  a  power  or 
temperature  limitation)  (Q11-Q18,  R),  and 

feedback  Zu,  Z2i.  In  addition,  every  opamp 
includes  feedback  on  the  deviation  of  output 
signals  from  the  median.  The  median  is  detected 
by  the  QuE,  consisting  of  the  diode  brige  (Fig.  3) 
or  transistor  current  limiter  Q5-Q8  and  13  (Fig.  4). 
In  [1]  shown  that  voltage  Vs  of  jointed  outputs  of 
QuE,  is  in  limits  (2): 

^  |med(vi.,.)|  (2) 


where  m  <  n  is  the  quantity  of  Vu,  which  are  in 
the  limitation  area,  e  ^2V/  =  2kT/q  --50  mV  for  the 
diode  QuE.  Amplifiers  As  (Q3,Q4,  13)  reduce  the 
difference  of  output  signals  of  subsystems,  so 


where  Gs  -  transconductance  of  As,  II  -  limitation 


level  of  Ali.  Deviations  of  Vu  from  Vs  are  caused 
by:  1)  process  deviations  of  subsystem's 

production;  2)  different  input  signals  Vin/;  3) 
different  subsystem’s  feedbacks  Zn,  Z2i;  4)  internal 
amplifier  faults.  Vu  can  feed  the  subsequent 
redundant  blocks  of  the  system  or  the  common 
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Fig.  3.  Structure  of  the  redundant  analog  system. 


load  of  redundant  connection  through  small 
resistors  R. 

The  subsystem  fault  causes  limitations  of 
subsequent  Ali  and  the  break  of  its  general 
feedback  (rise  of  the  Ali’s  input  voltage)  which 
can  be  detected  by  additional  circuitry  (not  shown). 
Every  subsystem  can  be  used  separately  in  non- 
redundant  systems. 

The  single  input  or  amphfier  fault  causes  a 
disconnection  of  general  feedback  of  this  amphfier. 
In  case  of  inner  fault,  this  amphfier  can  produce 
the  error  on  common  load,  which  should  be 
compensated  by  other  amplifiers.  This  is  the  only 
damage  of  single  or  multiplicative  (with 
appropriate  redundancy  coefficient)  faults,  while 
the  system  in  general  remains  in  a  working  state. 
With  all  active  elements  and  the  common  load,  this 
amphfier  connection  allows  the  summarization  of 
all  load  currents  and  increases  the  power  limitation 
in  accordance  with  the  parallel-joined  amphfier 
quantity. 

In  test  equipment,  such  connections  of  the  standard 
and  test  blocks  easily  disclose  defective  devices 
with  the  help  of  fault  diagnostic  circuits  without 
the  difficult  parameter  measurements. 


3.  REALIZATION 

As  a  general-use  element  for  dependable  redundant 
AS,  the  dual  OA  was  produced  on  an  analog 
bipolar  array  with  vertical  PNPs,  ft  n-p-n  =  600 
MHz,  ft  p_N-p  =  90  MHz.  Its  parameters  are  as 
follows: 

As  a  general-purpose  OA: 

•  offset  voltage  <  3  mV; 

•  maximum  output  current  10  mA; 

•  unity-gain  frequency  10  MHz; 

•  gain  90  dB; 

•  slew  rate  5*10^  V/s. 

In  parallel  redundant  connection: 

•  maximum  difference  of  output  voltage  is  <  50 
mV  in  input  error  conditions; 

•  difference  of  output  voltages  is  <  3  mV  with 
equal  input  signals;  input  error  indication  level 
is  400  mV. 

4.  CONCLUSION 

Our  research  resulted  in  the  general  nonlinear 
multiloop  AS  structure  witli  median  detection  in 
feedback  paths.  This  redundant  stmcture  can  be 
reahzed  from  elementary  levels  to  system  levels  of 
complexity  and  has  the  following  properties: 

•  limitation  of  input  and  output  signals  in  a  safe 
or  valid  operating  area; 

•  accuracy  increase  of  the  redundant  connection 
through  averaging  over  a  set  of  m  from  n 
variables  in  the  limited  desirable  area  of  the 
median; 

•  power  enlargement  in  redundant  power 
components  [6]; 

•  passivation  of  components  whose  variables 


have  been  extended  beyond  the  limited  area  of 
the  connection  median  (neutralization  of 
technological  defects,  additive  noises,  input 
and  supply  overloading,  multiplicative 
failures); 

•  repairment  or  replacement  of  the  failed  units  in 
an  active  state; 

•  diagnosis  of  the  impainnents,  exceeding  the 
tolerances; 

•  metrology  attestation,  appraisal  of  the  degree 
of  specification  degradation  by  deviation  from 
the  reference  subsystem  without  accurate 
measurements. 

These  features  of  redundant  ASs  and  their 
components  fully  correspond  with  the  dependability 
concept  in  [7]. 

Median  feedback  elements  can  be  included  in 
nearly  any  analog  IC  in  any  technology  for  the 
access  to  dependability  properties.  Multipliers, 
integrators,  oscillators,  instrumentation  amplifiers, 
etc.  can  be  created. 

The  most  promising  apphcation  is  the  creation  of 
power  fault-tolerant,  solid-state  amplifiers  with  the 
ability  of  unlimited  parallel  connection  for  the 
power  enlargement.  For  redundant  power 
amplifiers,  deviations  can  cause  an  increase  in 
local  power  dissipation  and  temperature  in 
channels.  By  measuring  signal  deviations  of  local 
temperature  sensors  from  the  mean,  the  system  can 
control  the  gain  of  the  channels  [6].  In  this  case, 
redundancy  does  not  lead  to  hardware  increment, 
but  creates  a  power  reserve. 
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The  main  principles  of  volume-metrical 
microvawc  monolithic  ICs  design  arc  discussed. 
The  essential  features  of  it  are  following: 

1.  Mcrowavc  transmission  lines  shold  be  chosen 
adequately  providing  the  best  total  performance 
for  each  functional  element  and/or  block. 

2,  It  is  important  to  involve  3  space  dimensions  in 
electromagnetic  signal  processing.  In  this  case 
interlayer  signal  transmission  can  be  obtained  by 
means  of  local  couplings.  Coupling  element  could 
be  used  also  as  processing  tool.  A  set  of 
technological  contradictions  appeared  when 
approaching  submillimeter  band  will  be  pointed 
out  below: 

a)  The  active  components  of  an  IG  should  be  very 
precisely  mounted, 

b)  It  is  desirable  that  the  transmission  line 
insertion  losses  be  low. 

c)  It  is  necessary  that  the  relative  layers  position 
be  precisely  adjusted. 

d)  Unfavourable  reactive  parameters  of  active 
components  are  to  be  compensated. 

These  technological  contradictions  are  resolved  by 
taking  advantage  of  the  monolithic  technique  for 
volume-metrical  ICs.  Technological  difficulties 
caused  by  semiconductor  substrate  size  being 
restricted  are  avoided  with  the  aid  of  monolithic 
ICs  and  distributed  passive  elements. 

The  volume-metrical  monolithic  ICs  technique 
enables: 

1.  to  combine  several  semiconductor  structures  for 
different  active  elements  (Shottky  diodes, 
transistores  etc.) 

2.  to  integrate  physical  processes  of  different 
nature  (microwave,  optical  etc.)  in  one  IC. 

The  emphasised  above  allows  the  usage  of  wide 
range  of  optoelectronic  devices  in  microwave  ICs. 
In  our  report  we  discuss  physical  and 
technological  design  principles  for  basic 
waveguide  and  resonant  elements.  Attention  was 
also  paid  to  main  functional  elements  for  volume- 
metrical  microwave  ICs  based  on  semi -isolating 
GaAs.  The  CAM/CAD  methods  are  included  too. 
Sample  elements  were  investigated  in  our 
experiment  and  the  results  are  shown  in  the 
report. 


The  comb  dielectric  waveguide  designed  for 
90-300  GHz  range  was  used  as  one  of  the  possible 
versions. 

It  was  manufactured  of  semi-isolating  GaAs 
(<100>  oriented)  by  means  of  liquid  chemical 
processing.  For  the  experiment  it  was  made  to  be 
ring-shaped.  Thus  the  low-radiation  resonator 
used  high-order  azimuthal  resonance. 

Q-600  was  achieved  at  140  Ghz.  The  special 
design  features  are  determined  taking  in  account 
that  the  technological  process  being  anizotropic. 
The  plasmochcmical  technology  has  also  great 
outlook. 

The  surface  wave  equalizing  structures  were  also 
developed  for  semiconductor  dielectric 
waveguiedes  with  lumb  active  elements.  The  basic 
waveguide  elements  and  the  metal-covered 
equali25crs  arc  placed  on  the  opposite  sides  of  the 
semiconductor  layer. 

The  Shottky  diode  structure  was  offered  with  its 
terminals  placed  on  the  surface  of  semi-isolating 
GaAs  sample  open  by  the  epitaxial  technique. 
Antisipated  parameters  of  the  barrier  are: 
capacity  ^10  fF,  leackage  resistance  ~50hm. 
Several  kinds  of  optoelectronic  modulators  and 
tuned  attenuators  based  on  monolithic  ICs  arc 
suggested  in  our  report.  The  fact  that  the. devices 
arc  distributed  ones  allows  to  avoid  undesirable 
elements  interaction.  The  modulator  performance 
(i.e.  spped,  modulation  depth  and  control  laser 
power)  were  analizaed  too.  Possible  realization 
methods  arc  given.  The  modulation  depth  ^^35  dB 
and  insertion  losses  -'O.SdB  were  observed  in  the 
experiment  with  optoelectronic  modulator 
designed  like  rejector  filter. 

The  design  methods  are  suggested  for  the  specific 
distributed  microwave  amplifier  based  on  the 
stable  negative  differential  conductance 
phenomenon  in  GaAs. 

In  our  report  we  concerned  the  problem  of  the  3D 
signal  processing  and  special  multi-layer 
monolithic  ICs  performing  that.  The  technical 
solutions  are  shown  taking  in  account  the  sjpecific 
requirements. 
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Abstract 

A  Micromotor  Control  Integrated  Circuit  (MCIC)  which 
enables  closed-loop  control  of  an  electrostatic  micromotor 
is  described.  The  MCIC  consists  of  low-noise  sense 
electronics  designed  to  detect  critical  rotor  angles  to  a 
resolution  of  0.5^"  (0.05  fF)  at  a  1  MHz  sampling  rate,  and 
control  logic  which  cycles  the  micromotor  drive  state 
during  continuous  rotation  to  maintain  maximum  torque, 
independent  of  loading. 

The  MCIC  was  fabricated  using  a  2-fxm,  n-well  CMOS 
process  and  all  circuits,  analog  and  digital,  function  as 
expected.  Preliminary  measurements  indicate  that  the 
input-referred  noise  of  the  sense  amplifier  is  very  close  to 
the  design  objective  of  32  |iV  rms. 

1.  Introduction 

The  ability  to  micromachine  silicon,  a  material  which  is 
well  understood  for  its  electronic  properties,  is  now  being 
employed  to  fabricate  miniature  mechanical  structures.  In 
combination  with  the  relatively  mature  silicon  circuit 
technology  and  chemical/physical  microsensors,  the 
development  of  microactuators  opens  a  pathway  to  closed- 
loop  monolithic  microelectromechanical  systems 
(MEMS). 

One  such  microactuator  is  the  12:8  salient- pole, 
electrostatic  micromotor  [1],  depicted  in  Figure  1. 
Typically,  the  rotor  diameter  is  100  |xm,  the  rotor-stator 
gap  size  is  1.5  [xm,  and  the  rotor  thickness  is  2.5  pm. 
Torque  is  generated  using  the  electrostatic  force  between 
the  rotor  and  three  stator  groups  which  are  driven  by  three- 
phase  excitation  using  a  typical  amplitude  of  50  V.  The 
step  response  of  these  micromotors  is  very  underdamped 
with  a  peak  overshoot  that  is  about  7.5''  for  a  step  size  of 
15^*  and  a  natural  frequency  of  approximately  10  kHz  [2]. 

To  date,  these  micromotors  have  been  operated  using 
open-loop  stator  excitation.  To  maintain  synchronization 
between  rotor  position  and  stator  phasing,  the  micromotor 
is  operated  in  a  stepped  fashion.  In  other  words,  each  time 
the  phase  of  the  excitation  is  advanced,  the  rotor  is 
allowed  to  settle  in  its  new  rest  orientation  before 
advancing  to  the  next  phase.  As  a  result,  measured 
rotational  speed  has  been  limited  to  a  few  kRPM.  On  the 
other  hand,  our  analysis  and  computer  simulations  using 
the  models  and  data  presented  in  [2]  show  that  speeds  in 


excess  of  100  kRPM  are  possible  given  optimum,  closed- 
loop  control. 


Figure  L  Schematic  top  view  of  a  12:8  salient-pole 
micromotor 


The  objective  of  this  work  is  to  develop  closed-loop 
control  electronics  which  maximize  the  torque/speed  of 
this  micromotor  during  continuous  rotation.  Our 
approach  does  not  require  modification  to  the  micromotor 
structure  and  employs  a  technology  which  lends  itself  to 
future  monolithic  integration  on  the  micromotor  die.  The 
sensing  technique  can  be  applied  to  a  variety  of  capacitive 
microsensors  and  microactuators. 

2.  Control  Principle 

The  torque  produced  by  each  of  three  drive  phases,  as  a 
function  of  rotor  angle  of  rotation,  is  sketched  in  Figure 
2a.  This  qualitative  sketch  is  based  solely  on  symmetry 
arguments,  but  corresponds  well  to  data  obtained  using  3D 
finite-element  simulation  [3].  To  achieve  maximum 
average  torque,  the  drive  electronics  must  switch  a  phase 
on  when  it  contributes  positive  torque  and  switch  it  off 
when  it  contributes  negative  torque,  as  shown  in  Figure 
2b.  Drive  torque  is  lost  if  the  phase  switching  is  not 
precise;  we  estimate  that  a  switching  resolution  of  0.5^ 
results  in  an  average  torque  loss  per  phase  switch  of  0.4%. 
To  achieve  this  sampling  resolution  for  rotational  speeds 
near  100  kRPM,  a  sampling  rate  of  1  MHz  is  used. 

The  angles  of  zero  torque,  i.e.,  the  critical  drive  angles, 
occur  when  the  rotor  is  either  fully  aligned  or  fully 
misaligned  with  a  given  stator  phase.  It  is  at  these  critical 
angles  that  the  drive  phase  must  switch.  The  critical  drive 
angle  for  a  given  phase  can  be  detected  by  differentially 
stimulating  the  capacitances  between  the  other  two  phases 
and  the  rotor.  This  differential  capacitance  is  zero  at  the 
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critical  drive  angle.  Therefore,  a  comparator  can  be  used 
to  determine  when  the  critical  angle  has  been  crossed,  and 
advance  the  drive  state  when  this  occurs.  For  clockwise 
rotation,  stimulus  must  be  applied  according  to  Figure  2b, 
which  leads  to  a  differential  capacitance  as  sketched  in 
Figure  2c. 


We  therefore  arrive  at  the  state  diagram  shown  in  Figure  3 
in  which  the  drive  state  is  represented  by  a  3-bit  state 
variable  «|)0  4*30  ♦60>  where  1  means  "on*  and  0  means 
"off."  There  are  six  valid  states  through  which  the  drive 
cycles  as  the  crossing  of  critical  angles  is  detected.  There 
are  two  invalid  states,  000  and  111,  which  are 
immediately  exited  should  they  occur  due  to  circuit 
initialization  or  error. 


Figure  3.  State  diagram  of  the  control  algorithm. 


Consider  the  start-up  case  in  which  the  drive  state  is  100, 
and  the  rotor  has  been  allowed  to  come  to  rest  at  0*. 
When  the  control  algorithm  is  activated,  the  and  4>30 
stators  will  be  stimulated  to  find  that  the  rotor  angle  of 
rotation  is  greater  than  -7.5°,  and  the  drive  state  will  be 
immediately  switched  to  101.  The  (ji60  stator  now 
provides  substantial  forward  torque  to  start  the  rotation  of 
the  rotor.  The  sense  circuitry  will  soon  determine  that  the 
critical  angle  of  0°  has  been  exceeded,  the  drive  state  will 
be  switched  to  001,  and  the  sense  circuitry  will  begin  to 
detect  +7.5°.  At  this  time,  the  drive  state  is  optimum 
given  the  rotor  angle,  and  the  rotor  quickly  gains  speed. 
Computer  simulations  predict  that  the  micromotor  will 
reach  more  than  90  %  of  its  steady-state  speed  by  the  time 
it  has  rotated  90°. 

3.  Sense  Amplifler 

Differential  stator-rotor  capacitance  is  sensed  using  the 
switched-capacitor  integrating  amplifier  shown  in  Figure 
4.  A  sense  stimulus  is  applied  by  superimposing  a  high- 
frequency  square  wave  on  the  stator  drive  voltages.  This 
configuration  holds  the  rotor  at  a  virtual  ground,  and  thus 
avoids  electrostatic  attraction  between  the  rotor  and  the 
substrate  erf  the  micromotor  which  can  cause  a  frictional 
force  which  inhibits  micromotor  operation. 


Reset 


Figure  4,  Simplified  sense  amplifier. 

Each  cycle  of  operation  consists  of  a  reset  phase  and  a 
sense  phase.  During  the  reset  phase,  the  feedback 
capacitor  is  discharged  by  the  reset  switch  and  the  output 
of  the  sense  amplifier  is  preset  near  ground,  depending  on 
its  input- referred  offset  voltage.  The  reset  switch  is  then 
opened  and  a  differential  stimulus  is  applied  to  two  of  the 
three  phases.  A  charge  proportional  to  the  difference  in 
capacitance  of  the  two  phases  is  driven  across  the  feedback 
capacitor  resulting  in  a  voltage  proportional  to  this 
differential  capacitance. 

Details  of  the  sense  amplifier  are  shown  in  Figure  5.  A 
nominal  timing  diagram  for  clocks  used  by  the  sense 
amplifier  and  comparator  is  presented  in  Figure  6.  As  per 
the  principle  of  correlated  double  sampling  [4],  the  circuit 
that  follows  the  initial  switched-capacitor  integrator 
detects  the  difference  of  the  voltage  before  and  after  the 
sense  stimulus  has  been  applied.  To  reduce  the  error  from 
charge  injection  from  the  NMOS  transistors  used  as  reset 
switches  around  the  various  amplifier  stages,  they  are 
turned  erff  in  a  skewed  manner  [5], 
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The  kT/C  noise  from  the  integrator  reset  switch  and  the 
flicker  ( 1/0  noise  of  the  integrator  amplifier  are  effectively 
eliminated  by  the  correlated  double  sampling.  Thus,  the 
measurement  resolution  of  stator-rotor  capacitance  is 
limited  by  the  thermal  noise  of  the  Integrator  amplifier. 
The  RMS  value  of  this  noise  referred  to  the  integrator 
Input  is  proportional  to  1)  the  spectral  density  of  the 
input-referred  thermal  noise  of  the  operational  amplifier 
and  2)  the  square-root  of  the  bandwidth  of  a  low-pass  filter 
which  imm^iately  follows  the  sense  amplifier. 

This  micromotor  has  a  differential  stator-rotor  capacitance 
of  approximately  0.05  fF  per  0.5°.  To  limit  torque  loss 
due  to  the  superimposed  sense  stimulus  to  about  13%,  its 
amplitude  will  be  limited  to  20%  of  the  high-voltage 
supply,  i.e.  10  V  for  a  50-V  supply,  corresponding  to  a 
signal  of  3.1  ke'  into  the  integrator,  i.e.  1  mV  at  the 
output  of  the  initial  integrator.  The  second-stage 
amplifier  has  a  closed-loop  gain  of  10  and  will  therefore 
produce  a  10-mV  signal  per  0.5°  of  rotation. 

To  permit  adequate  settling,  the  time  constant  of  the  first- 
order  low-pass  filter  formed  using  a  source  follower  with 
20-kQ  output  impedance  has  been  set  to  100  nsec  (1.6 
MHz),  The  op  amp  has  been  designed  for  an  input-rrferred 
noise  spectral  density  of  14  nV/^z  and  a  noise  comer 
frequency  of  100  kHz.  The  double-sampled  flicker  noise 


is  negligible,  and  the  thermal  noise  of  the  integrator, 
including  spectral  leakage  through  the  first-order  low-pass 
filter  and  double  sampling,  is  32  |j,V  rms  referred  to  the 
input  of  the  sense  amplifier,  i.e.  5.4  mV  when  referred  to 
the  output  of  the  second-stage  amplifier  assuming  a 
parasitic  input  shunt  capacitance  of  8  pF.  The  op  amp, 
therefore,  noise  corresponding  to  about  half  the  desir^ 
switching  resolution. 

4.  Architecture 

A  block  diagram  of  the  MCIC  with  key  system 
components  is  presented  in  Figure  7.  The  sense  amplifier 
receives  its  input  from  the  rotor  of  the  micromotor  via  the 
input  Senseln.  The  output  of  the  sense  amplifier  is 
compared  to  a  threshold  voltage,  nominally  0  V,  which 
can  be  adjusted  during  experimentation.  The  output  of  the 
comparator  (Trip)  is  used  by  the  control  logic  to  determine 
an  advancement  in  drive  state. 

The  control  loop  is  completed  by  the  Drive/Stimulus 
Logic,  which  determines  the  correct  difi^erential  stimulus, 
and  the  High-Voltage  Converters,  both  of  which  are  to  be 
implemented  using  off-chip  components.  The  high- 
voltage  drive  required  by  the  micromotors  exceed  the 
capability  of  the  availability  of  the  available  IC 
technology.  By  using  off-chip  Drive/Stimulus  logic,  the 
potentially  disruptive  switching  signals  output  by  the 
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MCIC  is  limited  to  the  3-bit  drive  state  and  a  stimulus 
synchronization  clock. 

The  user  simply  supplies  a  1-MHz  clock  input  which  sets 
the  sample  rate,  and  supplies  static  control  inputs  which 
set  the  direction  of  rotation  (Fwd)  and  toggle  between 
continuous-rotation  or  stepper  mode  (Rot).  The  (analog) 
sense  electronics  is  operated  at  ±  3.5V  while  the  (digital) 
control  logic  is  operated  between  0  and  5V;  level  shifting 
is  performed  by  latch-like  circuits  which  drive  the  analog 
clocks,  and  by  the  AC-coupling  within  the  comparator. 


TTveshoU  - X  *  11^ 

S«nseCW  AVOO  AVSS  ^ 

Figure  7.  MCIC  block  diagram  in  system  context, 

5.  Test  Results 


The  MCIC  has  been  fabricated  using  a  2-pm,  n-well, 
double-poly  CMOS  process.  A  photomicrograph  is 
present^  in  Figure  8.  Coarse  functionality  was  tested 
with  a  0.5-pF  capacitor  between  the  stimulus  clock  and 
Sensein  to  produce  a  strong  positive  charge  input  that 
caused  the  state  machine  to  advance  each  clock  cycle.  The 
desired  behavior  of  the  drive  state  bits  and  the  input  clock 
is  demonstrated  in  Figure  9. 


Figure  8,  MCIC  photomicrograph. 


Noise  performance  was  measured  by  varying  the  threshold 
input  in  1-mV  increments  with  the  synthetic  sense 
stimulus  disconnected.  For  each  setting  of  the  comparator 
threshold  input,  the  probability  of  a  high  Trip  was 
determined  by  measuring  the  average  (DC)  value  of  the 
output  from  a  CMOS  flip-flop  used  to  capture  the  MCIC 
Trip  output.  In  principle,  this  measurement  yields  the 
Cumulative  Distribution  Function  of  the  MCIC  noise  as 
referred  to  the  output  of  the  sense  amplifier.  The  results 
obtained  to  date  are  roughly  Gaussian,  but  exhibit  non¬ 
monotonicity,  an  artifact,  we  believe,  of  the  test  setup. 
The  difference  in  applied  threshold  voltage  for  0.3% 
tripping  and  99.7%  tripping,  i.e.  6a  assuming  a  Gaussian 
amplitude  distribution,  is  38  mV.  Thus,  rms  noise 
referred  to  the  output  of  the  sense  amplifier  is  about  6.3 
mV,  which  is  very  close  to  the  design  goal. 


Figure  9.  Measured  state  outputs  for  synthetic  stimulus. 
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Abstract 

In  this  paper,  a  novel  DCT  and  FTC  hybrid 
image  compression  algorithm  is  proposed,  which 
dramatically  improves  the  speed  of  FTC  coding  and 
JPEG’s  ability  of  preserving  image  details  at  high 
compression  ratios.  The  overall  subjective  quality  of 
the  whole  JPEG  decoded  image  is  also  heightened. 

1.  Introduction 

The  ever  widening  ad<^tion  of  information 
technology  and  improvements  to  the  resolutions  of 
image  output  devices  have  made  it  necessary  for 
image  compression  algorithms  to  be  able  to  preserve 
image  details  at  high  conqrression  ratios. 

The  current  still  image  compression  standard 
JPEG,  can  not  meet  this  demand.  Dividing  images 
into  blocks  which  are  compressed  separately,  JPEG 
can  only  remove  redundancies  within  each 
individual  block,  all  the  redundancies  rest  between 
blocks  remain.  And  because  its  compression  relies  on 
eliminating  high  frequency  components  of  each 
image  block,  as  the  compression  ratio  gets  high, 
JPEG  has  to  sacrifice  more  and  more  high  fiequency 
components  of  each  block  to  compensate  for  the 
redundancy  remain  between  blocks.  The  more  high 
fi:equency  components  it  throws  away,  the  more 
image  details  it  loses.  As  a  result,  the  quality  of  the 
decoded  image  drops  quickly  as  the  compression 
ratio  rises  for  JPEG. 

In  this  paper,  a  hybrid  image  compression 
algorithm  that  combines  DCT  based  JPEG  and  IFS 
(iterated  function  i^stems)  based  FTC  (fiactal 
transform  coding)  is  proposed,  which  has  a 
compression  quality  that  is  much  better  than  each 
individual  algorithm  can  achieve,  and  is  easy  to 
realize. 

2.  Compression  Algorithm 
2.1  Pixel  Classification 


To  overcome  the  inherent  shortcomings  of  JPEG 
while  maintaining  compatibility,  each  pixel  (ij)  (of 
grEty-levelg(/,y'))  in  the  original  image  is  first 
categorized  into  two  groups,  named  low- pixels 
and  high- pixels  respectively,  according  to  its 
fractal  dimension  ,  which  can  be  calculated 

approximately  as  ([5]); 

where 

j)  =  •e)]  (2) 

i7(x,y,0)  =  l(x,y,0)  =  g(x,  y) 

U(x,  y,  £r  + 1)  =  max^  U  {x,  y,  £')  + 1,  max  \u(m,n,€^  V 

'  '  I  '  ^ 

l{K,y,e+ 1)  =  rnin|z.(x,y,.e)  - 

In  our  experiment,  the  pixels  with  the  highest 
10%~15%of  are  treated  as  high- pixels. 

The  aim  of  pixel  classification  is  to  locate  the 
part  of  image  information  that  is  most  prone  to  be 
lost  by  JPEG  compression.  Pentland  showed  in  [3], 
low-</^  pixels  form  the  relatively  smooth  areas  of  an 

image,  while  the  high- areas  are  irregular.  The 
higher  the  d^ ,  the  more  irregular  the  areas  are.  As  a 
result,  high-ri^  pixels  are  usually  distributed  sparsely 

all  over  an  image,  and  are  the  most  likely  to  be 
divided  into  different  blocks  by  JPEG,  thus  the 
redundancies  within  these  pixels,  such  as  the 
correlation  between  the  pixels  corresponding  to  the 
two  qiposite  edges  of  the  can  in  Fig.l,  make  up  a 
large  portion  of  the  redundancies  that  can  not  be 
removed  JPEG.  On  the  other  hand,  because  high- 
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areas  are  rich  in  high  frequency  components,  the 

information  loss  at  these  positions  are  the  most 
severe  when  high  frequency  components  of  each 
individual  image  block  are  discarded. 

2.2  Compression  of  High-Dimensional  Pixels 

To  have  their  redundancies  removed,  the  gray 
levels  of  high-</^  pixels  are  extracted  and  written  in 

scanning  order  to  a  data  file  with  position 
information  discarded.  The  file  is  then  treated  as  a 
single-line  “image”  and  compressed.  Because  to  this 
“image”,  redundancies  between  segments  are  as 
important  as  redundancies  within  each  segment, 
FTC  is  chosen  to  do  the  compression  for  it  is  good  at 
searching  for  matching  between  segments. 

The  FTC  coder  we  used  in  our  experiments  was 
based  on  the  prototype  source  code  in  Chapter  6  of 
[2].  The  form  of  the  fractal  transforms  is  the 
combination  of  contrast  scaling,  luminance  shifting 
and  horizontal  flij^ng. 

2.3  JPEG  Compression  of  Whole  Image 

After  high-  pixel  gray  levels  are  compressed, 

JPEG  is  used  to  compress  the  whole  image  to  remove 
the  redundancies  among  low-cf^  pixels.  Because 

these  pixels  are  less  sensitive  to  the  loss  of  high 
frequency  components,  we  can  raise  the  compression 
ratio  accordingly.  We  can  even  perform  smoothing 
operations  to  low-</^  areas  to  further  reduce  the  data 

rate  after  JPEG. 

2.4  Transmission  of  Classification  Information 

Pixel  classification  information  has  to  be 
transmitted  to  let  the  decoder  know  if  the  d^  of  any 

pixel  is  high.  In  our  experiment,  we  did  this  by 
assigning  an  even  or  odd  quantization  value  to  the 
DCT  coefficient  at  the  corresponding  position  to 
convey  the  information  and  divide  the  elements  of 
the  Q  matrix  by  2  so  as  not  to  degrade  the  overall 
precision  of  the  quantization.  Although  such  a 
scheme  works  in  our  experiment,  it  may  cause 
undesirable  increase  to  the  data  amount  after  run- 
length  and  entropy  coding  for  “simple”  images  or  at 
high  conq)ression  ratios. 

Now  we  are  working  on  utilizing  arithmetic 
coding  to  compress  classification  information.  How 
to  make  full  use  of  a  JPEG  compressed  original 
image  in  the  compression  of  classification 
information  is  still  an  open  problem. 

3.  Decoding  Process 


The  process  at  the  decoder  side  is 
straightforward. 

After  JPEG  decoding,  gray  levels  of  high-rf^ 

pixels  in  the  decoded  image  are  extracted  using  pixel 
classification  information  and  then  used  as  the  irutial 
“image”  for  the  following  FTC  decoding  iteration. 

Our  FTC  decoding  process  is  a  little  difieient 
from  the  usual  routine,  for  now  the  decoder  has  a 
reasonably  good  estimation  of  the  “image”  to  be 
reconstructed,  and  thus  the  decoding  can  be  seen  as  a 
progressive  attempt  to  better  such  an  estimation,  one 
transform  for  the  bettering  of  one  range  segment. 
Because  the  new  “image”  is  very  complex,  some  of 
the  fractal  transforms  may  not  really  be  able  to  better 
the  estimation.  In  such  cases,  we  should  leave  the 
original  estimation  unchanged,  and  apply  only  those 
“good”  fractal  transforms. 

Whether  a  transform  is  good  can  be  determined 
by  the  encoder  using  the  collage  theorem,  by  seeing 
if  the  distance  between  domain  and  range  segments 
is  small.  The  parameters  of  the  “bad”  transforms  can 
then  be  discarded  at  the  encoder  side  and  need  not  be 
transmitted. 

After  iteration,  each  FTC  decoded  gray  level  is 
averaged  with  the  JPEG  decoded  value  at  its  original 
position.  Finally  the  resulted  gray-levels  are  written 
back  to  the  original  positions. 

4.  Experimental  Results 

We  tested  our  algorithm  with  a  320 x  320  x  8bits 
seaside  scenery  picture  (Fig.2)  which  contains 
smooth  objects  such  as  sky  as  well  as  complex  details 
in  sands,  rocks  and  sea  waves.  We  label  11297  pixels 
as  high- pixel,  which  is  about  11%  of  the  total 

number  of  pixels. 

Now  that  FTC  coder  is  used  to  only  a  small 
portion  of  pixels,  the  requirement  to  its  compression 
ratio  is  lowered.  Experimental  results  in  [4]  show 
that  the  quality  of  a  FTC  coder  is  most  sensitive  to 
block  sizes,  so  we  set  the  length  of  range  segments  to 
a  small  value,  8,  to  improve  FTC’s  quality.  Scaling 
factors  are  chosen  from  {-0.9,  -0.8,  -0.7,  0,  0.7,  0.8, 
0.9}.  At  the  decoder  side  we  only  apply  1/3  of  all 
possible  fractal  transforms,  and  the  parameters  of  the 
rest  2/3  transforms  are  discarded  at  the  encoder  side. 
The  overall  compression  ratio  of  FTC  is  about  8:1. 
Because  the  “image”  to  be  compressed  is  small,  the 
usually  lengthy  FTC  coding  can  be  done  almost  in 
real  time  without  any  software  or  hardware 
acceleration. 

The  improvement  to  RMSE  by  our  algorithm 
over  JPEG  is  listed  in  the  Table,  and  we  use  the 
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product  of  compressed  file  size  and  decoded  RMSE 
as  another  criterion  to  compare  the  performance  of  a 
compression  algorithm.  Obviously,  the  smaller  the 
product,  the  better  the  compression. 

5.  Conchisions 

It  can  be  seen  fiom  the  Table  that  although  we 
are  using  the  simplest  FTC  coder  in  our  ejqperiment, 
our  algorithm  is  still  able  to  vastly  improve  the 
RMSE  and  the  size-RMSE  product  over  the  already 
fine-tuned  JPEG. 

Our  algorithm  can  accommodate  a  number  of 
modifications,  e.g.  it  can  be  used  to  compress  still 
images  as  well  as  image  series,  and  for  the  latter,  the 
cost  of  transmitting  pixel  classification  information 
can  be  further  reduced;  contraction  restrictions  to 
finctal  transforms  can  be  relaxed,  for  now  the 
decoding  iteration  starts  from  an  reasonably  good 
initial  image  and  the  local  attrators  of  IFS  iteration 
are  thus  less  disastrous;  at  the  decoder  side,  a  feiry 
good  image  can  be  decoded  from  the  JPEG  part  of 
code  alone  when  the  FTC  part  is  not  available;  and 
finally,  the  same  FTC  code  can  be  used  with  several 
JPEG  coder  operating  at  different  compression 
ratios.  This  last  feature  is  potentially  useful  in 
networics  where  different  users  have  different  need  of 
image  quality  and  different  allocated  bandwidth. 

Our  algorithm  will  improve  RMSE  at  high- 

pixels  while  keeping  RMSE  unchanged  at  low- 

d^  areas.  Since  there  is  no  fixed  relationship  between 
a  high-r/^  pixel's  position  in  the  original  image  and 

in  the  new  "image",  as  a  result,  although  both  JPEG 
and  FTC  have  to  divide  image  into  blodts  to 
compress,  the  write  back  of  gray  levels  from  the 
decoded  new  "image"  to  their  original  positions  will 
alleviate  some  of  the  blocking  artifacts  induced  by 


JPEG,  and  thus  improve  the  subjective  quality  of  the 
whole  decoded  image. 

Our  algorithm  is  based  on  a  careful  analysis  of 
the  advantages  and  disadvantages  of  JPEG  and  FTC. 
The  extraction  of  high-</^  pixel  gray  levels  makes 

the  removal  of  their  redundancies  possible,  while  the 
smaller  size  of  the  “new”  image  and  the  JPEG  part 
of  the  code  improve  the  speed  and  quality  of  FTC. 
Although  stUl  to  be  more  extensively  tested  and  more 
carefully  studied,  our  preliminary  results  show  that 
our  2q)proach  is  a  promising  way  of  more  "wisely" 
applying  IFS  model  to  image  compression. 
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Image  File 

File  Size  (Bytes) 

RMSE  at  high-t<(/-  pixels 

Rank  of 
Size  •  RMSE 

ffEG12* 

8447 

0.033391 

282.054 

8  (Worst) 

JPEGI2+FTC 

10347 

0.025346 

262.255 

5 

JPEG14 

7300 

0.037065 

270.575 

7 

JPEG14+FTC 

9454 

0.027023 

255.475 

4 

JPEG16 

6416 

0.041376 

265.468 

6 

JPEGI6+FTC 

8650 

0.028517 

246.672 

2 

JPEG18 

5771 

0.044165 

254.876 

3 

JPEGI8+FTC 

8138 

0.030020 

244.303 

♦JPEG12  means  the  decoded  image  of  ffEG  at  a  compression  ratio  near  12. 
Table.  Comparison  of  RMSE  at  hi^-  d^  areas  of  JPEG  and  JPEG-FTC  Compresmm 
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The  data  transmission  performance 
close  to  Shannons  limit  can  be  achieved 
by  means  of  using  optimal  signal  sets 
and  their  optimal  recognition 
algorithms.  For  channels  with  additive 
white  Gaussian  noise  the  optimal 
performance  criteria  is  based  on 
Eucledean  distance. 

In  our  report  the  results  of 
investigation  of  discrete  multiplicative 
signal  sets  are  presented.  These  signal 
sets  were  proved  to  have  many  useful 
properties  for  applications  in  digital 
communication  systems. 

The  discrete  signal  sets  Sp(k), 
k=0,l,...,L,  p=0,l,...,q"  -1  are  algebraic 
multiplicative  groups.  Here  L  -  signal 
length,  q”  -  number  of  signals  in  set. 
The  signal  sets  have  symmetry  property. 
The  multitudes  of  the  signal  cross¬ 
correlation  coefficients  ,  determing  the 
set  of  Eucledean  distances  between 
signals  and  the  tolerance  to  noise,  are 
equivalent  to  multitudes  of  the  signal  dc 
components. 

The  construction  methods  of  discrete 
multiplicative  signal  sets  are  based  on 
Vilenkin-Krestenson  functions.  The 
corresponding  column  numbers  of 
Vilenkin-Krestenson  matrix,  determing 
the  signal  sets,  are  signal  address  matrix. 
The  conformity  between  signal  sets 
Sp(k)  and  groupe  codes  enable  the 
groupe  code  properties  to  be  used  for 
construction  corresponding  signal  sets 
Sp(k)  and  for  elaboration  their 
recognition  algorithms. 

There  is  connection  between  metric 
characteristic  of  discrete  binary  (  q  =  2  ) 


signals  (Eucledean  distance)  and  metric 
characteristic  of  corresponding  binary 
codes  (Hamming  distance).  In  that  case 
one  can  use  the  known  optimal  group 
codes  to  produce  digital  multiplicative 
signal  sets  with  low  value  of  cross 
correlation  coefficients. 

The  construction  procedure  of 
original  class  of  discrete  multiplicative 
signal  sets  was  performed.  The 
constructed  signals  are  equivalent  to 
discrete  frequency-modulated  signals, 
investigation  of  the  signal  cross- 
correlation  coefficient  properties  have 
found  many  of  these  signal  sets 
practically  perfect  for  criteria  used 
optimal  Eucledean  distance. 

It  was  proved  the  optimal  signal 
recognition  procedures  can  be 
implemented  by  means  of  Vilenkin- 
Krestenson  functions  spectral 
techniques,  the  analysed  function  is 
formed  with  help  of  address  matrix.  For 
realisation  of  recognition  procedures 
one  can  use  algorithms  of  fast  spectral 
transforms  more  computationally 
efficient  (  ~n  •  q"  )  than  immediate 
spectral  components  computation 
(  ~  L  q“  ).  For  signal  sets  corresponding 
Bose-Chaudhuri-Hocquenghem  codes 
these  optimal  signal  recognition 
procedures  have  been  modified  so 
that  the  required  amount  of  storage 
(q  *,q‘-’  <  L  <  q*  )  and  the  number  of 
operations  (  ~  q"  log  q  L  )  are  decreased 
significantly  relatively  known  optimal 
procedures.  These  recognition 
procedures  are  found  to  be  used  for 
recognition  of  common  multiplicative 
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signal  sets. 

Suboptimal  signal  recognition 
procedures  have  been  designed  and 
investigated  for  broad  class  of  discrete 
multiplicative  signal  sets.  One  can 
realize  these  procedures  by  means  of 
using  the  q®  «  q“  spectral  components 
and  the  required  amount  of  storage  and 
the  number  of  operations  are  reduced 
greatly  relatively  known  procedures 
but  without  significant  degrading  the 


tolerance  to  noise. 

The  results  of  computer  simulation 
tests  of  developed  optimal  and 
suboptimal  signal  recognition 
algorithms  of  constructed  signal  sets  are 
presented.  The  results  of  comparison  of 
tolerance  to  noise  and  of  realisation 
complexity  by  means  of  digital 
techniques  relatively  known  algorithms 
of  algebraic  decoding  are  presented  too. 
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INTRODUKTION 

The  generation  of  extremely  low  frequencies  often  is 
realized  by  means  of  mixing  of  two  low  frequencies. 
Because  of  the  inevitable  interconnection  between 
both  generators  problems  arise  covered  by  the 
rheolinear  system  theory  and  leading  to  consequences 
as  described  in  the  paper.  The  in  principle  same 
problems  arise  with  frequency  modulation  as  also 
shown  in  the  paper. 

2.  GENERATION  OF  EXTREMELY  LOW 
FREQUENCIES  BY  MIXING 


(1) 

as  shown  in  Fig.  1  with  the  relative  error 

F/  iiAcogl  A(of.  (2) 

Furthermore  the  output  is  not  sinusoidal;  the  distortion 
is  increasing  with  decreasing  Aco,  leading  for 
Am  «  Am^  (near  the  synchronisation  limit)  in  the 
extreme  case  to  bootstrap  oscillations  with  Am^^^^  as 
given  by  Equ.  (1).  Another  application  of  the  theory  is 
frequency  modulation. 


Mixing  two  frequencies  m^  and  m2  the  difference 
frequency  Am  =  m^  -  m2  appears  due  to  modulation 
theory.  Because  of  the  inevitable  coupling  of  the 
generators  rheolinear  theory  must  be  applicated 
leading  to  the  result  that  with  the  synchronizing  range 
Am^  the  real  difference  frequency  is  (Woschni  1990; 
Barkhausen  and  Woschni  1956) 


Fig.  1  Real  ( — )  and  ideal  ( — )  difference  Frequency;  x 
measured  values 


3.  FREQUENCY  MODULATION 

The  implication  of  the  solution  of  the  differential 
equation  of  a  sinusoidal  modulated  oscillator 
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leads  to  the  unequation 


AQ  m 


cos  (ot  cos 


AQ 

-  —  cos  (ot 
(0 


^  0 


(3b) 


showing  that  problems  are  increasing  with  increasing 
AQm 


Q  ^ 


(4) 


As  given  by  the  map  of  Ince-Strutt  (Fig.2) 
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synchronizing  effects  appear  in  the  synchronizing 
ranges  of  the  order 

2%l(i^^r.  (5) 
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Pig.  3  Frequency  nio(IiiIn(ion 


DISCUSSION  AND  CONCLUSIONS 

The  generation  of  low  frequencies  by  means  of  mixing 
is  limited  by  synchronizing  effects  due  to  rhcol incar 
tlieory.  Especially  here  frequency  errors  arc  (he 
consequence.  Similar  limitations  arise  In  frequency 
modulation  willi  reference  to  the  limiting  value  of  tiic 
modulation  frequency  and  the  relative  frequency 
swing.  Moreover  distortions  arise,  increasing  with  de¬ 
creasing  difference  frequency. 
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ABSTRACT 

This  paper  presents  a  new  architecture  for  self¬ 
routing  photonic  switching  with  self¬ 
synchronization  and  optical  processing.  Two  wave¬ 
lengths  are  used  to  carry  the  clock-data  and  ad¬ 
dress  signals,  respectively.  This  in  turn  allows  to 
independently  process  both  signals  at  each  switch¬ 
ing  node.  Thus,  the  complexity  of  optical  routing 
controllers  is  reduced  while  the  reliability  of  ad¬ 
dress  decoding  is  significantly  improved. 

1.  INTRODUCTION 

Photonic  switching  is  playing  an  important 
role  in  broadband  communication  networks.  At 
present,  conventional  electronic  processing  and 
routing  control  are  normally  used  in  switching  net¬ 
works,  which  wiU  impose  a  data 'flow  bottleneck 
on  the  routing  controller  of  each  switching  node. 
To  achieve  an  ultrahigh  throughput  (e.g.,  ^  100 
Gbit/s),  optical  signal  processing  and  optical  rout¬ 
ing  control  will  be  widely  used  in  future  photonic 
switching  [1].  Self-routing  switching  is  suitable  for 
multistage  networks,  because  self- routing  control 
is  totally  distributed,  and  the  outgoing  link  which 
the  input  packet  wiU  follow  is  determined  by  its 
destination  address  bit  [2] -[4].  Self- routing  can  be 
performed  packet-by-packet  in  real  time  by  setting 
the  state  of  a  switching  element  during  the  entire 
packet,  i.e.,  cross  or  bar  state.  Thus,  it  can  handle 
very  high-speed  data  streams. 

Self-routing  photonic  switching  with  optical  sig¬ 
nal  processing  has  been  demonstrated  [1][2].  The 
use  of  self-clocked  packet-header  encoding  scheme 
can  eliminate  the  requirement  of  synchronization 


for  decoding  the  destination  address  of  the  input 
packet  at  optical  routing  controllers.  Therefore, 
it  can  simplify  the  network  design  and  allow  each 
switch  to  operate  asynchronously  [1]. 

In  this  paper,  we  propose  a  new  architecture 
for  ultrafast  self- routing  photonic  switching  with 
self- synchronization,  which  is  bcised  on  the  two- 
wavelength  scheme.  By  using  the  proposed  tech¬ 
nique,  the  complexity  of  optical  routing  controllers 
is  reduced,  but  the  reliability  of  address  decoding 
is  significantly  improved. 

2.  SELF-ROUTING  PHOTONIC 
SWITCHES  USING  TWO 
WAVELENGTHS 

Figure  1  illustrates  the  block  diagram  of  the  pro¬ 
posed  photonic  switching.  It  comprises  three  ba¬ 
sic  parts.  The  first  part  is  an  electrooptic  (EO) 
switch  which  normally  operates  in  one  of  two  pos¬ 
sible  states  (i.e.,  cross  or  bar),  set  by  the  electronic 
control  signal  [1].  The  second  part  is  an  optical 
routing  controller  consisting  of  an  optical  address 
decoder  and  a  gate  controller.  The  optical  rout¬ 
ing  controller  optically  decodes  the  incoming  des¬ 
tination  address  on  a  packet-by-packet  basis,  and 
therefore,  sends  the  electronic  control  signal  to  set 
the  appropriate  state  of  an  EO  switch  according 
to  the  output  of  the  address  decoder.  The  last 
part  is  an  optical  phase  adjuster  which  is  used  to 
compensate  the  propagation  time  difference  due  to 
using  two- wavelength  lights.  It  is  also  employed  to 
match  the  processing  delay  of  the  optical  routing 
controller  at  each  switching  node. 

We  use  two  diflferent  wavelengths  to  carry  the 
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clock-data  signal  at  wavelength  Ai  and  the  address 
signal  at  wavelength  A2,  respectively.  In  order  to 


Figure  1:  Block  diagram  of  the  proposed  pho¬ 
tonic  switching. 

achieve  self-routing,  the  destination  address  of  each 
packet  is  encoded  in  a  packet  header  of  length  T  by 
using  the  optical  pulse-interval  coding  scheme  [1]. 
This  can  be  implemented  at  each  transmitter  by 
sending  a  short  optical  pulse  of  width  T/N  in  the 
specific  time  slot  with  respect  to  an  optical  clock 
pulse  of  T/N  being  always  in  the  JV-th  slot  within 
a  frame,  as  shown  in  Figure  2. 

Let  //V  be  a  positive  integer  set  of  which  the  ele¬ 
ments  are  less  than  or  equal  to  N.  The  time  slots  of 
a  frame  assigned  to  all  the  possible  addresses  which 
must  correspond  to  the  cross  state  of  an  EO  switch 
form  the  switched-output  set  S  [1],  i.e.,  S  = 
{a,-  I  the  a,-th  slot  (per  frame)  corresponds  to  the 
cross  state,  i  =  1,2, ..  .,M}  e  In,  where  1  <  oi  < 
02  <  . . .  <  Cm  <  N  and  M  <  N.  As  a  result, 
all  the  remaining  time  slots  of  a  frj,me  form  the 
unswitched-output  set  S,  i.e.,  SUS  =  In-  In  or¬ 
der  to  eliminate  the  following  data  bits  from  the  ad¬ 
dress  decoding,  a  guard  interval  At  =  (N  —  ai)T/N 
is  issued  between  the  header  and  data  sign^s  as 
shown  in  Figure  2. 


I  J  a  4 . tf 


Figure  2:  Structure  of  a  packet  header  using 
optical  pulse-interval  coding  scheme. 

To  efficiently  perform  self-synchronization  for 
decoding  addresses  at  each  switching  node,  wave¬ 
length  A2  is  used  to  carray  an  address  pulse  if 
a  packet  is  issued  (see  Figure  2).  Then  both 
clock-data  and  address  signals  are  transmitted  to  a 


switching  node  by  using  wavelength-division  mul¬ 
tiplexing  (WDM)  to  efficiently  share  the  common 
fiber.  In  order  to  avoid  effects  of  fiber  nonlineari¬ 
ties,  the  transmitted  optical  power  on  the  fiber  can 
be  reasonable.  However,  the  optical  power  budget 
could  be  improved  if  optical  amplifiers  are  used. 

At  each  switching  node,  the  incoming  optical 
clock-data  and  address  signals  are  separated  by  us¬ 
ing  a  WDM  demultiplexer  (DEMUX)  whose  out¬ 
put  is  split  into  two  parts  (see  Figure  1).  One  is 
sent  to  the  optical  address  decoder  to  read  the  des¬ 
tination  address  of  the  incoming  packet.  The  other 
is  fed  into  a  fiber-optic  phase  adjuster. 

An  optical  address  decoder  consists  of  1  x  M 
power  splitter,  M  fixed  optical  delay  lines,  op¬ 
tical  buffer  (i.e.,  a  fixed  optical  delay  line),  and 
(Af-f- 1)  X  1  wavelength-insensitive  power  combiner 
(i.e.,  a  gating  summer)  as  shown  in  Figure  1.  The 
f-th  delay  line  produces  a  delay  of  (N  —  ai)T/N 
with  respect  to  an  input  optical  address  pulse  at 
the  address  decoder,  such  that  the  delayed  address 
pulse  should  superimpose  on  the  phase-adjusted 
clock  pulse  by  optical  buffer  1  in  the  iV-th  time 
slot  at  the  output  of  an  optical  address  decoder  if 
and  only  if  the  input  address  pulse  is  located  in 
the  slot  a,-  €  S,  and  therefore,  a  double-amplitude 
pulse  results.  This  is  because  the  maximum  delay 
for  address  pulses  is  limited  to  (N  —  ai)T/N  and  a 
guard  interval  {N  -  ai)T/N  is  issued  between  data 
field  and  packet  header,  which  in  turn  completely 
eliminates  the  following  data  bits  from  the  ad¬ 
dress  decoding.  When  this  double-amplitude  pulse 
is  threshold  detected,  the  gate  controller  (e.g.,  a 
wideband  optical  receiver)  sends  a  gate  pulse  of  du¬ 
ration  Tp  to  set  the  EO  switch  in  the  cross  state, 
otherwise  the  input  address  belongs  to  5r  Then 
the  EO  switch  is  set  in  the  bar  state  because  of  no 
gate  pulse.  The  entire  packet  is  thus  routed  to  the 
unswitched  output.  Recently,  an  EO  switch  oper¬ 
ating  in  the  wavelength  range  of  1.52  fj,m  to  1.57 
fim  has  been  reported  [5].  This  can  substantially 
support  the  proposed  two-wavelength  scheme. 

Normally,  a  guard  interval  is  also  required  for 
the  switching  architecture  given  in  [1]  to  pre¬ 
vent  the  data  pulses  from  the  address  decoding  if 
the  simple  optical  delay-line  processing  technique 
is  used,  otherwise  the  more  complex  address  de¬ 
coder  or  gate  controller  would  be  used.  However, 
this  may  prevent  the  ultrafcist  processing  because 
such  implementation-related  problems  impose  the 
throughput  bottleneck  at  the  interfaces.  In  this  pa¬ 
per,  we  use  two  wavelengths  for  photonic  switching, 
so  it  allows  to  independently  process  the  clock-data 
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and  address  signals,  respectively.  The  decoding  re¬ 
liability  is  then  significantly  improved  compared 
with  that  scheme  used  in  [1].  Furthermore,  the 
proposed  architecture  for  photonic  packet  switch¬ 
ing  does  not  require  an  inhibiting  summer  and  the 
related  optical  delay  lines  as  employed  in  [1].  Thus, 
the  complexity  of  routing  controllers  is  reduced.  If 
M  is  far  less  than  N,  this  new  architecture  would 
be  much  simpler  than  that  given  in  [1]. 

The  use  of  two  wavelengths  results  in  a  dififer- 
ence  between  the  propagation  speeds  of  both  lights 
in  the  optical  fiber.  If  the  propagation  speed  of  the 
light  at  wavelength  Ai  is  assumed  to  be  faster  than 
that  of  address  light  at  wavelength  A2  (this  is  al¬ 
ways  true  in  practical  cases),  we  can  use  an  optical 
buffer  in  the  clock-data  path  before  a  (M  -t- 1)  X  1 
combiner  to  fuUy  compensate  the  time  difference 
caused  by  two  different  propagation  speeds  in  the 
fiber  which  is  used  just  before  the  present  switching 
node.  Doing  so,  the  output  address  pulse  from  the 
zero  delay  line  (assuming  =  iV)  and  the  phase- 
adjusted  clock-data  signal  from  optical  buffer  1 
maintain  the  same  phase  relationship  at  the  gate 
controller  eis  they  were  just  sent  from  the  transmit¬ 
ter  as  shown  in  Figure  2. 
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Figure  3:  Multistage  photonic  switching  net¬ 
work  using  the  proposed  switching  architec¬ 
ture  with  equal-length  input  links  to  the  same 
node. 

To  eliminate  the  accumulated  time  difference 
between  two-wavelength  lights  through  multistage 
switches,  an  optical  phase  adjuster  comprising  two 
fixed  optical  delay  lines  (i.e.,  buffers  2  and  3)  and 
a  WDM  coupler  is  used  to  compensate  the  propa¬ 
gation  time  difference  caused  by  the  input  link  of 
each  switching  node  as  shown  in  Figure  1.  As  a 
result,  the  recombined  clock-data  and  address  sig¬ 
nals  at  the  input  of  an  EO  switch  has  the  same 
phase  relationship  as  given  in  Figure  2.  The  fiber 
lengths  of  the  phase  adjuster  shall  be  also  chosen 
to  match  the  processing  delay  introduced  by  the 
routing  controller  to  synchronize  the  arrival  of  the 
recombined  entire  packet  with  a  gate  pulse  at  the 
EO  switch.  Thus,  the  input  links  of  a  switching 


node  should  have  equal  length.  The  use  of  a  com¬ 
mon  phase  adjuster  can  then  fuUy  compensate  the 
propagation  time  difference  from  any  input  link  of 
a  switching  node  (see  Figure  3).  However,  this  re¬ 
quirement  can  be  removed  if  multiple  optical  phase 
adjusters  are  used  before  each  switching  node,  as 
shown  in  Figure  4.  Each  of  the  phase  adjusters  is 
designed  to  compensate  the  propagation  time  dif¬ 
ference  caused  by  its  own  input  link. 


Figure  4:  Photonic  switching  node  using  ex¬ 
ternal  fiber-optic  phase  adjusters. 


3.  SIMPLE  PERFORMANCE 
ANALYSIS 

In  this  section,  we  simply  analyze  the  receiver 
sensitivity  of  gate  controllers  and  the  optical  power 
budget  of  the  proposed  photonic  switching. 

For  the  convenience  of  analysis,  we  consider 
the  single-wavelength  case.  It  is  because  two 
wavelengths  can  be  chosen  closely  and  the  per¬ 
formance  difference  between  two-wavelength  and 
single-wavelength  systems  would  be  insignificant. 

Assume  that  optical  clock  and  address  pulses 
have  identical  pulsewidth  T/N  and  the  same  peak 
power  (see  Figure  2).  By  using  the  similar  ap¬ 
proach  given  in  [6],  the  minimum  peak  optical 
power,  which  is  required  by  the  gate  controller  of  a 
switching  node  to  guarantee  a  given  bit  error  rate 
(BER),  can  be  written  as 


where  Q  =  6  for  BER  =  1  x  10  Uq  — 
2eB{Idu  +  G^F{G)Idm}+  <  nl  >,  Ui  = 
(r+2)  _  2eRG^FiGy 

1— cos(27rTpB)  1  rr  _  (2r+l)C^i  t-t-  _ 

TT^Xp  P  ^2  -  r+2  ^ 
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2RGp^j-jT,B  smMdy^  and  [6].  Us¬ 

ing  the  parameters  of  InGaAsP  APD  and  GaAs 
MESFET  given  in  [7],  we  can  calculate  the  receiver 
sensitivity  Sr  for  a  chip  rate  of  N/T  =  10  Gbit/s, 
e.g.,  Sr  is  about  -22.5  dBm. 

From  Figure  1,  the  power  loss  LOS\  between 
the  input  of  WDM  demultiplexer  and  the  output 
of  EO  switch  at  a  switching  node  can  be  given  by 

LOS\  =  L^de  +  L-WCO  +  Leos  +  3  (dB)  (2) 

where  iwe,  Lyjco^  and  Leos  denote  optical  inser¬ 
tion  power  losses  of  WDM  demultiplexer,  WDM 
coupler,  and  EO  switch,  respectively.  Note  that 
the  power  losses  of  optical  fibers,  fiber  splice,  and 
optical  buffers  at  switching  nodes  are  neglected. 

The  power  loss  LOS2  between  the  inputs  of 
WDM  demultiplexer  and  gate  controller  at  a 
switching  node  is  written  as 

LOS2  =  L'wde  +  lOlogio-M  -f- 

-bl01ogio(M-|-l)-b3  (dB)  (3) 

where  lOlogjoM  and  lOlogio(M-l-l)  represent  the 
power  losses  of  a  1  x  M  optical  splitter  and  a  (M -f 
1)  X  1  optical  combiner,  respectively. 

If  once  the  Sr  is  obtained,  the  number  of 
photonic-switching  stages  J,  (see  Figure  3),  lim¬ 
ited  by  the  power  losses,  for  a  given  peak  optical 
output  power  Ap  from  a  transmitter  can  be  given 
by 

where  “[xj”  denote  the  largest  integer  <  x. 

For  practical  applications,  the  available  Ap  is 
substantially  determined  by  the  laser  diodes  used 
at  the  transmitting  end,  and  is  also  limited  by  op¬ 
tical  nonlinearities  in  the  fiber. 

4.  CONCLUSION 

We  have  presented  a  new  architecture  to 
ultrafast  self-routing  photonic  switching  with 
self-synchronization,  which  is  based  on  a  two- 
wavelength  scheme.  The  use  of  the  proposed  tech¬ 
nique  can  improve  the  reliability  of  address  decod¬ 
ing  and  can  reduce  the  complexity  of  optical  rout¬ 
ing  controllers,  because  clock-data  and  address  sig¬ 
nals  are  independently  processed  at  each  switching 
node.  An  alternative  design  of  photonic  switching 
is  to  carry  a  packet  header  with  any  feasible  cod¬ 
ing  scheme  at  wavelength  A2  in  parallel  with  the 


data  packet  at  Ai.  In  this  case,  it  allows  to  use 
much  wider  pulses  for  the  packet  header  to  enable 
ultrafast  signal  processing.  Therefore,  we  can  use 
mature  optical  receivers  with  bandwidths  around 
10  GHz  in  this  type  of  photonic  switches. 

We  have  also  analyzed  the  receiver  sensitivity  of 
gate  controllers  at  switching  nodes  and  considered 
the  optical  power  budget  of  the  proposed  photonic 
switching.  A  very  simple  equation  has  been  pre¬ 
sented  to  determine  the  number  of  switching  stages 
for  given  peak  output  power  of  an  optical  transmit¬ 
ter  and  receiver  sensitivity. 
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1.  INTRODUCTION 

In  this  paper,  the  effects  on  the  increase  in  capacity 
of  a  desired  cell  in  a  CDMA  cellular  system  has  been 
investigated.  The  model  used  is  the  same  as  that 
used  in  [1]  and  refer  to  fig.l,  the  total  mobile  power 
from  a  cell  having  N  mobile  stations  uniformly 
distributed  in  it  to  a  base  station  at  distance  d=kR  is 


P{d)  = 


kR  ‘ 


rn 

Jo 


de 


0  ^ 
+  Idr  co&6\‘^ 


■dr 


(1) 


When  /m=3,  no  analytical  solution  can  be  found  and 
the  authors  evaluate  equation  (1)  numerically  by  first 
expanding  the  function  inside  using  binomial 
expansion. 

Denoting  the  total  interference  from  the  first  «-tiers 
of  cells  surrounding  the  desired  cell  as  leA-ctNPJin) 
and  assume  that  a  CDMA  cell  can  serve  N  users  at 
the  minimally  required  bit  error  rate  level.  Then  the 
required  C/I  is  given  by 

f— 1  = - ^ - = - 1 -  (4) 

(A^-l)a/’c+4xt  {N-\)  +  Nm 


where  a  is  the  voice  activity  factor; 

is  the  power  of  a  CDMA  user  received  at 
the  base  station  of  its  own  cell; 

m  is  the  path  loss  exponent  typically  ranging 
between  2  and  4. 


The  case  for  w=4  has  been  evaluated  analytically  [1] 
and  the  integral  is  equal  to 


P{d^kR)\^^^=laNP, 


(2) 


When  the  cell  size  decreases,  however,  the  fourth 
order  exponential  path  loss  assumption  is  no  longer 
valid  and  the  authors  have  investigated  the  cases  for 
smaller  values  of  m.  For  m=l,  the  analytical  solution 
is  found  to  be: 


=  2aNP^ 


2 


(3) 


2.  EFFECTS  OF  INCREASED  CAPACITY 
ON  C/I 


When  the  number  of  users  of  the  desired  cell  is 
increased  by  a  factor  of  AN,  the  C/I  ratio  for  the 
users  in  the  desired  cell  becomes 


'^1= _ ! _ 

~l)  [Ar(l  +  AA^)-l]  +  M(«) 


(5) 


The  degradation  in  performance  is  given  by 

A[y][rfS]  =  101Og[y]  [dB]  -  101Og[y][^fB] 


req 


=  lOlog 


1  + 


NBN 


« 10  log 


1  + 


l  +  /(n) 


(6) 


assuming  AT »  1.  Then  AC/I  only  relates  to  AN,  the 
number  of  tiers  n  and  of  course  the  propagation  path 
loss  factor  m. 
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3.  EFFECTS  OF  INCREASED  CAPACITY  ON 
THE  NUMBER  OF  USERS  IN  THE 
SURROUNDING  CELLS 


Maintaining  the  same  required  C/I  ratio,  when  users 
are  increased  by  a  factor  of  AN  in  the  desired  cell, 
the  number  of  users  in  each  of  the  surrounding  cells 
will  have  to  satisfy 


'£)  = _ ^ 1 _ 

[N{l  +  AN)-l]  +  N,I{n) 


(7) 


From  (4)  and  (7),  we  have 
an' 


1-- 


/(«) 


N 


(8) 


where  l  -  >0  or  AN  <  l(n)  (9) 

I{n) 

The  maximum  of  AN  is  I{ri).  In  this  case,  the  number 
of  users  in  each  of  the  surrounding  cells  will  all 
reduce  to  zero.  The  desired  cell  becomes  a  single  cell 
system  to  maintain  the  required  C/I  ratio.  When  AN 
>  I{n)  the  C/I  ratio  for  the  users  in  the  desired  cell 
will  reduce  even  without  out-of-cell  interference. 


4.  NUMERICAL  RESULTS  AND 
DISCUSSIONS 

Equation  (5)  has  been  plotted  as  in  fig.2.  The  cases 
for  n=l,  2,  3  and  100  tiers  of  cells  have  been 
examined  with  N=100.  The  C/I  decreases  as  AN 
increases  as  more  in-cell  interference  has  been 
introduced.  For  the  same  path  loss  factor  m  and  AN, 
the  C/I  is  always  worse  when  n  is  larger.  This  can  be 
understood  because  more  outside-cell  interference 
has  been  considered.  For  the  same  n  and  AN,  the  C/I 
is  also  better  for  a  larger  value  of  m.  This  is  due  to 
the  smaller  impact  of  the  outside-cell  interference 
when  the  path  loss  is  large.  Fig.  3  plots  the  C/I  ratio 
as  a  function  of  AN  with  N=40  to  200  users  in  steps 
of  40  users  for  m=2,  3  and  4,  assuming  that  only  1- 
tier  of  cells  is  present.  In  general,  the  CA  decreases 
with  a  similar  slope  in  all  the  cases,  the  change  in  CA 
is  insensitive  to  N.  The  CA  is  smaller  when  N  is 
larger  because  the  interference  is  increased. 


Table  I  shows  the  percentage  change  in  the  number 
of  users  of  the  surrounding  cells  to  maintain  the  same 
CA  ratio  of  the  desired  cell  when  the  number  of  users 
has  been  risen  by  AN.  When  the  number  of  tiers  of 
cells  has  been  increased,  the  percentage  change  in  the 
number  of  users  in  the  surrounding  cells  decreases  as 
the  effect  has  been  spread  among  a  larger  number  of 
cells.  For  values  less  than  -100  in  the  table,  it  merely 
indicates  that  the  CA  ratio  in  the  desired  cell  will 
decrease  even  if  the  number  of  users  in  the 
surrounding  cells  has  been  reduced  to  zero.  For  the 
same  n  and  AN,  the  impact  on  the  surrounding  cells 
is  smaller  when  the  path  loss  factor  is  smaller.  A 
smaller  path  loss  factor  indicates  larger  outside-cell 
interference  which  implies  a  larger  Nc  as  given  in 
equation  (8). 


5.  CONCLUSIONS 

In  this  paper,  the  effects  of  the  increased  capacity  of 
a  desired  cell  in  a  CDMA  cellular  system  has  been 
analysed.  With  propagation  path  loss  factor  of  2  or 
4,  analytical  functions  have  been  derived  on  the 
change  in  CA  ratio  of  the  desired  cell  and  also  the 
change  in  the  number  of  users  in  the  surrounding 
cells  to  maintain  the  same  CA  ratio  in  the  desired  cell. 
Approximate  numerical  results  are  obtained  when  the 
path  loss  factor  is  3. 
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AN 

number 

of 

tiers 

n=  1 

n  =  2 

n  =  3 

n  =  4 

o 

o 

1 

10% 

-11.06 

-7.33 

-5.99 

-5.27 

-2.12 

-33.17 

-21.98 

-17.97 

-15.81 

-6.36 

-55.29 

-36.63 

-29.95 

-26.36 

-10.61 

100% 

-110.57 

-73.27 

-52.72 

-21.22 

(a)  m=2 


AN 

number 

of 

tiers 

n=  1 

n  =  2 

n  =  3 

n  =  4 

10% 

-21.26 

-17.29 

-16.02 

-15.39 

-13.61 

30% 

-63.77 

-51.87 

-48.05 

-46.17 

50% 

-106.28 

-86.45 

-80.09 

-76.95 

100% 

-212.56 

-172.90 

-160.17 

-153.89 

(b)  m=3 


AN 

number 

of 

tiers 

n=  1 

II 

c 

n  =  3 

n  =  4 

n=100 

10% 

-35.19 

-32.10 

-31.36 

-31.06 

30% 

-105.58 

-96.29 

-94.07 

-93.19 

-91.89 

50% 

-175.97 

-160.48 

-156.78 

-155.31 

-153.15 

100% 

-351.93 

-320.97 

-313.56 

-310.63 

(c)  m=4 

Table  I.  Percentage  change  in  the  number  of  users  in  the  surrounding  cells  to  maintain  the  required  C/I 
ratio  when  users  are  increased  by  a  factor  of  AN  in  the  desired  cell  (a)  m=2,  (b)  m=3,  (c)  m=4. 


Fig.  1 .  Plot  for  calculation  of  the  interference  power 
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Fig.  2.  Plot  of  C/I  ratio  as  a  function  of  A  N  (%)  Fig.  3.  Plot  of  C/I  ratio  as  a  function  of  A  N  (%) 

for  various  number  of  tiers  of  cells  and  propagation  with  N=40~200  users/cell  and  only  1  tier  of  cells 

path  loss  exponent  m  (a)  m=2,  (b)  m=3  and  for  (a)  m=2,  (b)  m=3  and  (c)  m=4 

(c)  m=4 
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ABSTRACT 

This  paper  investigates  the  decomposition  of  mono¬ 
chrome  images  into  amplitude  and  phase  functions  and 
their  reconstruction.  The  decomposition  is  based  on 
single  quadrant  complex  signals.  Several  examples  of 
test  images,  their  amplitude  and  phase  functions  and 
reconstructed  images  are  shown. 

1.  INTRODUCTION 

The  images  are  represented  by  the  function  u{xx,X2), 
where  X\^X2  are  Cartesian  coordinates.  The  amplitudes 
and  phases  of  the  function  u{xi,X2)  are  defined  by  the 
2-D  complex  signals  with  a  nonzero  spectrum  only  in 
a  single  quadrant  of  the  spatial  frequency  space,  as  de¬ 
fined  in  [1].  Four  such  complex  signals  ,  ^3 ,  ^4 

could  be  defined.  However,  signals  and  ^2y^4  are 

complex  conjugate  so  two  amplitude  functions  Ai  (xi,  3:2), 
M(3^i,X2)  and  two  phase  functions  <^i  (a:  1, 0:2),  ,  2^2) 

are  necessary  to  represent  all  four  signals  (1) 

(a;i ,  X2)  =  Ai  (aji ,  X2)  expU‘i^i  {xy ,  2:2)) 

^^2(xi,X2)  -  ^2(2:1, 2:2)  exp{j<j)2{Xi,X2)) 

^3{xi,X2)  =  Ai{xx,X2)exp{-j(l)i{xuX2))  (1) 

%(xi ,  2^2)  =  ^2(2:1 ,  X2)  exp{-j(l)2{xi ,  2:2)) 

The  original  function  Ti(2;i,X2)  can  be  reconstructed 
from  its  amplitude  and  phase  functions  [1]  using  the 
formula  (2) 

ti(xi,X2)  =  0.5[Aicos(<^i(xi,X2))  + 

-\-A2Cos{4>2[xi,x2))]  (2) 

However,  the  reconstruction  may  not  be  complete  since 
the  DC  term  (mean  value)  and  the  ”edge”  samples  are 
lost  during  decomposition  [2]. 


2.  AMPLITUDE  AND  PHASE  PATTERNS 
CALCULATION  ALGORITHM 

The  complex  signals  could  be  calculated 

by  using  either  circular  convolution  [2]  or  Discrete  Fourier 
Transform.  Both  algorithms  deliver  exactly  the  same 
patterns.  The  second  method  has  been  chosen.  The  in¬ 
put  function  is  transformed  with  the  use  of  FFT,  then 
the  proper  quadrant  of  the  spectrum  is  selected  by  ze¬ 
roing  all  other  quadrants  and  inverse  FFT  is  calculated. 
Two  complex  signals:  with  nonzero  spectrum  in  the 

first  quadrant  and  ^2  with  nonzero  spectrum  in  second 
quadrant  are  calculated  in  this  way.  The  reconstruc¬ 
tion  of  the  input  signal  u(xi,X2)  given  by  formula  (2) 
is  modified.  The  constant  component  Uqo  of  the  func¬ 
tion  u(xi ,  X2)  is  stored  during  its  spectrum  calculation 
and  added  to  the  reconstructed  function  Urec(a:i, 2:2) 

(3) 

Urec{x\,X2)  =  0.5[Ai  cos(<;6i (xi , X2))  +  (3) 

+A2  cos((?i2(a:i ,  X2))]  +  Uoo 

3.  EXPERIMENTAL  RESULTS 

The  algorithm  presented  in  the  previous  section  was 
used  to  calculate  the  amplitude  and  phase  patterns  of 
several  test  images.  Two  examples:  well  known  image 
Lenna  and  random  checkerboard  are  shown  on  Figures 
l-fl2.  The  grey  levels  in  the  images  representing  ampli¬ 
tude  patterns  were  assigned  as  follows:  the  whole  range 
of  amplitude  values  in  certain  pattern  was  uniformly  di¬ 
vided  into  256  subranges,  the  lowest  amplitude  values 
(belonging  to  the  first  subrange)  were  assigned  black 
colour  and  the  highest  white  colour.  The  range  [— 7r,7r] 
of  phase  values  was  uniformly  divided  into  256  sub¬ 
ranges  and  again  the  lowest  phase  values  were  assigned 
black  colour  and  the  highest  white  colour.  The  zero 
phase  angle  is  represented  by  grey  colour  (level  128). 
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Figure  3:  Amplitude  Ai  of  the  Lenna 


Figure  6:  Phase  <j)2  of  the  Lenna 


Figure  7:  Original  randoin  checkerboard 


Figure  8:  Reconstructed  checkerboard 


J  >  S  ^^/  -  V  ^^.  J><.  >  ^  »v  r 

I';  >  s'  ^  '  ''s  '  J 

|i,  ;.V'  ''S  /'-J 

'1:  <: '' r;,  ;:d 


Figure  10;  Arnph  A 2  of  the  checkerboard 


Figure  11:  Phase  6^  of  the  checkerboard 


Figure  9:  Ani|)l.  Ai  of  the  checkerboard 


Figure  12:  Phase  (^2  of  the  checkerboard 


4.  CONCLUSIONS 


It  has  been  shown  that  2-D  signals  may  be  decomposed 
into  amplitude  and  phase  patterns.  The  original  image 
may  be  reconstructed  using  these  patterns.  The  algo¬ 
rithm  presented  in  this  paper  is  a  straightforward  one 
so  it  can  create  some  distortions  in  the  reconstructed 
images.  These  distortion  are  caused  by  the  elimination 
of  the  edge  samples  of  the  2-D  discrete  spectrum.  The 
possibilities  of  the  algorithm  modification  eliminating 
these  distortions  will  be  further  studied.  The  appli¬ 
cation  for  amplitude  and  phase  patterns  will  also  be 
looked  for. 
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